GitHub topics: localllama
awaescher/OllamaSharp
The easiest way to use the Ollama API in .NET
Language: C# - Size: 26.6 MB - Last synced at: 5 days ago - Pushed at: 14 days ago - Stars: 935 - Forks: 125

mostlygeek/llama-swap
Model swapping for llama.cpp (or any local OpenAPI compatible server)
Language: Go - Size: 552 KB - Last synced at: 6 days ago - Pushed at: 9 days ago - Stars: 537 - Forks: 31

SqueezeAILab/KVQuant
[NeurIPS 2024] KVQuant: Towards 10 Million Context Length LLM Inference with KV Cache Quantization
Language: Python - Size: 19.8 MB - Last synced at: 6 days ago - Pushed at: 8 months ago - Stars: 339 - Forks: 30

sozercan/aikit
🏗️ Fine-tune, build, and deploy open-source LLMs easily!
Language: Go - Size: 4.6 MB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 441 - Forks: 39

moritztng/fltr
Like grep but for natural language questions. Based on Mistral 7B or Mixtral 8x7B.
Language: Rust - Size: 63.5 MB - Last synced at: 6 days ago - Pushed at: about 1 year ago - Stars: 377 - Forks: 8

BrutalCoding/aub.ai
AubAI brings you on-device gen-AI capabilities, including offline text generation and more, directly within your app.
Language: Dart - Size: 119 MB - Last synced at: about 10 hours ago - Pushed at: 12 months ago - Stars: 279 - Forks: 25

poloclub/wordflow
Social and customizable AI writing assistant! ✍️
Language: TypeScript - Size: 16.7 MB - Last synced at: 12 days ago - Pushed at: 10 months ago - Stars: 231 - Forks: 30

sozercan/kubectl-ai
✨ Kubectl plugin to create manifests with LLMs
Language: Go - Size: 243 KB - Last synced at: 11 days ago - Pushed at: 3 months ago - Stars: 1,085 - Forks: 84

yankeexe/llm-rag-with-reranker-demo
LLM RAG Application with Cross-Encoders Re-ranking for YouTube video 🎥
Language: Python - Size: 387 KB - Last synced at: 4 days ago - Pushed at: 3 months ago - Stars: 58 - Forks: 27

awaescher/OllamaSharpConsole 📦
Full featured demo application for OllamaSharp
Language: C# - Size: 137 KB - Last synced at: about 17 hours ago - Pushed at: 6 months ago - Stars: 12 - Forks: 3

lef-fan/aria
A local and uncensored AI entity.
Language: Python - Size: 7.7 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 61 - Forks: 14

seyf1elislam/LocalLLM_OneClick_Colab
Run gguf LLM models in Latest Version TextGen-webui
Language: Jupyter Notebook - Size: 102 KB - Last synced at: 23 days ago - Pushed at: 7 months ago - Stars: 9 - Forks: 0

palash-jain-cw/LocalLLMChatbot
This project allows you to run your own local Large Language Model (LLM) chatbot using an API like Ollama.
Language: Python - Size: 313 KB - Last synced at: about 2 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

knilink/tamperpilot
Copilot hack for running local copilot without auth and proxying
Language: JavaScript - Size: 123 KB - Last synced at: 17 days ago - Pushed at: 4 months ago - Stars: 4 - Forks: 1

av1d/LAISer
Local AI Search assistant web or CLI for ollama and llama.cpp. Lightweight and easy to run, providing a Perplexity-like experience.
Language: Python - Size: 1.36 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 1

Belluxx/LlamaTerm
Use your open source local model from the terminal
Language: Python - Size: 7.49 MB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 8 - Forks: 0
