GitHub topics: localllama
imrightguy/CloudToLocalLLM
Secure Flutter desktop app connecting Auth0 authentication with local Ollama AI models via encrypted tunneling. Access your private AI instances remotely while keeping data on your hardware.
Language: Dart - Size: 1.65 GB - Last synced at: about 16 hours ago - Pushed at: about 17 hours ago - Stars: 15 - Forks: 2

qutoh/LMRL
A narrative/roleplay engine with TCOD levels driven by unreliable narrators - both in the literal and literature sense. Currently hooks into LMStudio and gemini for responses, allowing overrides of tasks to player control.
Language: Python - Size: 792 KB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 0 - Forks: 0

awaescher/OllamaSharp
The easiest way to use Ollama in .NET
Language: C# - Size: 26.7 MB - Last synced at: 3 days ago - Pushed at: 14 days ago - Stars: 1,112 - Forks: 157

kaito-project/aikit
🏗️ Fine-tune, build, and deploy open-source LLMs easily!
Language: Go - Size: 4.91 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 469 - Forks: 46

mostlygeek/llama-swap
Model swapping for llama.cpp (or any local OpenAPI compatible server)
Language: Go - Size: 1.92 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 1,396 - Forks: 84

SqueezeAILab/KVQuant
[NeurIPS 2024] KVQuant: Towards 10 Million Context Length LLM Inference with KV Cache Quantization
Language: Python - Size: 19.8 MB - Last synced at: 2 days ago - Pushed at: about 1 year ago - Stars: 374 - Forks: 36

poloclub/wordflow
Social and customizable AI writing assistant! ✍️
Language: TypeScript - Size: 20.7 MB - Last synced at: 2 days ago - Pushed at: 2 months ago - Stars: 253 - Forks: 32

michaelsoftmd/zenbot-chrome
LLM-powered live web browser automation from the complete safety of a Podman/Docker container using Smolagents, Zendriver, chrome dev tools and VNC. Features caching as memory!
Language: Python - Size: 1.17 MB - Last synced at: 11 days ago - Pushed at: 12 days ago - Stars: 0 - Forks: 0

BrutalCoding/aub.ai
AubAI brings you on-device gen-AI capabilities, including offline text generation and more, directly within your app.
Language: Dart - Size: 119 MB - Last synced at: 5 days ago - Pushed at: over 1 year ago - Stars: 300 - Forks: 27

yankeexe/llm-rag-with-reranker-demo
LLM RAG Application with Cross-Encoders Re-ranking for YouTube video 🎥
Language: Python - Size: 387 KB - Last synced at: 15 days ago - Pushed at: 7 months ago - Stars: 66 - Forks: 30

moritztng/fltr
Like grep but for natural language questions. Based on Mistral 7B or Mixtral 8x7B.
Language: Rust - Size: 63.5 MB - Last synced at: 2 days ago - Pushed at: over 1 year ago - Stars: 384 - Forks: 8

michaelsoftmd/ai-pet-project
A set of guides for fully contained, daemonless, secure methods of storing and using LLMs locally on a mounted SSD. Uses Podman, supports AMD with Vulkan, uses llama.cpp, llamafiles, ollama w/ Openhands, Zendriver
Size: 163 KB - Last synced at: 23 days ago - Pushed at: 23 days ago - Stars: 2 - Forks: 0

seyf1elislam/LocalLLM_OneClick_Colab
Run gguf LLM models in Latest Version TextGen-webui and koboldcpp
Language: Jupyter Notebook - Size: 120 KB - Last synced at: 28 days ago - Pushed at: 28 days ago - Stars: 12 - Forks: 0

maifeeulasad/LocalLLaMA
📚 LocalLLaMA Archive — Community-powered static archive for r/LocalLLaMA
Language: TypeScript - Size: 427 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 6 - Forks: 1

awaescher/OllamaSharpConsole 📦
Full featured demo application for OllamaSharp
Language: C# - Size: 137 KB - Last synced at: 6 days ago - Pushed at: 11 months ago - Stars: 14 - Forks: 4

BrunoArsioli/llama-optimus
Lightweight Python tool using Optuna for tuning llama.cpp flags: towards optimal tok/s for your machine
Language: Python - Size: 2.97 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

Snap-gen/Snapgen
🏗️ Build, fine-tune, and run generative models locally!
Language: Go - Size: 3.56 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 159 - Forks: 42

Belluxx/LlamaTerm
Use your open source local model from the terminal
Language: Python - Size: 7.5 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 8 - Forks: 0

mvdnesss/llm-demo
llm-demo offers a straightforward way to leverage large language models for translation tasks. 🌐 With this toolkit, developers can easily fine-tune and deploy models like Mistral-7B for English and Polish. 🐙
Language: Python - Size: 262 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

sozercan/kubectl-ai
✨ Kubectl plugin to create manifests with LLMs
Language: Go - Size: 243 KB - Last synced at: 3 months ago - Pushed at: 7 months ago - Stars: 1,174 - Forks: 92

hathibelagal-dev/llamashell
A powerful shell that's powered by a locally running LLM (ideally Llama 3.x or Qwen 2.5)
Language: Python - Size: 55.7 KB - Last synced at: about 2 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

lef-fan/aria
A local and uncensored AI entity.
Language: Python - Size: 7.7 MB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 61 - Forks: 14

palash-jain-cw/LocalLLMChatbot
This project allows you to run your own local Large Language Model (LLM) chatbot using an API like Ollama.
Language: Python - Size: 313 KB - Last synced at: 6 months ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

knilink/tamperpilot
Copilot hack for running local copilot without auth and proxying
Language: JavaScript - Size: 123 KB - Last synced at: 5 months ago - Pushed at: 8 months ago - Stars: 4 - Forks: 1

av1d/LAISer
Local AI Search assistant web or CLI for ollama and llama.cpp. Lightweight and easy to run, providing a Perplexity-like experience.
Language: Python - Size: 1.36 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 1
