An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: localllama

imrightguy/CloudToLocalLLM

Secure Flutter desktop app connecting Auth0 authentication with local Ollama AI models via encrypted tunneling. Access your private AI instances remotely while keeping data on your hardware.

Language: Dart - Size: 1.65 GB - Last synced at: about 16 hours ago - Pushed at: about 17 hours ago - Stars: 15 - Forks: 2

qutoh/LMRL

A narrative/roleplay engine with TCOD levels driven by unreliable narrators, in both the literal and the literary sense. Currently hooks into LMStudio and Gemini for responses, and allows tasks to be overridden and handed to player control.

Language: Python - Size: 792 KB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 0 - Forks: 0

awaescher/OllamaSharp

The easiest way to use Ollama in .NET

Language: C# - Size: 26.7 MB - Last synced at: 3 days ago - Pushed at: 14 days ago - Stars: 1,112 - Forks: 157

kaito-project/aikit

🏗️ Fine-tune, build, and deploy open-source LLMs easily!

Language: Go - Size: 4.91 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 469 - Forks: 46

mostlygeek/llama-swap

Model swapping for llama.cpp (or any local OpenAI-compatible server)

Language: Go - Size: 1.92 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 1,396 - Forks: 84

SqueezeAILab/KVQuant

[NeurIPS 2024] KVQuant: Towards 10 Million Context Length LLM Inference with KV Cache Quantization

Language: Python - Size: 19.8 MB - Last synced at: 2 days ago - Pushed at: about 1 year ago - Stars: 374 - Forks: 36

poloclub/wordflow

Social and customizable AI writing assistant! ✍️

Language: TypeScript - Size: 20.7 MB - Last synced at: 2 days ago - Pushed at: 2 months ago - Stars: 253 - Forks: 32

michaelsoftmd/zenbot-chrome

LLM-powered live web browser automation from the complete safety of a Podman/Docker container using Smolagents, Zendriver, Chrome DevTools, and VNC. Features caching as memory!

Language: Python - Size: 1.17 MB - Last synced at: 11 days ago - Pushed at: 12 days ago - Stars: 0 - Forks: 0

BrutalCoding/aub.ai

AubAI brings you on-device gen-AI capabilities, including offline text generation and more, directly within your app.

Language: Dart - Size: 119 MB - Last synced at: 5 days ago - Pushed at: over 1 year ago - Stars: 300 - Forks: 27

yankeexe/llm-rag-with-reranker-demo

LLM RAG Application with Cross-Encoders Re-ranking for YouTube video 🎥

Language: Python - Size: 387 KB - Last synced at: 15 days ago - Pushed at: 7 months ago - Stars: 66 - Forks: 30

moritztng/fltr

Like grep but for natural language questions. Based on Mistral 7B or Mixtral 8x7B.

Language: Rust - Size: 63.5 MB - Last synced at: 2 days ago - Pushed at: over 1 year ago - Stars: 384 - Forks: 8

michaelsoftmd/ai-pet-project

A set of guides for fully contained, daemonless, secure methods of storing and using LLMs locally on a mounted SSD. Uses Podman, llama.cpp, llamafiles, and Ollama with OpenHands and Zendriver; supports AMD with Vulkan.

Size: 163 KB - Last synced at: 23 days ago - Pushed at: 23 days ago - Stars: 2 - Forks: 0

seyf1elislam/LocalLLM_OneClick_Colab

Run GGUF LLM models in the latest versions of TextGen-webui and koboldcpp

Language: Jupyter Notebook - Size: 120 KB - Last synced at: 28 days ago - Pushed at: 28 days ago - Stars: 12 - Forks: 0

maifeeulasad/LocalLLaMA

📚 LocalLLaMA Archive — Community-powered static archive for r/LocalLLaMA

Language: TypeScript - Size: 427 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 6 - Forks: 1

awaescher/OllamaSharpConsole 📦

Full-featured demo application for OllamaSharp

Language: C# - Size: 137 KB - Last synced at: 6 days ago - Pushed at: 11 months ago - Stars: 14 - Forks: 4

BrunoArsioli/llama-optimus

Lightweight Python tool using Optuna for tuning llama.cpp flags: towards optimal tok/s for your machine

Language: Python - Size: 2.97 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

Snap-gen/Snapgen

🏗️ Build, fine-tune, and run generative models locally!

Language: Go - Size: 3.56 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 159 - Forks: 42

Belluxx/LlamaTerm

Use your open source local model from the terminal

Language: Python - Size: 7.5 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 8 - Forks: 0

mvdnesss/llm-demo

llm-demo offers a straightforward way to leverage large language models for translation tasks. 🌐 With this toolkit, developers can easily fine-tune and deploy models like Mistral-7B for English and Polish. 🐙

Language: Python - Size: 262 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

sozercan/kubectl-ai

✨ Kubectl plugin to create manifests with LLMs

Language: Go - Size: 243 KB - Last synced at: 3 months ago - Pushed at: 7 months ago - Stars: 1,174 - Forks: 92

hathibelagal-dev/llamashell

A powerful shell that's powered by a locally running LLM (ideally Llama 3.x or Qwen 2.5)

Language: Python - Size: 55.7 KB - Last synced at: about 2 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

lef-fan/aria

A local and uncensored AI entity.

Language: Python - Size: 7.7 MB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 61 - Forks: 14

palash-jain-cw/LocalLLMChatbot

This project allows you to run your own local Large Language Model (LLM) chatbot using an API like Ollama.

Language: Python - Size: 313 KB - Last synced at: 6 months ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

knilink/tamperpilot

Copilot hack for running local copilot without auth and proxying

Language: JavaScript - Size: 123 KB - Last synced at: 5 months ago - Pushed at: 8 months ago - Stars: 4 - Forks: 1

av1d/LAISer

Local AI search assistant (web or CLI) for Ollama and llama.cpp. Lightweight and easy to run, providing a Perplexity-like experience.

Language: Python - Size: 1.36 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 1