GitHub topics: localllm
n4ze3m/page-assist
Use your locally running AI models to assist you in your web browsing
Language: TypeScript - Size: 7.18 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 6,657 - Forks: 599

lofcz/LlmTornado
The .NET library to consume 100+ APIs: OpenAI, Anthropic, Google, DeepSeek, Cohere, Mistral, Azure, xAI, Perplexity, Groq, Voyage, DeepInfra, Ollama, vLLM, and many more!
Language: C# - Size: 32.7 MB - Last synced at: 1 day ago - Pushed at: 3 days ago - Stars: 171 - Forks: 22

perk11/large-model-proxy
Run multiple resource-heavy Large Models (LM) on the same machine with limited amount of VRAM/other resources by exposing them on different ports and loading/unloading them on demand
Language: Go - Size: 197 KB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 65 - Forks: 4

BodhiSearch/BodhiApp
Run Open Source/Open Weight LLMs locally with OpenAI compatible APIs
Language: Rust - Size: 183 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 99 - Forks: 9

3-ark/Cognito-AI_Sidekick
Cognito: Supercharge your Chrome browser with AI. Guide, query, and control everything using natural language.
Language: TypeScript - Size: 153 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 46 - Forks: 3

Hayashi-Yudai/aichat
A customizable AI chat application powered by Flet.
Language: Python - Size: 1.98 MB - Last synced at: 4 days ago - Pushed at: 6 days ago - Stars: 3 - Forks: 1

tegridydev/dnd-llm-game
MVP of an idea using multiple local LLM models to simulate and play D&D
Language: Python - Size: 215 KB - Last synced at: 6 days ago - Pushed at: about 2 months ago - Stars: 71 - Forks: 6

SqueezeAILab/SqueezeLLM
[ICML 2024] SqueezeLLM: Dense-and-Sparse Quantization
Language: Python - Size: 1.5 MB - Last synced at: 8 days ago - Pushed at: 10 months ago - Stars: 689 - Forks: 45

SqueezeAILab/KVQuant
[NeurIPS 2024] KVQuant: Towards 10 Million Context Length LLM Inference with KV Cache Quantization
Language: Python - Size: 19.8 MB - Last synced at: 8 days ago - Pushed at: 10 months ago - Stars: 355 - Forks: 31

mostlygeek/llama-swap
Model swapping for llama.cpp (or any local OpenAPI compatible server)
Language: Go - Size: 952 KB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 816 - Forks: 42

aruntemme/go-rag
Advanced RAG System with Go featuring intelligent adaptive chunking, hierarchical document processing, semantic search, and flexible LLM integration
Language: Go - Size: 4.31 MB - Last synced at: 7 days ago - Pushed at: 16 days ago - Stars: 1 - Forks: 0

codeasarjun/chatwithyourpdf
This repo will help to understand how you can use LLM to chat with your given pdf or pdfs
Language: Jupyter Notebook - Size: 9.77 KB - Last synced at: 14 days ago - Pushed at: 14 days ago - Stars: 0 - Forks: 0

twinnydotdev/symmetry-cli
The client for the Symmetry peer-to-peer inference network. Enabling users to connect with each other, share computational resources, and collect valuable machine learning data.
Language: JavaScript - Size: 1.17 MB - Last synced at: 11 days ago - Pushed at: 3 months ago - Stars: 26 - Forks: 4

mirpo/datamatic
Generate synthetic datasets using local LLMs via Ollama and LMstudio with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 3, Mistral Small 3.1 and other major language models.
Language: Go - Size: 89.8 KB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 1 - Forks: 0

lebrunel/ollama-ex
A nifty little library for working with Ollama in Elixir.
Language: Elixir - Size: 122 KB - Last synced at: 17 days ago - Pushed at: 5 months ago - Stars: 115 - Forks: 7

KwaiKEG/KwaiAgents
A generalized information-seeking agent system with Large Language Models (LLMs).
Language: Python - Size: 7.65 MB - Last synced at: 18 days ago - Pushed at: 12 months ago - Stars: 1,160 - Forks: 114

sauravpanda/BrowserAI
Run local LLMs like llama, deepseek-distill, kokoro and more inside your browser
Language: TypeScript - Size: 293 MB - Last synced at: 20 days ago - Pushed at: about 2 months ago - Stars: 1,098 - Forks: 95

UtkarshTheDev/LocalLab
LocalLab allows you to easily run Hugging Face AI models locally or on Google Colab, featuring automatic API setup, model management, performance optimization, and system monitoring.
Language: Python - Size: 577 KB - Last synced at: 4 days ago - Pushed at: 21 days ago - Stars: 5 - Forks: 0

Alfer-Star/document-ai-workshop
A german workshop where you learn how to build RAGs with Langchain
Language: Python - Size: 8.44 MB - Last synced at: 21 days ago - Pushed at: 21 days ago - Stars: 5 - Forks: 0

Wakoma/OfflineAI
Local/Offline Machine Learning Resources
Size: 104 KB - Last synced at: 7 days ago - Pushed at: 2 months ago - Stars: 8 - Forks: 1

PromptEngineer48/MemGPT-AutoGEN-LLM
Run MemGPT-AutoGEN-Local LLM Together
Language: Python - Size: 6.84 KB - Last synced at: 22 days ago - Pushed at: over 1 year ago - Stars: 303 - Forks: 87

arvindjuneja/OwnAI
Local LLM (using Ollama) interface for MacOS
Language: Swift - Size: 29.3 KB - Last synced at: 29 days ago - Pushed at: 29 days ago - Stars: 0 - Forks: 0

unusual9guy/pizzeria-review-agent
Pizzeria AI Agent - An intelligent assistant that answers questions about a pizzeria based on reviews. Built with LangChain and Ollama, this project demonstrates how to create a simple AI agent using vector search to retrieve relevant information from restaurant reviews.
Language: Python - Size: 0 Bytes - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

seyf1elislam/LocalLLM_OneClick_Colab
Run gguf LLM models in Latest Version TextGen-webui
Language: Jupyter Notebook - Size: 102 KB - Last synced at: 28 days ago - Pushed at: 8 months ago - Stars: 10 - Forks: 0

WilliamKarolDiCioccio/open_local_ui
OpenLocalUI: Native desktop app for Windows, MacOS and Linux. Easily run Large Language Models locally, no complex setups required. Inspired by OpenWebUI's simplicity for LLM use.
Language: Dart - Size: 4.97 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 59 - Forks: 3

MDGrey33/pyvisionai
The PyVisionAI Official Repo
Language: Python - Size: 9.93 MB - Last synced at: 16 days ago - Pushed at: 3 months ago - Stars: 102 - Forks: 11

sujithhubpost/initialterm
Local LLM enabled Human terminal interaction made easy.
Language: Python - Size: 15.6 KB - Last synced at: 26 days ago - Pushed at: 5 months ago - Stars: 13 - Forks: 4

nayan359/assistive-ai
Zero-shot object detection system for visually impaired users using CLIP, OWL-ViT, and real-time audio feedback.
Language: JavaScript - Size: 3.07 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

smaranjitghose/SightGuardAI
Capitalizing moondream's capabilities to build a CCTV frame-on-framer analyzer
Language: Python - Size: 1.24 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 1 - Forks: 0

hathibelagal-dev/llamashell
A powerful shell that's powered by a locally running LLM (ideally Llama 3.x or Qwen 2.5)
Language: Python - Size: 55.7 KB - Last synced at: 3 days ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

yeeking/llamacpp-minimal-example
Minimal example of using llama cpp as library from cpp
Language: C++ - Size: 198 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

Yuvraj960/LLM-ChatBot
Generates AI-based responses with help of LocalLLM running on Ollama.
Language: Python - Size: 21.5 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

pahulgogna/localGPT
An ollama interface which provides models with MCPs
Language: TypeScript - Size: 106 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 1 - Forks: 0

mohitkumarrajbadi/ifusionone
iFusionOne the one tool you need
Language: TypeScript - Size: 2.2 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

arjunprabhulal/adk-gemma3-function-calling
ADK Gemma3 Function Calling Example
Language: Python - Size: 28.2 MB - Last synced at: 20 days ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

promptmesh/InferAdmin
A lightweight management interface for local LLM infrastructure.
Language: Python - Size: 791 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

undici77/DoxyPatch
Doxygen 🚀 AI POWERED Generator
Language: C# - Size: 129 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 1 - Forks: 0

docspedia/docspedia
Chat with your pdf using your local LLM, OLLAMA client.(incomplete)
Language: TypeScript - Size: 3.12 MB - Last synced at: about 2 months ago - Pushed at: 8 months ago - Stars: 37 - Forks: 1

hariharen9/localseek
LocalSeek 🤖💬 LocalSeek is a powerful, privacy-first AI chat extension for Visual Studio Code that brings conversational AI directly to your development environment - completely locally.
Language: TypeScript - Size: 631 KB - Last synced at: about 2 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 1

hathibelagal-dev/LocalLLMHub
Chat with local Llama, Qwen, and Gemma models
Language: HTML - Size: 43 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

dwain-barnes/fastrtc-job-interview-simulator
A FastRTC-powered job interview simulator with real-time voice interaction. Practice with an AI interviewer that adapts to your job description and provides personalised feedback. Customise difficulty levels, practice in a risk-free environment, and improve your interview skills before the real thing.
Language: HTML - Size: 110 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

joshua2705/Ollama-Policy-Reader-Extension
An AI charged chrome extension to read those pesky privacy policies and save you from accidentally agreeing to selling your soul
Language: TypeScript - Size: 71.3 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

aronweiler/DocTalk
This started out as a POC for chatting over my documents, but has turned into a whole framework for using LLMs.
Language: Python - Size: 10.1 MB - Last synced at: 8 days ago - Pushed at: almost 2 years ago - Stars: 5 - Forks: 0

mskry/dotfiles
Alacritty + Fish + Zellij + Starship + Neovim + i3 + Supermaven + Ollama 🦙 = 🚀
Language: Shell - Size: 589 KB - Last synced at: 7 days ago - Pushed at: 7 months ago - Stars: 9 - Forks: 1

Darthph0enix7/DocPOI_repo
A local chatbot for managing docs
Language: Python - Size: 5.59 MB - Last synced at: 6 days ago - Pushed at: 4 months ago - Stars: 23 - Forks: 0

AK3847/sumsum
A minimal CLI tool to locally summarize any text using LLM!
Language: Python - Size: 29.3 KB - Last synced at: 16 days ago - Pushed at: 5 months ago - Stars: 2 - Forks: 1

Priyansusahoo/ollama-webUI
Streamlined Ollama WebUI Setup: Automated Scripts, LLM Integration, and Desktop Shortcut Creation
Language: Shell - Size: 46.9 KB - Last synced at: about 2 months ago - Pushed at: 4 months ago - Stars: 2 - Forks: 0

tegridydev/multi-agent-secops-llm
This project is a multi-agent security framework that utilizes multiple LLM models to analyze and generate comprehensive security briefs.
Language: Python - Size: 10.7 KB - Last synced at: 2 months ago - Pushed at: about 1 year ago - Stars: 6 - Forks: 0

neodyland/entropix
Unofficial entropix impl for Gemma2 and Llama and Qwen2 and Mistral
Language: Python - Size: 76.2 KB - Last synced at: about 1 month ago - Pushed at: 5 months ago - Stars: 17 - Forks: 1

smaranjitghose/LunarSightAI
Unleashing the power of local vlms with moondream and streamlit
Language: Python - Size: 471 KB - Last synced at: 3 months ago - Pushed at: 5 months ago - Stars: 1 - Forks: 0

ngruychev/finite-craft
A clone of InfiniteCraft (AI!!! LLMs!!) you can run on a laptop _without_ a good GPU!!
Language: Python - Size: 22.5 KB - Last synced at: 3 months ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0

av1d/LAISer
Local AI Search assistant web or CLI for ollama and llama.cpp. Lightweight and easy to run, providing a Perplexity-like experience.
Language: Python - Size: 1.36 MB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 0 - Forks: 1

ivkos/jan-models-bggpt
BgGPT for Jan 👋
Language: Python - Size: 5.86 KB - Last synced at: 3 months ago - Pushed at: about 1 year ago - Stars: 3 - Forks: 0

10Nates/Humanlike-AI-Chat
Humanlike AI Chat is a terminal-based LLM UI designed to study how to bypass AI text detection.
Language: Python - Size: 373 KB - Last synced at: 8 days ago - Pushed at: about 1 year ago - Stars: 11 - Forks: 1

tristan-mcinnis/Ollama-Web-Summarization
This repository contains a Python-based tool for summarizing web content using the Ollama API. It scrapes articles from URLs, cleans and processes the HTML content, and generates summaries using a pre-trained language model. The repository also includes a rich-based logging utility for improved console output.
Language: Python - Size: 17.6 KB - Last synced at: 23 days ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

cperazza/RFM_Segmentation
This is a basic workflow with CrewAI agents working with sales transactions to draw business insights and marketing recommendations. The agents will work on everything from the execution plan to the business insights report. It works with local LLM via Ollama (I'm using llama3:8B but you can easily change it).
Language: Python - Size: 1.89 MB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 0 - Forks: 0

josepharielct/LocalRAG
This projects build a local retrieval augmented generation (pipeline) from scratch, connects it to a local llm, and is deployed as a chatbot via Gradio.
Language: Jupyter Notebook - Size: 107 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

kaminoer/ScrAIbe-Assistant Fork of AndreDalwin/Whisper2Summarize
ScrAIbe Assistant is designed to leverage Whisper for precise audio processing and local LLMs via Ollama for efficient summarization. This tool is perfect for tasks such as taking notes from team meetings or lectures, offering a secure environment where no data—be it text, audio, or otherwise—leaves your local machine.
Language: Python - Size: 1.36 MB - Last synced at: 8 days ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

gustavostz/Local-AI-Open-Orca-For-Dummies
Local AI Open Orca For Dummies is a user-friendly guide to running Large Language Models locally. Simplify your AI journey with easy-to-follow instructions and minimal setup. Perfect for developers tired of complex processes!
Language: Python - Size: 698 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

village0/slack_bot
Slack bot that integrates local LLM into your workflows
Language: Python - Size: 48.8 KB - Last synced at: 20 days ago - Pushed at: 20 days ago - Stars: 1 - Forks: 2

comaier/comor
Local, customizable, open-sourced role-play app.
Size: 17.6 KB - Last synced at: 5 months ago - Pushed at: over 1 year ago - Stars: 2 - Forks: 0

JaySandoz/Auto-GPT Fork of Significant-Gravitas/Auto-GPT
Tiny Starcoder LLM Implementation, added to commands
Language: Python - Size: 3.67 MB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0
