An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: localllm

n4ze3m/page-assist

Use your locally running AI models to assist you in your web browsing

Language: TypeScript - Size: 7.18 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 6,657 - Forks: 599

lofcz/LlmTornado

The .NET library to consume 100+ APIs: OpenAI, Anthropic, Google, DeepSeek, Cohere, Mistral, Azure, xAI, Perplexity, Groq, Voyage, DeepInfra, Ollama, vLLM, and many more!

Language: C# - Size: 32.7 MB - Last synced at: 1 day ago - Pushed at: 3 days ago - Stars: 171 - Forks: 22

perk11/large-model-proxy

Run multiple resource-heavy Large Models (LM) on the same machine with limited amount of VRAM/other resources by exposing them on different ports and loading/unloading them on demand

Language: Go - Size: 197 KB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 65 - Forks: 4

BodhiSearch/BodhiApp

Run Open Source/Open Weight LLMs locally with OpenAI compatible APIs

Language: Rust - Size: 183 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 99 - Forks: 9

3-ark/Cognito-AI_Sidekick

Cognito: Supercharge your Chrome browser with AI. Guide, query, and control everything using natural language.

Language: TypeScript - Size: 153 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 46 - Forks: 3

Hayashi-Yudai/aichat

A customizable AI chat application powered by Flet.

Language: Python - Size: 1.98 MB - Last synced at: 4 days ago - Pushed at: 6 days ago - Stars: 3 - Forks: 1

tegridydev/dnd-llm-game

MVP of an idea using multiple local LLM models to simulate and play D&D

Language: Python - Size: 215 KB - Last synced at: 6 days ago - Pushed at: about 2 months ago - Stars: 71 - Forks: 6

SqueezeAILab/SqueezeLLM

[ICML 2024] SqueezeLLM: Dense-and-Sparse Quantization

Language: Python - Size: 1.5 MB - Last synced at: 8 days ago - Pushed at: 10 months ago - Stars: 689 - Forks: 45

SqueezeAILab/KVQuant

[NeurIPS 2024] KVQuant: Towards 10 Million Context Length LLM Inference with KV Cache Quantization

Language: Python - Size: 19.8 MB - Last synced at: 8 days ago - Pushed at: 10 months ago - Stars: 355 - Forks: 31

mostlygeek/llama-swap

Model swapping for llama.cpp (or any local OpenAPI compatible server)

Language: Go - Size: 952 KB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 816 - Forks: 42

aruntemme/go-rag

Advanced RAG System with Go featuring intelligent adaptive chunking, hierarchical document processing, semantic search, and flexible LLM integration

Language: Go - Size: 4.31 MB - Last synced at: 7 days ago - Pushed at: 16 days ago - Stars: 1 - Forks: 0

codeasarjun/chatwithyourpdf

This repo will help to understand how you can use LLM to chat with your given pdf or pdfs

Language: Jupyter Notebook - Size: 9.77 KB - Last synced at: 14 days ago - Pushed at: 14 days ago - Stars: 0 - Forks: 0

twinnydotdev/symmetry-cli

The client for the Symmetry peer-to-peer inference network. Enabling users to connect with each other, share computational resources, and collect valuable machine learning data.

Language: JavaScript - Size: 1.17 MB - Last synced at: 11 days ago - Pushed at: 3 months ago - Stars: 26 - Forks: 4

mirpo/datamatic

Generate synthetic datasets using local LLMs via Ollama and LMstudio with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 3, Mistral Small 3.1 and other major language models.

Language: Go - Size: 89.8 KB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 1 - Forks: 0

lebrunel/ollama-ex

A nifty little library for working with Ollama in Elixir.

Language: Elixir - Size: 122 KB - Last synced at: 17 days ago - Pushed at: 5 months ago - Stars: 115 - Forks: 7

KwaiKEG/KwaiAgents

A generalized information-seeking agent system with Large Language Models (LLMs).

Language: Python - Size: 7.65 MB - Last synced at: 18 days ago - Pushed at: 12 months ago - Stars: 1,160 - Forks: 114

sauravpanda/BrowserAI

Run local LLMs like llama, deepseek-distill, kokoro and more inside your browser

Language: TypeScript - Size: 293 MB - Last synced at: 20 days ago - Pushed at: about 2 months ago - Stars: 1,098 - Forks: 95

UtkarshTheDev/LocalLab

LocalLab allows you to easily run Hugging Face AI models locally or on Google Colab, featuring automatic API setup, model management, performance optimization, and system monitoring.

Language: Python - Size: 577 KB - Last synced at: 4 days ago - Pushed at: 21 days ago - Stars: 5 - Forks: 0

Alfer-Star/document-ai-workshop

A german workshop where you learn how to build RAGs with Langchain

Language: Python - Size: 8.44 MB - Last synced at: 21 days ago - Pushed at: 21 days ago - Stars: 5 - Forks: 0

Wakoma/OfflineAI

Local/Offline Machine Learning Resources

Size: 104 KB - Last synced at: 7 days ago - Pushed at: 2 months ago - Stars: 8 - Forks: 1

PromptEngineer48/MemGPT-AutoGEN-LLM

Run MemGPT-AutoGEN-Local LLM Together

Language: Python - Size: 6.84 KB - Last synced at: 22 days ago - Pushed at: over 1 year ago - Stars: 303 - Forks: 87

arvindjuneja/OwnAI

Local LLM (using Ollama) interface for MacOS

Language: Swift - Size: 29.3 KB - Last synced at: 29 days ago - Pushed at: 29 days ago - Stars: 0 - Forks: 0

unusual9guy/pizzeria-review-agent

Pizzeria AI Agent - An intelligent assistant that answers questions about a pizzeria based on reviews. Built with LangChain and Ollama, this project demonstrates how to create a simple AI agent using vector search to retrieve relevant information from restaurant reviews.

Language: Python - Size: 0 Bytes - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

seyf1elislam/LocalLLM_OneClick_Colab

Run gguf LLM models in Latest Version TextGen-webui

Language: Jupyter Notebook - Size: 102 KB - Last synced at: 28 days ago - Pushed at: 8 months ago - Stars: 10 - Forks: 0

WilliamKarolDiCioccio/open_local_ui

OpenLocalUI: Native desktop app for Windows, MacOS and Linux. Easily run Large Language Models locally, no complex setups required. Inspired by OpenWebUI's simplicity for LLM use.

Language: Dart - Size: 4.97 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 59 - Forks: 3

MDGrey33/pyvisionai

The PyVisionAI Official Repo

Language: Python - Size: 9.93 MB - Last synced at: 16 days ago - Pushed at: 3 months ago - Stars: 102 - Forks: 11

sujithhubpost/initialterm

Local LLM enabled Human terminal interaction made easy.

Language: Python - Size: 15.6 KB - Last synced at: 26 days ago - Pushed at: 5 months ago - Stars: 13 - Forks: 4

nayan359/assistive-ai

Zero-shot object detection system for visually impaired users using CLIP, OWL-ViT, and real-time audio feedback.

Language: JavaScript - Size: 3.07 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

smaranjitghose/SightGuardAI

Capitalizing moondream's capabilities to build a CCTV frame-on-framer analyzer

Language: Python - Size: 1.24 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 1 - Forks: 0

hathibelagal-dev/llamashell

A powerful shell that's powered by a locally running LLM (ideally Llama 3.x or Qwen 2.5)

Language: Python - Size: 55.7 KB - Last synced at: 3 days ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

yeeking/llamacpp-minimal-example

Minimal example of using llama cpp as library from cpp

Language: C++ - Size: 198 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

Yuvraj960/LLM-ChatBot

Generates AI-based responses with help of LocalLLM running on Ollama.

Language: Python - Size: 21.5 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

pahulgogna/localGPT

An ollama interface which provides models with MCPs

Language: TypeScript - Size: 106 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 1 - Forks: 0

mohitkumarrajbadi/ifusionone

iFusionOne the one tool you need

Language: TypeScript - Size: 2.2 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

arjunprabhulal/adk-gemma3-function-calling

ADK Gemma3 Function Calling Example

Language: Python - Size: 28.2 MB - Last synced at: 20 days ago - Pushed at: about 2 months ago - Stars: 0 - Forks: 0

promptmesh/InferAdmin

A lightweight management interface for local LLM infrastructure.

Language: Python - Size: 791 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

undici77/DoxyPatch

Doxygen 🚀 AI POWERED Generator

Language: C# - Size: 129 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 1 - Forks: 0

docspedia/docspedia

Chat with your pdf using your local LLM, OLLAMA client.(incomplete)

Language: TypeScript - Size: 3.12 MB - Last synced at: about 2 months ago - Pushed at: 8 months ago - Stars: 37 - Forks: 1

hariharen9/localseek

LocalSeek 🤖💬 LocalSeek is a powerful, privacy-first AI chat extension for Visual Studio Code that brings conversational AI directly to your development environment - completely locally.

Language: TypeScript - Size: 631 KB - Last synced at: about 2 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 1

hathibelagal-dev/LocalLLMHub

Chat with local Llama, Qwen, and Gemma models

Language: HTML - Size: 43 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

dwain-barnes/fastrtc-job-interview-simulator

A FastRTC-powered job interview simulator with real-time voice interaction. Practice with an AI interviewer that adapts to your job description and provides personalised feedback. Customise difficulty levels, practice in a risk-free environment, and improve your interview skills before the real thing.

Language: HTML - Size: 110 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

joshua2705/Ollama-Policy-Reader-Extension

An AI charged chrome extension to read those pesky privacy policies and save you from accidentally agreeing to selling your soul

Language: TypeScript - Size: 71.3 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

aronweiler/DocTalk

This started out as a POC for chatting over my documents, but has turned into a whole framework for using LLMs.

Language: Python - Size: 10.1 MB - Last synced at: 8 days ago - Pushed at: almost 2 years ago - Stars: 5 - Forks: 0

mskry/dotfiles

Alacritty + Fish + Zellij + Starship + Neovim + i3 + Supermaven + Ollama 🦙 = 🚀

Language: Shell - Size: 589 KB - Last synced at: 7 days ago - Pushed at: 7 months ago - Stars: 9 - Forks: 1

Darthph0enix7/DocPOI_repo

A local chatbot for managing docs

Language: Python - Size: 5.59 MB - Last synced at: 6 days ago - Pushed at: 4 months ago - Stars: 23 - Forks: 0

AK3847/sumsum

A minimal CLI tool to locally summarize any text using LLM!

Language: Python - Size: 29.3 KB - Last synced at: 16 days ago - Pushed at: 5 months ago - Stars: 2 - Forks: 1

Priyansusahoo/ollama-webUI

Streamlined Ollama WebUI Setup: Automated Scripts, LLM Integration, and Desktop Shortcut Creation

Language: Shell - Size: 46.9 KB - Last synced at: about 2 months ago - Pushed at: 4 months ago - Stars: 2 - Forks: 0

tegridydev/multi-agent-secops-llm

This project is a multi-agent security framework that utilizes multiple LLM models to analyze and generate comprehensive security briefs.

Language: Python - Size: 10.7 KB - Last synced at: 2 months ago - Pushed at: about 1 year ago - Stars: 6 - Forks: 0

neodyland/entropix

Unofficial entropix impl for Gemma2 and Llama and Qwen2 and Mistral

Language: Python - Size: 76.2 KB - Last synced at: about 1 month ago - Pushed at: 5 months ago - Stars: 17 - Forks: 1

smaranjitghose/LunarSightAI

Unleashing the power of local vlms with moondream and streamlit

Language: Python - Size: 471 KB - Last synced at: 3 months ago - Pushed at: 5 months ago - Stars: 1 - Forks: 0

ngruychev/finite-craft

A clone of InfiniteCraft (AI!!! LLMs!!) you can run on a laptop _without_ a good GPU!!

Language: Python - Size: 22.5 KB - Last synced at: 3 months ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0

av1d/LAISer

Local AI Search assistant web or CLI for ollama and llama.cpp. Lightweight and easy to run, providing a Perplexity-like experience.

Language: Python - Size: 1.36 MB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 0 - Forks: 1

ivkos/jan-models-bggpt

BgGPT for Jan 👋

Language: Python - Size: 5.86 KB - Last synced at: 3 months ago - Pushed at: about 1 year ago - Stars: 3 - Forks: 0

10Nates/Humanlike-AI-Chat

Humanlike AI Chat is a terminal-based LLM UI designed to study how to bypass AI text detection.

Language: Python - Size: 373 KB - Last synced at: 8 days ago - Pushed at: about 1 year ago - Stars: 11 - Forks: 1

tristan-mcinnis/Ollama-Web-Summarization

This repository contains a Python-based tool for summarizing web content using the Ollama API. It scrapes articles from URLs, cleans and processes the HTML content, and generates summaries using a pre-trained language model. The repository also includes a rich-based logging utility for improved console output.

Language: Python - Size: 17.6 KB - Last synced at: 23 days ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

cperazza/RFM_Segmentation

This is a basic workflow with CrewAI agents working with sales transactions to draw business insights and marketing recommendations. The agents will work on everything from the execution plan to the business insights report. It works with local LLM via Ollama (I'm using llama3:8B but you can easily change it).

Language: Python - Size: 1.89 MB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 0 - Forks: 0

josepharielct/LocalRAG

This projects build a local retrieval augmented generation (pipeline) from scratch, connects it to a local llm, and is deployed as a chatbot via Gradio.

Language: Jupyter Notebook - Size: 107 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

kaminoer/ScrAIbe-Assistant Fork of AndreDalwin/Whisper2Summarize

ScrAIbe Assistant is designed to leverage Whisper for precise audio processing and local LLMs via Ollama for efficient summarization. This tool is perfect for tasks such as taking notes from team meetings or lectures, offering a secure environment where no data—be it text, audio, or otherwise—leaves your local machine.

Language: Python - Size: 1.36 MB - Last synced at: 8 days ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

gustavostz/Local-AI-Open-Orca-For-Dummies

Local AI Open Orca For Dummies is a user-friendly guide to running Large Language Models locally. Simplify your AI journey with easy-to-follow instructions and minimal setup. Perfect for developers tired of complex processes!

Language: Python - Size: 698 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

village0/slack_bot

Slack bot that integrates local LLM into your workflows

Language: Python - Size: 48.8 KB - Last synced at: 20 days ago - Pushed at: 20 days ago - Stars: 1 - Forks: 2

comaier/comor

Local, customizable, open-sourced role-play app.

Size: 17.6 KB - Last synced at: 5 months ago - Pushed at: over 1 year ago - Stars: 2 - Forks: 0

JaySandoz/Auto-GPT Fork of Significant-Gravitas/Auto-GPT

Tiny Starcoder LLM Implementation, added to commands

Language: Python - Size: 3.67 MB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0