GitHub topics: llamacpp
huggingface/llm-ls
LSP server leveraging LLMs for code completion (and more?)
Language: Rust - Size: 343 KB - Last synced at: about 6 hours ago - Pushed at: 7 months ago - Stars: 761 - Forks: 61

containers/ramalama
The goal of RamaLama is to make working with AI boring.
Language: Python - Size: 2.37 MB - Last synced at: about 12 hours ago - Pushed at: about 13 hours ago - Stars: 1,538 - Forks: 162

mirpo/fastapi-gen
Build LLM-enabled FastAPI applications without build configuration.
Language: Python - Size: 336 KB - Last synced at: about 14 hours ago - Pushed at: about 15 hours ago - Stars: 8 - Forks: 1

Genta-Technology/Kolosal
Kolosal AI is an OpenSource and Lightweight alternative to LM Studio to run LLMs 100% offline on your device.
Language: C++ - Size: 67.2 MB - Last synced at: about 12 hours ago - Pushed at: about 13 hours ago - Stars: 194 - Forks: 14

menloresearch/jan
Jan is an open source alternative to ChatGPT that runs 100% offline on your computer
Language: TypeScript - Size: 1.03 GB - Last synced at: 1 day ago - Pushed at: 3 days ago - Stars: 28,522 - Forks: 1,685

khoj-ai/khoj
Your AI second brain. Self-hostable. Get answers from the web or your docs. Build custom agents, schedule automations, do deep research. Turn any online or local LLM into your personal, autonomous AI (gpt, claude, gemini, llama, qwen, mistral). Get started - free.
Language: Python - Size: 109 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 28,742 - Forks: 1,608

awaescher/OllamaSharp
The easiest way to use the Ollama API in .NET
Language: C# - Size: 26.6 MB - Last synced at: about 23 hours ago - Pushed at: 10 days ago - Stars: 935 - Forks: 125

LostRuins/koboldcpp Fork of ggml-org/llama.cpp
Run GGUF models easily with a KoboldAI UI. One File. Zero Install.
Language: C++ - Size: 238 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 7,099 - Forks: 454

serge-chat/serge
A web interface for chatting with Alpaca through llama.cpp. Fully dockerized, with an easy to use API.
Language: Svelte - Size: 3.34 MB - Last synced at: 1 day ago - Pushed at: 3 days ago - Stars: 5,715 - Forks: 403

innightwolfsleep/llm_telegram_bot
LLM telegram bot
Language: Python - Size: 1.18 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 119 - Forks: 24

vercel/modelfusion
The TypeScript library for building AI applications.
Language: TypeScript - Size: 15.6 MB - Last synced at: 1 day ago - Pushed at: 9 months ago - Stars: 1,253 - Forks: 89

Fuzzy-Search/realtime-bakllava
llama.cpp with the BakLLaVA model describes what it sees
Language: Python - Size: 2.84 MB - Last synced at: 1 day ago - Pushed at: over 1 year ago - Stars: 383 - Forks: 43

getumbrel/llama-gpt
A self-hosted, offline, ChatGPT-like chatbot. Powered by Llama 2. 100% private, with no data leaving your device. New: Code Llama support!
Language: TypeScript - Size: 1.71 MB - Last synced at: 1 day ago - Pushed at: 12 months ago - Stars: 10,959 - Forks: 713

SilasMarvin/lsp-ai
LSP-AI is an open-source language server that serves as a backend for AI-powered functionality, designed to assist and empower software engineers, not replace them.
Language: Rust - Size: 1.61 MB - Last synced at: 1 day ago - Pushed at: 3 months ago - Stars: 2,695 - Forks: 94

CommanderLake/LMStud
Chat with GGUF LLMs using llama.cpp and a classic Windows Forms interface for minimal GUI bloat.
Language: C# - Size: 557 KB - Last synced at: 1 day ago - Pushed at: 2 days ago - Stars: 0 - Forks: 0

menloresearch/cortex.cpp
Local AI API Platform
Language: C++ - Size: 139 MB - Last synced at: 1 day ago - Pushed at: 7 days ago - Stars: 2,622 - Forks: 163

Josh-XT/AGiXT
AGiXT is a dynamic AI Agent Automation Platform that seamlessly orchestrates instruction management and complex task execution across diverse AI providers. Combining adaptive memory, smart features, and a versatile plugin system, AGiXT delivers efficient and comprehensive AI solutions.
Language: Python - Size: 168 MB - Last synced at: 1 day ago - Pushed at: 2 days ago - Stars: 2,974 - Forks: 398

vinhnx/VT.ai
VT.ai - Minimal multimodal AI chat app with dynamic conversation routing
Language: Python - Size: 2.43 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 72 - Forks: 9

mostlygeek/llama-swap
Model swapping for llama.cpp (or any local OpenAPI compatible server)
Language: Go - Size: 552 KB - Last synced at: 2 days ago - Pushed at: 5 days ago - Stars: 537 - Forks: 31

1b5d/llm-api
Run any Large Language Model behind a unified API
Language: Python - Size: 53.7 KB - Last synced at: 1 day ago - Pushed at: over 1 year ago - Stars: 169 - Forks: 27

yeeking/llamacpp-minimal-example
Minimal example of using llama.cpp as a library from C++
Language: C++ - Size: 198 KB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 0 - Forks: 0

AmpereComputingAI/llama.cpp
Ampere optimized llama.cpp
Language: Python - Size: 143 KB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 12 - Forks: 1

kelindar/search
Go library for embedded vector search and semantic embeddings using llama.cpp
Language: Go - Size: 714 KB - Last synced at: about 11 hours ago - Pushed at: about 1 month ago - Stars: 430 - Forks: 13

alexrozanski/LlamaChat
Chat with your favourite LLaMA models in a native macOS app
Language: Swift - Size: 14.6 MB - Last synced at: 1 day ago - Pushed at: almost 2 years ago - Stars: 1,501 - Forks: 62

Mobile-Artificial-Intelligence/maid
Maid is a cross-platform Flutter app for interfacing with GGUF / llama.cpp models locally, and with Ollama and OpenAI models remotely.
Language: Dart - Size: 114 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 1,973 - Forks: 212

karanIPS/claude-deep-research
Claude Deep Research config for Claude Code.
Size: 10.7 KB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 2 - Forks: 0

rish-1997/wsl-assistant
A fully auto configured, self-hosted local AI & database stack on Debian WSL2.
Language: Shell - Size: 123 KB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 1 - Forks: 0

SciSharp/LLamaSharp
A C#/.NET library to run LLM (LLaMA/LLaVA) on your local device efficiently.
Language: C# - Size: 391 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 3,120 - Forks: 414

l29ah/llama-cpp-haskell
Haskell bindings for the llama.cpp llama-server
Language: Haskell - Size: 23.4 KB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 2 - Forks: 1

twinnydotdev/twinny
The most no-nonsense, locally or API-hosted AI code completion plugin for Visual Studio Code - like GitHub Copilot but 100% free.
Language: TypeScript - Size: 60.7 MB - Last synced at: 3 days ago - Pushed at: 4 days ago - Stars: 3,464 - Forks: 192

OEvortex/Webscout
Webscout is the all-in-one search and AI toolkit you need. Discover insights with Yep.com, DuckDuckGo, and Phind; access cutting-edge AI models; transcribe YouTube videos; generate temporary emails and phone numbers; perform text-to-speech conversions; and much more!
Language: Python - Size: 8.56 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 219 - Forks: 41

floneum/floneum
Instant, controllable, local pre-trained AI models in Rust
Language: Rust - Size: 257 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 1,833 - Forks: 96

nathanlesage/local-chat
LocalChat is a ChatGPT-like chat that runs on your computer
Language: TypeScript - Size: 2.69 MB - Last synced at: 1 day ago - Pushed at: 4 months ago - Stars: 103 - Forks: 4

morpheuslord/HackBot
AI-powered cybersecurity chatbot designed to provide helpful and accurate answers to your cybersecurity-related queries and also do code analysis and scan analysis.
Language: Python - Size: 56.6 KB - Last synced at: 1 day ago - Pushed at: 5 months ago - Stars: 300 - Forks: 47

xorbitsai/inference
Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you're empowered to run inference with any open-source language models, speech recognition models, and multimodal models, whether in the cloud, on-premises, or even on your laptop.
Language: Python - Size: 44.6 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 7,500 - Forks: 636

mrdbourke/mac-ml-speed-test
A few quick scripts focused on testing TensorFlow/PyTorch/Llama 2 on macOS.
Language: Jupyter Notebook - Size: 1.51 MB - Last synced at: 2 days ago - Pushed at: 11 months ago - Stars: 190 - Forks: 33

Mobile-Artificial-Intelligence/llama_sdk
lcpp is a Dart implementation of llama.cpp used by the Mobile Artificial Intelligence Distribution (Maid)
Language: Dart - Size: 1.63 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 74 - Forks: 18

ddh0/easy-llama
Text generation in Python, as easy as possible
Language: Python - Size: 1.29 MB - Last synced at: 1 day ago - Pushed at: 5 days ago - Stars: 58 - Forks: 3

habib-source/pdf-rag
PDF chatbot question and answer
Language: Python - Size: 438 KB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 0 - Forks: 0

poteminr/instruct-ner
Instruct LLMs for flat and nested NER. Fine-tuning Llama and Mistral models for instruction named entity recognition. (Instruction NER)
Language: Python - Size: 297 KB - Last synced at: 1 day ago - Pushed at: 12 months ago - Stars: 82 - Forks: 8

Nexesenex/croco.cpp Fork of LostRuins/koboldcpp
Croco.Cpp is a 3rd party testground for KoboldCPP, a simple one-file way to run various GGML/GGUF models with KoboldAI's UI. (for Croco.Cpp, in Cuda mode mainly!)
Language: C++ - Size: 272 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 103 - Forks: 3

gptme/gptme
Your agent in your terminal, equipped with local tools: writes code, uses the terminal, browses the web, vision.
Language: Python - Size: 14.4 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 3,724 - Forks: 301

J4NN0/llm-rag
LLMs prompt augmentation with RAG by integrating external custom data from a variety of sources, allowing chat with such documents
Language: Python - Size: 126 KB - Last synced at: 1 day ago - Pushed at: 9 months ago - Stars: 19 - Forks: 5

inferflow/inferflow
Inferflow is an efficient and highly configurable inference engine for large language models (LLMs).
Language: C++ - Size: 1.89 MB - Last synced at: 1 day ago - Pushed at: about 1 year ago - Stars: 242 - Forks: 25

mgonzs13/llama_ros
llama.cpp (GGUF LLMs) and llava.cpp (GGUF VLMs) for ROS 2
Language: C++ - Size: 6.95 MB - Last synced at: 1 day ago - Pushed at: 6 days ago - Stars: 196 - Forks: 30

michaelgiba/the-traitors
Simulation of the TV show The Traitors with Open Source LLMs
Language: HTML - Size: 3.71 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 3 - Forks: 0

joone/loz
Loz is a command-line tool that enables your preferred LLM to execute system commands and utilize Unix pipes, integrating AI capabilities with other Unix tools.
Language: TypeScript - Size: 1.51 MB - Last synced at: 1 day ago - Pushed at: 4 months ago - Stars: 260 - Forks: 15

asiff00/Orpheus-Local-TTS
Run Orpheus TTS locally.
Language: Python - Size: 6.84 KB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 1 - Forks: 0

Adriankhl/godot-llm-template
Godot LLM Template/Demo
Language: GDScript - Size: 2.79 MB - Last synced at: 1 day ago - Pushed at: 10 months ago - Stars: 25 - Forks: 2

331Dala/llm_local
Run LLM finance apps hyper fast on local machine.
Language: Python - Size: 7.81 KB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 0 - Forks: 0

InftyAI/llmaz
Easy, advanced inference platform for large language models on Kubernetes. Star to support our work!
Language: Go - Size: 6.1 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 120 - Forks: 19

if-ai/ComfyUI-IF_AI_tools
ComfyUI-IF_AI_tools is a set of custom nodes for ComfyUI that allows you to generate prompts using a local Large Language Model (LLM) via Ollama. This tool enables you to enhance your image generation workflow by leveraging the power of language models.
Language: Python - Size: 18 MB - Last synced at: 6 days ago - Pushed at: about 1 month ago - Stars: 623 - Forks: 48

reorproject/reor
Private & local AI personal knowledge management app for high entropy people.
Language: JavaScript - Size: 92.9 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 7,827 - Forks: 464

samestrin/llm-interface
A simple NPM interface for seamlessly interacting with 36 Large Language Model (LLM) providers, including OpenAI, Anthropic, Google Gemini, Cohere, Hugging Face Inference, NVIDIA AI, Mistral AI, AI21 Studio, LLaMA.CPP, and Ollama, and hundreds of models.
Language: JavaScript - Size: 1.1 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 103 - Forks: 14

distantmagic/paddler
Stateful load balancer custom-tailored for llama.cpp
Language: Rust - Size: 4.47 MB - Last synced at: 7 days ago - Pushed at: 17 days ago - Stars: 737 - Forks: 30

JohnSnowLabs/spark-nlp
State of the Art Natural Language Processing
Language: Scala - Size: 3.36 GB - Last synced at: 2 days ago - Pushed at: 3 days ago - Stars: 3,951 - Forks: 722

shubham0204/SmolChat-Android
Running any GGUF SLMs/LLMs locally, on-device in Android
Language: Kotlin - Size: 13.3 MB - Last synced at: 8 days ago - Pushed at: 9 days ago - Stars: 251 - Forks: 29

pythops/tenere
TUI interface for LLMs written in Rust
Language: Rust - Size: 642 KB - Last synced at: 7 days ago - Pushed at: 3 months ago - Stars: 505 - Forks: 22

RahulSChand/gpu_poor
Calculate token/s & GPU memory requirement for any LLM. Supports llama.cpp/ggml/bnb/QLoRA quantization
Language: JavaScript - Size: 1.56 MB - Last synced at: 7 days ago - Pushed at: 5 months ago - Stars: 1,284 - Forks: 68

andrewkchan/yalm
Yet Another Language Model: LLM inference in C++/CUDA, no libraries except for I/O
Language: C++ - Size: 396 KB - Last synced at: 8 days ago - Pushed at: 3 months ago - Stars: 282 - Forks: 29

mukel/llama3.java
Practical Llama 3 inference in Java
Language: Java - Size: 187 KB - Last synced at: 6 days ago - Pushed at: 3 months ago - Stars: 686 - Forks: 83

rajatasusual/llamabox
Run a full local RAG pipeline on your low-end, CPU-only Windows machine using wsl2 with Debian. Private, resilient, secure.
Language: Python - Size: 1.38 MB - Last synced at: 5 days ago - Pushed at: 9 days ago - Stars: 0 - Forks: 0

ngxson/wllama
WebAssembly binding for llama.cpp - Enabling on-browser LLM inference
Language: TypeScript - Size: 26.9 MB - Last synced at: 7 days ago - Pushed at: about 1 month ago - Stars: 653 - Forks: 36

rpidanny/llm-prompt-templates
Empower your LLM to do more than you ever thought possible with these state-of-the-art prompt templates.
Language: TypeScript - Size: 641 KB - Last synced at: 1 day ago - Pushed at: 7 months ago - Stars: 41 - Forks: 2

AstraBert/qdurllm
Search your favorite websites and chat with them, on your desktop
Language: Python - Size: 1.06 MB - Last synced at: 8 days ago - Pushed at: 3 months ago - Stars: 30 - Forks: 2

Abhi5h3k/PrivateDocBot
Local PDF-Integrated Chat Bot: Secure Conversations and Document Assistance with LLM-Powered Privacy
Language: Python - Size: 2.39 MB - Last synced at: 1 day ago - Pushed at: 27 days ago - Stars: 83 - Forks: 19

nekomeowww/ollama-operator
Yet another operator for running large language models on Kubernetes with ease. Powered by Ollama!
Language: Go - Size: 1.82 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 184 - Forks: 22

tinyBigGAMES/AIToolkit
AIToolkit - AI Construction Set
Language: Pascal - Size: 6.67 MB - Last synced at: 6 days ago - Pushed at: 2 months ago - Stars: 14 - Forks: 2

jeromeboivin/ollama-chat
A single file, customizable Python CLI tool for interacting with local Language Models, ensuring data privacy while providing conversation memory and extensibility through plugins and efficient Retrieval-Augmented Generation capabilities with ChromaDB integration. Also compatible with OpenAI API.
Language: Python - Size: 793 KB - Last synced at: 1 day ago - Pushed at: 11 days ago - Stars: 12 - Forks: 3

BrutalCoding/aub.ai
AubAI brings you on-device gen-AI capabilities, including offline text generation and more, directly within your app.
Language: Dart - Size: 119 MB - Last synced at: 5 days ago - Pushed at: 12 months ago - Stars: 279 - Forks: 24

Adriankhl/godot-llm
LLM in Godot
Language: C++ - Size: 175 MB - Last synced at: 9 days ago - Pushed at: 10 months ago - Stars: 176 - Forks: 11

llmware-ai/llmware
Unified framework for building enterprise RAG pipelines with small, specialized models
Language: Python - Size: 968 MB - Last synced at: 11 days ago - Pushed at: about 1 month ago - Stars: 12,689 - Forks: 1,818

saadkh1/DocQA-TextSummarization-App
A Streamlit app for document question answering and text summarization.
Language: Jupyter Notebook - Size: 190 KB - Last synced at: 7 days ago - Pushed at: over 1 year ago - Stars: 4 - Forks: 3

yportne13/chatbot-ui-llama.cpp
A static web UI for the llama.cpp server. The llama.cpp chat interface for everyone. Based on chatbot-ui.
Language: TypeScript - Size: 2.05 MB - Last synced at: 1 day ago - Pushed at: 6 months ago - Stars: 16 - Forks: 4

cycneuramus/signal-aichat
An AI chatbot for Signal powered by Google Bard, Bing Chat, ChatGPT, HuggingChat, and llama.cpp
Language: Python - Size: 275 KB - Last synced at: 1 day ago - Pushed at: 12 months ago - Stars: 87 - Forks: 17

CentralFloridaAttorney/zmongo_retriever
Use data from MongoDB in LangChain, Llama and OpenAI
Language: Python - Size: 27.2 MB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 4 - Forks: 1

EZForever/llama.cpp-static
Static builds of llama.cpp (Currently only amd64 server builds are available)
Language: Dockerfile - Size: 50.8 KB - Last synced at: 1 day ago - Pushed at: 7 months ago - Stars: 0 - Forks: 0

matrixsmaster/ANNA
Automatic Neural Network Assistant (ANNA) is a very powerful cross-platform AI toolkit. It contains versatile CLI/GUI LLM tools optimized for local inference, and supports optional remote offloading.
Language: C - Size: 1.93 MB - Last synced at: 1 day ago - Pushed at: 12 days ago - Stars: 6 - Forks: 0

Freed-Wu/translate-shell
Translate text by google, bing, youdaozhiyun, haici, stardict, openai, large language model of local machine, etc at same time from CLI, GUI (GNU/Linux, Android, macOS and Windows), REPL, python, shell and vim.
Language: Python - Size: 452 KB - Last synced at: 1 day ago - Pushed at: 6 days ago - Stars: 39 - Forks: 5

xorbitsai/xllamacpp Fork of shakfu/cyllama
xllamacpp - a Python wrapper of llama.cpp
Language: C++ - Size: 4.14 MB - Last synced at: 13 days ago - Pushed at: 13 days ago - Stars: 31 - Forks: 4

innightwolfsleep/old_llm_telegram_bot
Connect llama-cpp, transformers or text-generation-webui to telegram bot api.
Language: Python - Size: 1.03 MB - Last synced at: 8 days ago - Pushed at: 7 months ago - Stars: 28 - Forks: 10

argonne-lcf/LLM-Inference-Bench
LLM-Inference-Bench
Language: Jupyter Notebook - Size: 11.2 MB - Last synced at: 11 days ago - Pushed at: 3 months ago - Stars: 39 - Forks: 4

Opla/opla
Empower Your Productivity with Local AI Assistants
Language: TypeScript - Size: 59.5 MB - Last synced at: 1 day ago - Pushed at: 6 days ago - Stars: 36 - Forks: 3

AgustinAllamanoCosta/Pulpero
Plugin for Neovim to explain code using a local AI
Language: Lua - Size: 979 KB - Last synced at: 7 days ago - Pushed at: 13 days ago - Stars: 1 - Forks: 0

xNul/code-llama-for-vscode
Use Code Llama with Visual Studio Code and the Continue extension. A local LLM alternative to GitHub Copilot.
Language: Python - Size: 10.7 KB - Last synced at: 1 day ago - Pushed at: 9 months ago - Stars: 568 - Forks: 32

sorohere/Hand-Pose-Detection
This project offers a versatile platform for hand-related tasks, including dataset generation and custom hand gesture detection using Google's MediaPipe library and accelerated real-time sign language translation with LLMs on edge devices.
Language: Python - Size: 821 MB - Last synced at: 1 day ago - Pushed at: 10 months ago - Stars: 13 - Forks: 2

shinomakoi/AI-Messenger
A QT GUI for large language models
Language: Python - Size: 231 KB - Last synced at: 1 day ago - Pushed at: over 1 year ago - Stars: 32 - Forks: 6

nerve-sparks/iris_android
IRIS is an android app for interfacing with GGUF / llama.cpp models locally.
Language: Kotlin - Size: 9.3 MB - Last synced at: 1 day ago - Pushed at: 3 months ago - Stars: 192 - Forks: 18

AstraBert/PrAIvateSearch
Own your AI, search the web with it
Language: Python - Size: 3.56 MB - Last synced at: 5 days ago - Pushed at: 3 months ago - Stars: 84 - Forks: 12

jabberjabberjabber/ImageIndexer
Creates an index of images, queries a local LLM and adds tags to the image metadata
Language: Python - Size: 28.1 MB - Last synced at: 15 days ago - Pushed at: 15 days ago - Stars: 159 - Forks: 7

saddam213/LLamaStack
ASP.NET Core Web, WebApi & WPF implementations for LLama.cpp & LLamaSharp
Language: C# - Size: 8.84 MB - Last synced at: 5 days ago - Pushed at: over 1 year ago - Stars: 58 - Forks: 20

fynnfluegge/codeqai
Local first semantic code search and chat | Leverage custom copilots with fine-tuning datasets from code in Alpaca, Conversational, Completion and Instruction format
Language: Python - Size: 562 KB - Last synced at: 7 days ago - Pushed at: 2 months ago - Stars: 470 - Forks: 50

Atome-FE/llama-node
Believe in AI democratization. llama for nodejs backed by llama-rs, llama.cpp and rwkv.cpp, work locally on your laptop CPU. support llama/alpaca/gpt4all/vicuna/rwkv model.
Language: Rust - Size: 30.4 MB - Last synced at: 2 days ago - Pushed at: over 1 year ago - Stars: 870 - Forks: 64

mrtrizer/UnityLlamaCpp
Llama.cpp in Unity, straightforward and clean
Language: C# - Size: 22.5 KB - Last synced at: 2 days ago - Pushed at: 11 months ago - Stars: 18 - Forks: 1

MorganRO8/Lucys_Labyrinth
A game made for a school project, dedicated to my daughter.
Language: C++ - Size: 97.1 MB - Last synced at: 14 days ago - Pushed at: about 1 year ago - Stars: 94 - Forks: 4

Dicklesworthstone/swiss_army_llama
A FastAPI service for semantic text search using precomputed embeddings and advanced similarity measures, with built-in support for various file types through textract.
Language: Python - Size: 7.25 MB - Last synced at: 7 days ago - Pushed at: about 2 months ago - Stars: 1,014 - Forks: 61

kspviswa/pyOllaMx
Your gateway to both Ollama & Apple MLX models
Language: Python - Size: 5.19 MB - Last synced at: 14 days ago - Pushed at: about 2 months ago - Stars: 115 - Forks: 8

AutonomicPerfectionist/PipeInfer
PipeInfer: Accelerating LLM Inference using Asynchronous Pipelined Speculation
Language: C++ - Size: 17.5 MB - Last synced at: 14 days ago - Pushed at: 5 months ago - Stars: 29 - Forks: 4

eliranwong/toolmate
ToolMate AI, developed by Eliran Wong, is a cutting-edge AI companion that seamlessly integrates agents, tools, and plugins to excel in conversations, generative work, and task execution. Supports custom workflow and plugins to automate multi-step actions.
Language: Python - Size: 40.2 MB - Last synced at: 1 day ago - Pushed at: about 1 month ago - Stars: 152 - Forks: 15

dieharders/obrew-studio-server
Obrew Studio - Server: A self-hostable machine learning engine. Build agents and schedule workflows private to you.
Language: Python - Size: 138 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 9 - Forks: 1
