An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: llamacpp

huggingface/llm-ls

LSP server leveraging LLMs for code completion (and more?)

Language: Rust - Size: 343 KB - Last synced at: about 6 hours ago - Pushed at: 7 months ago - Stars: 761 - Forks: 61

containers/ramalama

The goal of RamaLama is to make working with AI boring.

Language: Python - Size: 2.37 MB - Last synced at: about 12 hours ago - Pushed at: about 13 hours ago - Stars: 1,538 - Forks: 162

mirpo/fastapi-gen

Build LLM-enabled FastAPI applications without build configuration.

Language: Python - Size: 336 KB - Last synced at: about 14 hours ago - Pushed at: about 15 hours ago - Stars: 8 - Forks: 1

Genta-Technology/Kolosal

Kolosal AI is an OpenSource and Lightweight alternative to LM Studio to run LLMs 100% offline on your device.

Language: C++ - Size: 67.2 MB - Last synced at: about 12 hours ago - Pushed at: about 13 hours ago - Stars: 194 - Forks: 14

menloresearch/jan

Jan is an open source alternative to ChatGPT that runs 100% offline on your computer

Language: TypeScript - Size: 1.03 GB - Last synced at: 1 day ago - Pushed at: 3 days ago - Stars: 28,522 - Forks: 1,685

khoj-ai/khoj

Your AI second brain. Self-hostable. Get answers from the web or your docs. Build custom agents, schedule automations, do deep research. Turn any online or local LLM into your personal, autonomous AI (gpt, claude, gemini, llama, qwen, mistral). Get started - free.

Language: Python - Size: 109 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 28,742 - Forks: 1,608

awaescher/OllamaSharp

The easiest way to use the Ollama API in .NET

Language: C# - Size: 26.6 MB - Last synced at: about 23 hours ago - Pushed at: 10 days ago - Stars: 935 - Forks: 125

LostRuins/koboldcpp Fork of ggml-org/llama.cpp

Run GGUF models easily with a KoboldAI UI. One File. Zero Install.

Language: C++ - Size: 238 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 7,099 - Forks: 454

serge-chat/serge

A web interface for chatting with Alpaca through llama.cpp. Fully dockerized, with an easy to use API.

Language: Svelte - Size: 3.34 MB - Last synced at: 1 day ago - Pushed at: 3 days ago - Stars: 5,715 - Forks: 403

innightwolfsleep/llm_telegram_bot

LLM telegram bot

Language: Python - Size: 1.18 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 119 - Forks: 24

vercel/modelfusion

The TypeScript library for building AI applications.

Language: TypeScript - Size: 15.6 MB - Last synced at: 1 day ago - Pushed at: 9 months ago - Stars: 1,253 - Forks: 89

Fuzzy-Search/realtime-bakllava

llama.cpp with the BakLLaVA model describes what it sees

Language: Python - Size: 2.84 MB - Last synced at: 1 day ago - Pushed at: over 1 year ago - Stars: 383 - Forks: 43

getumbrel/llama-gpt

A self-hosted, offline, ChatGPT-like chatbot. Powered by Llama 2. 100% private, with no data leaving your device. New: Code Llama support!

Language: TypeScript - Size: 1.71 MB - Last synced at: 1 day ago - Pushed at: 12 months ago - Stars: 10,959 - Forks: 713

SilasMarvin/lsp-ai

LSP-AI is an open-source language server that serves as a backend for AI-powered functionality, designed to assist and empower software engineers, not replace them.

Language: Rust - Size: 1.61 MB - Last synced at: 1 day ago - Pushed at: 3 months ago - Stars: 2,695 - Forks: 94

CommanderLake/LMStud

Chat with GGUF LLMs using llama.cpp and a classic Windows Forms interface for minimal GUI bloat.

Language: C# - Size: 557 KB - Last synced at: 1 day ago - Pushed at: 2 days ago - Stars: 0 - Forks: 0

menloresearch/cortex.cpp

Local AI API Platform

Language: C++ - Size: 139 MB - Last synced at: 1 day ago - Pushed at: 7 days ago - Stars: 2,622 - Forks: 163

Josh-XT/AGiXT

AGiXT is a dynamic AI Agent Automation Platform that seamlessly orchestrates instruction management and complex task execution across diverse AI providers. Combining adaptive memory, smart features, and a versatile plugin system, AGiXT delivers efficient and comprehensive AI solutions.

Language: Python - Size: 168 MB - Last synced at: 1 day ago - Pushed at: 2 days ago - Stars: 2,974 - Forks: 398

vinhnx/VT.ai

VT.ai - Minimal multimodal AI chat app with dynamic conversation routing

Language: Python - Size: 2.43 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 72 - Forks: 9

mostlygeek/llama-swap

Model swapping for llama.cpp (or any local OpenAPI compatible server)

Language: Go - Size: 552 KB - Last synced at: 2 days ago - Pushed at: 5 days ago - Stars: 537 - Forks: 31

1b5d/llm-api

Run any Large Language Model behind a unified API

Language: Python - Size: 53.7 KB - Last synced at: 1 day ago - Pushed at: over 1 year ago - Stars: 169 - Forks: 27

yeeking/llamacpp-minimal-example

Minimal example of using llama.cpp as a library from C++

Language: C++ - Size: 198 KB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 0 - Forks: 0

AmpereComputingAI/llama.cpp

Ampere optimized llama.cpp

Language: Python - Size: 143 KB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 12 - Forks: 1

kelindar/search

Go library for embedded vector search and semantic embeddings using llama.cpp

Language: Go - Size: 714 KB - Last synced at: about 11 hours ago - Pushed at: about 1 month ago - Stars: 430 - Forks: 13

alexrozanski/LlamaChat

Chat with your favourite LLaMA models in a native macOS app

Language: Swift - Size: 14.6 MB - Last synced at: 1 day ago - Pushed at: almost 2 years ago - Stars: 1,501 - Forks: 62

Mobile-Artificial-Intelligence/maid

Maid is a cross-platform Flutter app for interfacing with GGUF / llama.cpp models locally, and with Ollama and OpenAI models remotely.

Language: Dart - Size: 114 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 1,973 - Forks: 212

karanIPS/claude-deep-research

Claude Deep Research config for Claude Code.

Size: 10.7 KB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 2 - Forks: 0

rish-1997/wsl-assistant

A fully auto configured, self-hosted local AI & database stack on Debian WSL2.

Language: Shell - Size: 123 KB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 1 - Forks: 0

SciSharp/LLamaSharp

A C#/.NET library to run LLM (🦙LLaMA/LLaVA) on your local device efficiently.

Language: C# - Size: 391 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 3,120 - Forks: 414

l29ah/llama-cpp-haskell

Haskell bindings for the llama.cpp llama-server

Language: Haskell - Size: 23.4 KB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 2 - Forks: 1

twinnydotdev/twinny

The most no-nonsense, locally or API-hosted AI code completion plugin for Visual Studio Code - like GitHub Copilot but 100% free.

Language: TypeScript - Size: 60.7 MB - Last synced at: 3 days ago - Pushed at: 4 days ago - Stars: 3,464 - Forks: 192

OEvortex/Webscout

Webscout is the all-in-one search and AI toolkit you need. Discover insights with Yep.com, DuckDuckGo, and Phind; access cutting-edge AI models; transcribe YouTube videos; generate temporary emails and phone numbers; perform text-to-speech conversions; and much more!

Language: Python - Size: 8.56 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 219 - Forks: 41

floneum/floneum

Instant, controllable, local pre-trained AI models in Rust

Language: Rust - Size: 257 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 1,833 - Forks: 96

nathanlesage/local-chat

LocalChat is a ChatGPT-like chat that runs on your computer

Language: TypeScript - Size: 2.69 MB - Last synced at: 1 day ago - Pushed at: 4 months ago - Stars: 103 - Forks: 4

morpheuslord/HackBot

AI-powered cybersecurity chatbot designed to provide helpful and accurate answers to your cybersecurity-related queries and also do code analysis and scan analysis.

Language: Python - Size: 56.6 KB - Last synced at: 1 day ago - Pushed at: 5 months ago - Stars: 300 - Forks: 47

xorbitsai/inference

Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you're empowered to run inference with any open-source language models, speech recognition models, and multimodal models, whether in the cloud, on-premises, or even on your laptop.

Language: Python - Size: 44.6 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 7,500 - Forks: 636

mrdbourke/mac-ml-speed-test

A few quick scripts focused on testing TensorFlow/PyTorch/Llama 2 on macOS.

Language: Jupyter Notebook - Size: 1.51 MB - Last synced at: 2 days ago - Pushed at: 11 months ago - Stars: 190 - Forks: 33

Mobile-Artificial-Intelligence/llama_sdk

lcpp is a dart implementation of llama.cpp used by the mobile artificial intelligence distribution (maid)

Language: Dart - Size: 1.63 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 74 - Forks: 18

ddh0/easy-llama

Text generation in Python, as easy as possible

Language: Python - Size: 1.29 MB - Last synced at: 1 day ago - Pushed at: 5 days ago - Stars: 58 - Forks: 3

habib-source/pdf-rag

PDF chatbot question and answer

Language: Python - Size: 438 KB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 0 - Forks: 0

poteminr/instruct-ner

Instruct LLMs for flat and nested NER. Fine-tuning Llama and Mistral models for instruction named entity recognition. (Instruction NER)

Language: Python - Size: 297 KB - Last synced at: 1 day ago - Pushed at: 12 months ago - Stars: 82 - Forks: 8

Nexesenex/croco.cpp Fork of LostRuins/koboldcpp

Croco.Cpp is a 3rd party testground for KoboldCPP, a simple one-file way to run various GGML/GGUF models with KoboldAI's UI. (for Croco.Cpp, in Cuda mode mainly!)

Language: C++ - Size: 272 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 103 - Forks: 3

gptme/gptme

Your agent in your terminal, equipped with local tools: writes code, uses the terminal, browses the web, vision.

Language: Python - Size: 14.4 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 3,724 - Forks: 301

J4NN0/llm-rag

LLMs prompt augmentation with RAG by integrating external custom data from a variety of sources, allowing chat with such documents

Language: Python - Size: 126 KB - Last synced at: 1 day ago - Pushed at: 9 months ago - Stars: 19 - Forks: 5

inferflow/inferflow

Inferflow is an efficient and highly configurable inference engine for large language models (LLMs).

Language: C++ - Size: 1.89 MB - Last synced at: 1 day ago - Pushed at: about 1 year ago - Stars: 242 - Forks: 25

mgonzs13/llama_ros

llama.cpp (GGUF LLMs) and llava.cpp (GGUF VLMs) for ROS 2

Language: C++ - Size: 6.95 MB - Last synced at: 1 day ago - Pushed at: 6 days ago - Stars: 196 - Forks: 30

michaelgiba/the-traitors

Simulation of the TV show The Traitors with Open Source LLMs

Language: HTML - Size: 3.71 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 3 - Forks: 0

joone/loz

Loz is a command-line tool that enables your preferred LLM to execute system commands and utilize Unix pipes, integrating AI capabilities with other Unix tools.

Language: TypeScript - Size: 1.51 MB - Last synced at: 1 day ago - Pushed at: 4 months ago - Stars: 260 - Forks: 15

asiff00/Orpheus-Local-TTS

Run Orpheus TTS locally.

Language: Python - Size: 6.84 KB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 1 - Forks: 0

Adriankhl/godot-llm-template

Godot LLM Template/Demo

Language: GDScript - Size: 2.79 MB - Last synced at: 1 day ago - Pushed at: 10 months ago - Stars: 25 - Forks: 2

331Dala/llm_local

Run LLM finance apps hyper fast on local machine.

Language: Python - Size: 7.81 KB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 0 - Forks: 0

InftyAI/llmaz

☸️ Easy, advanced inference platform for large language models on Kubernetes. 🌟 Star to support our work!

Language: Go - Size: 6.1 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 120 - Forks: 19

if-ai/ComfyUI-IF_AI_tools

ComfyUI-IF_AI_tools is a set of custom nodes for ComfyUI that allows you to generate prompts using a local Large Language Model (LLM) via Ollama. This tool enables you to enhance your image generation workflow by leveraging the power of language models.

Language: Python - Size: 18 MB - Last synced at: 6 days ago - Pushed at: about 1 month ago - Stars: 623 - Forks: 48

reorproject/reor

Private & local AI personal knowledge management app for high entropy people.

Language: JavaScript - Size: 92.9 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 7,827 - Forks: 464

samestrin/llm-interface

A simple NPM interface for seamlessly interacting with 36 Large Language Model (LLM) providers, including OpenAI, Anthropic, Google Gemini, Cohere, Hugging Face Inference, NVIDIA AI, Mistral AI, AI21 Studio, LLaMA.CPP, and Ollama, and hundreds of models.

Language: JavaScript - Size: 1.1 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 103 - Forks: 14

distantmagic/paddler

Stateful load balancer custom-tailored for llama.cpp 🏓🦙

Language: Rust - Size: 4.47 MB - Last synced at: 7 days ago - Pushed at: 17 days ago - Stars: 737 - Forks: 30

JohnSnowLabs/spark-nlp

State of the Art Natural Language Processing

Language: Scala - Size: 3.36 GB - Last synced at: 2 days ago - Pushed at: 3 days ago - Stars: 3,951 - Forks: 722

shubham0204/SmolChat-Android

Running any GGUF SLMs/LLMs locally, on-device in Android

Language: Kotlin - Size: 13.3 MB - Last synced at: 8 days ago - Pushed at: 9 days ago - Stars: 251 - Forks: 29

pythops/tenere

🤖 TUI interface for LLMs written in Rust

Language: Rust - Size: 642 KB - Last synced at: 7 days ago - Pushed at: 3 months ago - Stars: 505 - Forks: 22

RahulSChand/gpu_poor

Calculate token/s & GPU memory requirement for any LLM. Supports llama.cpp/ggml/bnb/QLoRA quantization

Language: JavaScript - Size: 1.56 MB - Last synced at: 7 days ago - Pushed at: 5 months ago - Stars: 1,284 - Forks: 68

andrewkchan/yalm

Yet Another Language Model: LLM inference in C++/CUDA, no libraries except for I/O

Language: C++ - Size: 396 KB - Last synced at: 8 days ago - Pushed at: 3 months ago - Stars: 282 - Forks: 29

mukel/llama3.java

Practical Llama 3 inference in Java

Language: Java - Size: 187 KB - Last synced at: 6 days ago - Pushed at: 3 months ago - Stars: 686 - Forks: 83

rajatasusual/llamabox

Run a full local RAG pipeline on your low-end, CPU-only Windows machine using WSL2 with Debian. Private, resilient, secure.

Language: Python - Size: 1.38 MB - Last synced at: 5 days ago - Pushed at: 9 days ago - Stars: 0 - Forks: 0

ngxson/wllama

WebAssembly binding for llama.cpp - Enabling on-browser LLM inference

Language: TypeScript - Size: 26.9 MB - Last synced at: 7 days ago - Pushed at: about 1 month ago - Stars: 653 - Forks: 36

rpidanny/llm-prompt-templates

Empower your LLM to do more than you ever thought possible with these state-of-the-art prompt templates.

Language: TypeScript - Size: 641 KB - Last synced at: 1 day ago - Pushed at: 7 months ago - Stars: 41 - Forks: 2

AstraBert/qdurllm

Search your favorite websites and chat with them, on your desktop🌐

Language: Python - Size: 1.06 MB - Last synced at: 8 days ago - Pushed at: 3 months ago - Stars: 30 - Forks: 2

Abhi5h3k/PrivateDocBot

📚 Local PDF-Integrated Chat Bot: Secure Conversations and Document Assistance with LLM-Powered Privacy

Language: Python - Size: 2.39 MB - Last synced at: 1 day ago - Pushed at: 27 days ago - Stars: 83 - Forks: 19

nekomeowww/ollama-operator

🚒 Yet another operator for running large language models on Kubernetes with ease. Powered by Ollama! 🐫

Language: Go - Size: 1.82 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 184 - Forks: 22

tinyBigGAMES/AIToolkit

AIToolkit - AI Construction Set

Language: Pascal - Size: 6.67 MB - Last synced at: 6 days ago - Pushed at: 2 months ago - Stars: 14 - Forks: 2

jeromeboivin/ollama-chat

A single file, customizable Python CLI tool for interacting with local Language Models, ensuring data privacy while providing conversation memory and extensibility through plugins and efficient Retrieval-Augmented Generation capabilities with ChromaDB integration. Also compatible with OpenAI API.

Language: Python - Size: 793 KB - Last synced at: 1 day ago - Pushed at: 11 days ago - Stars: 12 - Forks: 3

BrutalCoding/aub.ai

AubAI brings you on-device gen-AI capabilities, including offline text generation and more, directly within your app.

Language: Dart - Size: 119 MB - Last synced at: 5 days ago - Pushed at: 12 months ago - Stars: 279 - Forks: 24

Adriankhl/godot-llm

LLM in Godot

Language: C++ - Size: 175 MB - Last synced at: 9 days ago - Pushed at: 10 months ago - Stars: 176 - Forks: 11

llmware-ai/llmware

Unified framework for building enterprise RAG pipelines with small, specialized models

Language: Python - Size: 968 MB - Last synced at: 11 days ago - Pushed at: about 1 month ago - Stars: 12,689 - Forks: 1,818

saadkh1/DocQA-TextSummarization-App

A Streamlit app for document question answering and text summarization.

Language: Jupyter Notebook - Size: 190 KB - Last synced at: 7 days ago - Pushed at: over 1 year ago - Stars: 4 - Forks: 3

yportne13/chatbot-ui-llama.cpp

A static web UI for the llama.cpp server. The llama.cpp chat interface for everyone. Based on chatbot-ui.

Language: TypeScript - Size: 2.05 MB - Last synced at: 1 day ago - Pushed at: 6 months ago - Stars: 16 - Forks: 4

cycneuramus/signal-aichat 📦

An AI chatbot for Signal powered by Google Bard, Bing Chat, ChatGPT, HuggingChat, and llama.cpp

Language: Python - Size: 275 KB - Last synced at: 1 day ago - Pushed at: 12 months ago - Stars: 87 - Forks: 17

CentralFloridaAttorney/zmongo_retriever

Use data from MongoDB in LangChain, Llama and OpenAI

Language: Python - Size: 27.2 MB - Last synced at: 12 days ago - Pushed at: 12 days ago - Stars: 4 - Forks: 1

EZForever/llama.cpp-static

Static builds of llama.cpp (Currently only amd64 server builds are available)

Language: Dockerfile - Size: 50.8 KB - Last synced at: 1 day ago - Pushed at: 7 months ago - Stars: 0 - Forks: 0

matrixsmaster/ANNA

Automatic Neural Network Assistant (ANNA) is a very powerful cross-platform AI toolkit. It contains versatile CLI/GUI LLM tools optimized for local inference, and supports optional remote offloading.

Language: C - Size: 1.93 MB - Last synced at: 1 day ago - Pushed at: 12 days ago - Stars: 6 - Forks: 0

Freed-Wu/translate-shell

Translate text with Google, Bing, Youdao Zhiyun, Haici, StarDict, OpenAI, a local large language model, etc. at the same time from the CLI, GUI (GNU/Linux, Android, macOS and Windows), REPL, Python, shell and Vim.

Language: Python - Size: 452 KB - Last synced at: 1 day ago - Pushed at: 6 days ago - Stars: 39 - Forks: 5

xorbitsai/xllamacpp Fork of shakfu/cyllama

xllamacpp - a Python wrapper of llama.cpp

Language: C++ - Size: 4.14 MB - Last synced at: 13 days ago - Pushed at: 13 days ago - Stars: 31 - Forks: 4

innightwolfsleep/old_llm_telegram_bot

Connect llama-cpp, transformers or text-generation-webui to telegram bot api.

Language: Python - Size: 1.03 MB - Last synced at: 8 days ago - Pushed at: 7 months ago - Stars: 28 - Forks: 10

argonne-lcf/LLM-Inference-Bench

LLM-Inference-Bench

Language: Jupyter Notebook - Size: 11.2 MB - Last synced at: 11 days ago - Pushed at: 3 months ago - Stars: 39 - Forks: 4

Opla/opla

Empower Your Productivity with Local AI Assistants

Language: TypeScript - Size: 59.5 MB - Last synced at: 1 day ago - Pushed at: 6 days ago - Stars: 36 - Forks: 3

AgustinAllamanoCosta/Pulpero

Plugin for Neovim to explain code using a local AI

Language: Lua - Size: 979 KB - Last synced at: 7 days ago - Pushed at: 13 days ago - Stars: 1 - Forks: 0

xNul/code-llama-for-vscode

Use Code Llama with Visual Studio Code and the Continue extension. A local LLM alternative to GitHub Copilot.

Language: Python - Size: 10.7 KB - Last synced at: 1 day ago - Pushed at: 9 months ago - Stars: 568 - Forks: 32

sorohere/Hand-Pose-Detection

This project offers a versatile platform for hand-related tasks, including dataset generation and custom hand gesture detection using Google's MediaPipe library and accelerated real-time sign language translation with LLMs on edge devices.

Language: Python - Size: 821 MB - Last synced at: 1 day ago - Pushed at: 10 months ago - Stars: 13 - Forks: 2

shinomakoi/AI-Messenger

A QT GUI for large language models

Language: Python - Size: 231 KB - Last synced at: 1 day ago - Pushed at: over 1 year ago - Stars: 32 - Forks: 6

nerve-sparks/iris_android

IRIS is an Android app for interfacing with GGUF / llama.cpp models locally.

Language: Kotlin - Size: 9.3 MB - Last synced at: 1 day ago - Pushed at: 3 months ago - Stars: 192 - Forks: 18

AstraBert/PrAIvateSearch

Own your AI, search the web with it🌐😎

Language: Python - Size: 3.56 MB - Last synced at: 5 days ago - Pushed at: 3 months ago - Stars: 84 - Forks: 12

jabberjabberjabber/ImageIndexer

Creates an index of images, queries a local LLM and adds tags to the image metadata

Language: Python - Size: 28.1 MB - Last synced at: 15 days ago - Pushed at: 15 days ago - Stars: 159 - Forks: 7

saddam213/LLamaStack 📦

ASP.NET Core Web, WebApi & WPF implementations for LLama.cpp & LLamaSharp

Language: C# - Size: 8.84 MB - Last synced at: 5 days ago - Pushed at: over 1 year ago - Stars: 58 - Forks: 20

fynnfluegge/codeqai

Local first semantic code search and chat | Leverage custom copilots with fine-tuning datasets from code in Alpaca, Conversational, Completion and Instruction format

Language: Python - Size: 562 KB - Last synced at: 7 days ago - Pushed at: 2 months ago - Stars: 470 - Forks: 50

Atome-FE/llama-node 📦

Believe in AI democratization. llama for Node.js backed by llama-rs, llama.cpp and rwkv.cpp; works locally on your laptop CPU. Supports llama/alpaca/gpt4all/vicuna/rwkv models.

Language: Rust - Size: 30.4 MB - Last synced at: 2 days ago - Pushed at: over 1 year ago - Stars: 870 - Forks: 64

mrtrizer/UnityLlamaCpp

Llama.cpp in Unity, straightforward and clean

Language: C# - Size: 22.5 KB - Last synced at: 2 days ago - Pushed at: 11 months ago - Stars: 18 - Forks: 1

MorganRO8/Lucys_Labyrinth

A game made for a school project, dedicated to my daughter.

Language: C++ - Size: 97.1 MB - Last synced at: 14 days ago - Pushed at: about 1 year ago - Stars: 94 - Forks: 4

Dicklesworthstone/swiss_army_llama

A FastAPI service for semantic text search using precomputed embeddings and advanced similarity measures, with built-in support for various file types through textract.

Language: Python - Size: 7.25 MB - Last synced at: 7 days ago - Pushed at: about 2 months ago - Stars: 1,014 - Forks: 61

kspviswa/pyOllaMx

Your gateway to both Ollama & Apple MLX models

Language: Python - Size: 5.19 MB - Last synced at: 14 days ago - Pushed at: about 2 months ago - Stars: 115 - Forks: 8

AutonomicPerfectionist/PipeInfer

PipeInfer: Accelerating LLM Inference using Asynchronous Pipelined Speculation

Language: C++ - Size: 17.5 MB - Last synced at: 14 days ago - Pushed at: 5 months ago - Stars: 29 - Forks: 4

eliranwong/toolmate

ToolMate AI, developed by Eliran Wong, is a cutting-edge AI companion that seamlessly integrates agents, tools, and plugins to excel in conversations, generative work, and task execution. Supports custom workflow and plugins to automate multi-step actions.

Language: Python - Size: 40.2 MB - Last synced at: 1 day ago - Pushed at: about 1 month ago - Stars: 152 - Forks: 15

dieharders/obrew-studio-server

Obrew Studio - Server: A self-hostable machine learning engine. Build agents and schedule workflows private to you.

Language: Python - Size: 138 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 9 - Forks: 1