GitHub topics: llama-cpp-python
BorjaOteroFerreira/IALab-Suite
Tool for test diferents large language models without code.
Language: Python - Size: 30.3 MB - Last synced at: about 13 hours ago - Pushed at: about 13 hours ago - Stars: 18 - Forks: 0

Woolverine94/biniou
a self-hosted webui for 30+ generative ai
Language: Python - Size: 6.46 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 592 - Forks: 73

jasonacox/TinyLLM
Setup and run a local LLM and Chatbot using consumer grade hardware.
Language: JavaScript - Size: 516 KB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 267 - Forks: 31

TAO71-AI/I4.0
TAO71 I4.0 is an AI created by TAO71 in Python.
Language: Python - Size: 5.12 MB - Last synced at: 13 days ago - Pushed at: 13 days ago - Stars: 6 - Forks: 0

controlecidadao/samantha_ia
Experimental interface environment for open source LLM, designed to democratize the use of AI. Powered by llama-cpp, llama-cpp-python and Gradio.
Language: Python - Size: 23.2 MB - Last synced at: 16 days ago - Pushed at: 16 days ago - Stars: 11 - Forks: 1

pchsu-hsupc/Edge_AI_13th
This project optimizes the LLaMA-3.2B-Instruct model for fast inference on a single NVIDIA T4 GPU (16 GB), targeting high throughput and low perplexity for efficient edge deployment.
Language: Python - Size: 19.5 KB - Last synced at: 29 days ago - Pushed at: 29 days ago - Stars: 0 - Forks: 0

serhaturtis/AI-FlowLib
A Python framework for building structured, flow-based LLM applications with built-in pipeline management, model configuration, and validation capabilities.
Language: Python - Size: 1.57 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

svjack/Genshin-Impact-RAG
A Genshin Impact Question Answer Project supported by Qwen1.5-14B-Chat
Language: Python - Size: 83 KB - Last synced at: 9 days ago - Pushed at: about 1 year ago - Stars: 2 - Forks: 0

ossirytk/llama-cpp-chat-memory
Local character AI chatbot with chroma vector store memory and some scripts to process documents for Chroma
Language: Python - Size: 45.6 MB - Last synced at: 9 days ago - Pushed at: 9 months ago - Stars: 34 - Forks: 5

svjack/CodeActAgent-Gradio
UnOfficial Gradio Repo for ICML 2024 paper "Executable Code Actions Elicit Better LLM Agents" by Xingyao Wang, Yangyi Chen, Lifan Yuan, Yizhe Zhang, Yunzhu Li, Hao Peng, Heng Ji.
Language: Jupyter Notebook - Size: 608 KB - Last synced at: 9 days ago - Pushed at: 9 months ago - Stars: 13 - Forks: 1

Axlfc/RuneScript
RuneScript is a cutting-edge scripting platform for developers, combining advanced script execution, version control integration, and an intuitive AI assistant. Empower your coding workflow with seamless automation, powerful tools, and a streamlined development experience inspired by the mystique of runes.
Language: Python - Size: 9.88 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

mlc-delgado/pytldr-oss
An open source, Gradio-based chatbot app that combines the best of retrieval augmented generation and prompt engineering into an intelligent assistant for modern professionals.
Language: Python - Size: 905 KB - Last synced at: 3 months ago - Pushed at: 11 months ago - Stars: 51 - Forks: 5

KT313/assistant_base
A custom framework for easy use of LLMs, VLMs, etc. supporting various modes and settings via web-ui
Language: Jupyter Notebook - Size: 1.18 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 1 - Forks: 0

Renatoelho/llama-cpp-local
Llama.cpp é uma biblioteca desenvolvida em C++ para a implementação eficiente de grandes modelos de linguagem, como o LLaMA da Meta. Otimizada para rodar em diversas plataformas, incluindo dispositivos com recursos limitados, oferece performance, velocidade de inferência e uso eficiente da memória, essenciais para a execução de grandes. modelos
Language: Python - Size: 226 KB - Last synced at: 9 days ago - Pushed at: about 1 year ago - Stars: 2 - Forks: 1

MagnusS0/HuginnHears
Huginn Hears is a local app that transcribes and summarizes your meetings in Norwegian and English, using state-of-the-art models and open-source libraries. No cloud needed, run everything offline.
Language: Python - Size: 508 KB - Last synced at: 3 months ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 1

testli-ai/outlines-llama-cpp-python-streaming-output
This repository demonstrates how to use outlines and llama-cpp-python for structured JSON generation with streaming output, integrating llama.cpp for local model inference and outlines for schema-based text generation.
Language: Python - Size: 98.6 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

yueying-teng/generate-language-image-instruction-following-data
Mistral assisted visual instruction data generation by following LLaVA
Language: Python - Size: 69.3 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 1 - Forks: 1

unixwzrd/oobabooga-macOS
Information on optimizing python libraries specifically for oobabooga to take advantage of Apple Silicon and Accelerate Framework.
Language: Python - Size: 551 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 69 - Forks: 9

Ali-Fartoot/ProfessorConnected
ProfessorConnected is an API-powered tool that helps you discover professors with similar research interests by analyzing their arXiv publications using advanced NLP and vector search techniques.
Language: Python - Size: 159 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

PRITHIVSAKTHIUR/Triangulum
Triangulum 10B: Multilingual Large Language Models (LLMs)
Size: 2.02 MB - Last synced at: 4 months ago - Pushed at: 6 months ago - Stars: 1 - Forks: 0

laelhalawani/gguf_modeldb
A quick and optimized solution to manage llama based gguf quantized models, download gguf files, retreive messege formatting, add more models from hf repos and more. It's super easy to use and comes prepacked with best preconfigured open source models: dolphin phi-2 2.7b, mistral 7b v0.2, mixtral 8x7b v0.1, solar 10.7b and zephyr 3b
Language: Python - Size: 104 KB - Last synced at: 9 days ago - Pushed at: over 1 year ago - Stars: 12 - Forks: 2

woheller69/LLAMA_TK_CHAT
Simple chat interface for local AI using llama-cpp-python and llama-cpp-agent
Language: Python - Size: 127 KB - Last synced at: 3 months ago - Pushed at: 11 months ago - Stars: 8 - Forks: 1

perpendicularai/SeKernel_for_LLM_UI
This is the repository for the UI for the SeKernel_for_LLM module
Language: Python - Size: 1.91 MB - Last synced at: 4 months ago - Pushed at: 11 months ago - Stars: 1 - Forks: 0

Ankitajadhav611/Chatbot_LLM_MoinVonBremen Fork of Jayanths9/Chatbot_Moin_Von_Bremen
LLM chat bot for multimodal processing
Language: Jupyter Notebook - Size: 10.7 MB - Last synced at: 5 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

mchiovaro/RAGify
A Gradio App for Retrieval-Augmented-Generation on PDFs
Language: Python - Size: 5.88 MB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 0 - Forks: 0

wambugu71/OfflineGPT-
Local gpt in llama.cpp models with chat interface
Language: Python - Size: 11.7 KB - Last synced at: 29 days ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

svjack/Genshin-Impact-Character-Chat
Genshin Impact Character Chat Models tuned by Lora on LLM
Language: Python - Size: 204 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

noir55/gguf_simple_webui
llama-cpp-python(llama.cpp)で実行するGGUF形式のLLM用の簡易Webインタフェースです。
Language: Python - Size: 296 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 1

abhie7/MINed-Hacakthon
MINeD Hackathon 2024 - Project
Language: Jupyter Notebook - Size: 11.3 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

cronoimpius/CybersecProject
Repository for the Cybersecurity-M project course of professor M. Colajanni
Language: TeX - Size: 122 MB - Last synced at: 9 days ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

007prateekd/fin-edubot
A financial chatbot powered by an LLM and retrieval-augmented generation.
Language: Jupyter Notebook - Size: 126 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0
