An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: llama-cpp-python

BorjaOteroFerreira/IALab-Suite

Tool for testing different large language models without code.

Language: Python - Size: 30.3 MB - Last synced at: about 13 hours ago - Pushed at: about 13 hours ago - Stars: 18 - Forks: 0

Woolverine94/biniou

A self-hosted web UI for 30+ generative AI models.

Language: Python - Size: 6.46 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 592 - Forks: 73

jasonacox/TinyLLM

Set up and run a local LLM and chatbot using consumer-grade hardware.

Language: JavaScript - Size: 516 KB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 267 - Forks: 31

TAO71-AI/I4.0

TAO71 I4.0 is an AI created by TAO71 in Python.

Language: Python - Size: 5.12 MB - Last synced at: 13 days ago - Pushed at: 13 days ago - Stars: 6 - Forks: 0

controlecidadao/samantha_ia

Experimental interface environment for open source LLM, designed to democratize the use of AI. Powered by llama-cpp, llama-cpp-python and Gradio.

Language: Python - Size: 23.2 MB - Last synced at: 16 days ago - Pushed at: 16 days ago - Stars: 11 - Forks: 1

pchsu-hsupc/Edge_AI_13th

This project optimizes the LLaMA-3.2B-Instruct model for fast inference on a single NVIDIA T4 GPU (16 GB), targeting high throughput and low perplexity for efficient edge deployment.

Language: Python - Size: 19.5 KB - Last synced at: 29 days ago - Pushed at: 29 days ago - Stars: 0 - Forks: 0

serhaturtis/AI-FlowLib

A Python framework for building structured, flow-based LLM applications with built-in pipeline management, model configuration, and validation capabilities.

Language: Python - Size: 1.57 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

svjack/Genshin-Impact-RAG

A Genshin Impact question-answering project supported by Qwen1.5-14B-Chat

Language: Python - Size: 83 KB - Last synced at: 9 days ago - Pushed at: about 1 year ago - Stars: 2 - Forks: 0

ossirytk/llama-cpp-chat-memory

Local character AI chatbot with chroma vector store memory and some scripts to process documents for Chroma

Language: Python - Size: 45.6 MB - Last synced at: 9 days ago - Pushed at: 9 months ago - Stars: 34 - Forks: 5

svjack/CodeActAgent-Gradio

Unofficial Gradio repo for the ICML 2024 paper "Executable Code Actions Elicit Better LLM Agents" by Xingyao Wang, Yangyi Chen, Lifan Yuan, Yizhe Zhang, Yunzhu Li, Hao Peng, Heng Ji.

Language: Jupyter Notebook - Size: 608 KB - Last synced at: 9 days ago - Pushed at: 9 months ago - Stars: 13 - Forks: 1

Axlfc/RuneScript

RuneScript is a cutting-edge scripting platform for developers, combining advanced script execution, version control integration, and an intuitive AI assistant. Empower your coding workflow with seamless automation, powerful tools, and a streamlined development experience inspired by the mystique of runes.

Language: Python - Size: 9.88 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 0 - Forks: 0

mlc-delgado/pytldr-oss

An open source, Gradio-based chatbot app that combines the best of retrieval augmented generation and prompt engineering into an intelligent assistant for modern professionals.

Language: Python - Size: 905 KB - Last synced at: 3 months ago - Pushed at: 11 months ago - Stars: 51 - Forks: 5

KT313/assistant_base

A custom framework for easy use of LLMs, VLMs, etc. supporting various modes and settings via web-ui

Language: Jupyter Notebook - Size: 1.18 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 1 - Forks: 0

Renatoelho/llama-cpp-local

Llama.cpp is a library developed in C++ for the efficient implementation of large language models, such as Meta's LLaMA. Optimized to run on a variety of platforms, including resource-constrained devices, it offers performance, inference speed, and efficient memory use, which are essential for running large models.

Language: Python - Size: 226 KB - Last synced at: 9 days ago - Pushed at: about 1 year ago - Stars: 2 - Forks: 1

MagnusS0/HuginnHears

Huginn Hears is a local app that transcribes and summarizes your meetings in Norwegian and English, using state-of-the-art models and open-source libraries. No cloud needed, run everything offline.

Language: Python - Size: 508 KB - Last synced at: 3 months ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 1

testli-ai/outlines-llama-cpp-python-streaming-output

This repository demonstrates how to use outlines and llama-cpp-python for structured JSON generation with streaming output, integrating llama.cpp for local model inference and outlines for schema-based text generation.

Language: Python - Size: 98.6 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 0 - Forks: 0

yueying-teng/generate-language-image-instruction-following-data

Mistral-assisted visual instruction data generation, following LLaVA

Language: Python - Size: 69.3 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 1 - Forks: 1

unixwzrd/oobabooga-macOS

Information on optimizing Python libraries specifically for oobabooga to take advantage of Apple Silicon and the Accelerate framework.

Language: Python - Size: 551 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 69 - Forks: 9

Ali-Fartoot/ProfessorConnected

ProfessorConnected is an API-powered tool that helps you discover professors with similar research interests by analyzing their arXiv publications using advanced NLP and vector search techniques.

Language: Python - Size: 159 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

PRITHIVSAKTHIUR/Triangulum

Triangulum 10B: Multilingual Large Language Models (LLMs)

Size: 2.02 MB - Last synced at: 4 months ago - Pushed at: 6 months ago - Stars: 1 - Forks: 0

laelhalawani/gguf_modeldb

A quick and optimized solution to manage Llama-based GGUF quantized models, download GGUF files, retrieve message formatting, add more models from HF repos, and more. It's super easy to use and comes prepacked with the best preconfigured open source models: dolphin phi-2 2.7b, mistral 7b v0.2, mixtral 8x7b v0.1, solar 10.7b and zephyr 3b

Language: Python - Size: 104 KB - Last synced at: 9 days ago - Pushed at: over 1 year ago - Stars: 12 - Forks: 2

woheller69/LLAMA_TK_CHAT

Simple chat interface for local AI using llama-cpp-python and llama-cpp-agent

Language: Python - Size: 127 KB - Last synced at: 3 months ago - Pushed at: 11 months ago - Stars: 8 - Forks: 1

perpendicularai/SeKernel_for_LLM_UI

This is the repository for the UI for the SeKernel_for_LLM module

Language: Python - Size: 1.91 MB - Last synced at: 4 months ago - Pushed at: 11 months ago - Stars: 1 - Forks: 0

Ankitajadhav611/Chatbot_LLM_MoinVonBremen (fork of Jayanths9/Chatbot_Moin_Von_Bremen)

LLM chat bot for multimodal processing

Language: Jupyter Notebook - Size: 10.7 MB - Last synced at: 5 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

mchiovaro/RAGify

A Gradio App for Retrieval-Augmented-Generation on PDFs

Language: Python - Size: 5.88 MB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 0 - Forks: 0

wambugu71/OfflineGPT-

Local GPT on llama.cpp models with a chat interface

Language: Python - Size: 11.7 KB - Last synced at: 29 days ago - Pushed at: about 1 year ago - Stars: 1 - Forks: 0

svjack/Genshin-Impact-Character-Chat

Genshin Impact character chat models tuned with LoRA on an LLM

Language: Python - Size: 204 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

noir55/gguf_simple_webui

A simple web interface for GGUF-format LLMs run with llama-cpp-python (llama.cpp).

Language: Python - Size: 296 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 1

abhie7/MINed-Hacakthon

MINeD Hackathon 2024 - Project

Language: Jupyter Notebook - Size: 11.3 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

cronoimpius/CybersecProject

Repository for the Cybersecurity-M project course of professor M. Colajanni

Language: TeX - Size: 122 MB - Last synced at: 9 days ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

007prateekd/fin-edubot

A financial chatbot powered by an LLM and retrieval-augmented generation.

Language: Jupyter Notebook - Size: 126 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0