GitHub topics: llamafile
ad-si/cai
The fastest CLI tool for prompting LLMs. Including support for prompting several LLMs at once!
Language: Rust - Size: 1.37 MB - Last synced at: 6 days ago - Pushed at: about 2 months ago - Stars: 86 - Forks: 3

themaximalist/ai.js
AI Toolkit for Node.js (LLM, Image Generation, Embeddings, Vector Search)
Language: HTML - Size: 419 KB - Last synced at: 6 days ago - Pushed at: 26 days ago - Stars: 53 - Forks: 5

rabilrbl/llamafile-builder
A simple github actions script to build a llamafile and uploads to huggingface
Language: Python - Size: 60.5 KB - Last synced at: 6 days ago - Pushed at: over 1 year ago - Stars: 15 - Forks: 10

brakmic/langchain-experiments
All of my LangChain experiments
Language: Python - Size: 85 KB - Last synced at: about 12 hours ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

iverly/llamafile-docker
Distribute and run llamafile/LLMs with a single docker image.
Language: Dockerfile - Size: 59.6 KB - Last synced at: 6 days ago - Pushed at: 2 months ago - Stars: 73 - Forks: 9

Wannabeasmartguy/RAGENT
Probably one of the lightest native RAG + Agent apps out there,experience the power of Agent-powered models and Agent-driven knowledge bases in one click, without complex configuration.
Language: Python - Size: 1.68 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 26 - Forks: 8

hfahrudin/Dockerize-Llamafile
This repository dockerizes the LlamaFile application for consistent deployment and management across environments.
Language: Python - Size: 7.81 KB - Last synced at: 22 days ago - Pushed at: 8 months ago - Stars: 1 - Forks: 0

redBorder/redborder-llm
Main package for redborder-ng AI assistant.
Language: Ruby - Size: 64.5 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

lperezmo/simple-discord-bot
An over-engineered discord bot that relies too much on OpenAI's JSON mode + a free-to-run version based on llamacpp and stable diffusion
Language: Python - Size: 54.2 MB - Last synced at: 4 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

GR-Menon/BigBanyanTree
Gathering insights from Common Crawl using Apache Spark and LLMs.
Language: Jupyter Notebook - Size: 8.14 MB - Last synced at: 4 months ago - Pushed at: 8 months ago - Stars: 3 - Forks: 0

themaximalist/ModelDeployer
API Proxy for AI models, rate limiting, management and more!
Language: CSS - Size: 11 MB - Last synced at: 6 days ago - Pushed at: over 1 year ago - Stars: 4 - Forks: 0

kaust-generative-ai/local-deployment-of-generative-ai-models
Training materials on how to deploy generative AI models locally on your laptop or workstation.
Size: 6.94 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 2 - Forks: 0

fjcloud/llamapod
Language: Shell - Size: 31.3 KB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0

metaskills/llamafile-on-lambda
Serverless AI Inference with Gemma 2 using Mozilla's llamafile on AWS Lambda
Language: JavaScript - Size: 864 KB - Last synced at: 5 days ago - Pushed at: 12 months ago - Stars: 11 - Forks: 0

pAI-OS/fetch_llama_cpp
llama.cpp downloader that selects the latest and best available binaries for your system hardware (CPU & GPU).
Language: Python - Size: 88.9 KB - Last synced at: 6 days ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 1

shadowcz007/comfyui-sd-prompt-mixlab
Language: JavaScript - Size: 30.1 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 12 - Forks: 0

thekevinscott/Contortionist
Control what LLMs can, and can't, say
Language: TypeScript - Size: 854 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

mddunlap924/LLM-Inference-Serving
This repository demonstrates LLM execution on CPUs using packages like llamafile, emphasizing low-latency, high-throughput, and cost-effective benefits for inference and serving.
Language: Jupyter Notebook - Size: 6.4 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 2 - Forks: 0
