GitHub topics: llamafile

Repositories

ad-si/cai

The fastest CLI tool for prompting LLMs. Including support for prompting several LLMs at once!

Language: Rust - Size: 1.37 MB - Last synced at: 6 days ago - Pushed at: about 2 months ago - Stars: 86 - Forks: 3

themaximalist/ai.js

AI Toolkit for Node.js (LLM, Image Generation, Embeddings, Vector Search)

Language: HTML - Size: 419 KB - Last synced at: 6 days ago - Pushed at: 26 days ago - Stars: 53 - Forks: 5

rabilrbl/llamafile-builder

A simple github actions script to build a llamafile and uploads to huggingface

Language: Python - Size: 60.5 KB - Last synced at: 6 days ago - Pushed at: over 1 year ago - Stars: 15 - Forks: 10

brakmic/langchain-experiments

All of my LangChain experiments

Language: Python - Size: 85 KB - Last synced at: about 12 hours ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

iverly/llamafile-docker

Distribute and run llamafile/LLMs with a single docker image.

Language: Dockerfile - Size: 59.6 KB - Last synced at: 6 days ago - Pushed at: 2 months ago - Stars: 73 - Forks: 9

Wannabeasmartguy/RAGENT

Probably one of the lightest native RAG + Agent apps out there，experience the power of Agent-powered models and Agent-driven knowledge bases in one click, without complex configuration.

Language: Python - Size: 1.68 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 26 - Forks: 8

hfahrudin/Dockerize-Llamafile

This repository dockerizes the LlamaFile application for consistent deployment and management across environments.

Language: Python - Size: 7.81 KB - Last synced at: 22 days ago - Pushed at: 8 months ago - Stars: 1 - Forks: 0

redBorder/redborder-llm

Main package for redborder-ng AI assistant.

Language: Ruby - Size: 64.5 KB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

lperezmo/simple-discord-bot

An over-engineered discord bot that relies too much on OpenAI's JSON mode + a free-to-run version based on llamacpp and stable diffusion

Language: Python - Size: 54.2 MB - Last synced at: 4 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

GR-Menon/BigBanyanTree

Gathering insights from Common Crawl using Apache Spark and LLMs.

Language: Jupyter Notebook - Size: 8.14 MB - Last synced at: 4 months ago - Pushed at: 8 months ago - Stars: 3 - Forks: 0

themaximalist/ModelDeployer

API Proxy for AI models, rate limiting, management and more!

Language: CSS - Size: 11 MB - Last synced at: 6 days ago - Pushed at: over 1 year ago - Stars: 4 - Forks: 0

kaust-generative-ai/local-deployment-of-generative-ai-models

Training materials on how to deploy generative AI models locally on your laptop or workstation.

Size: 6.94 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 2 - Forks: 0

fjcloud/llamapod

Language: Shell - Size: 31.3 KB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0

metaskills/llamafile-on-lambda

Serverless AI Inference with Gemma 2 using Mozilla's llamafile on AWS Lambda

Language: JavaScript - Size: 864 KB - Last synced at: 5 days ago - Pushed at: 12 months ago - Stars: 11 - Forks: 0

pAI-OS/fetch_llama_cpp

llama.cpp downloader that selects the latest and best available binaries for your system hardware (CPU & GPU).

Language: Python - Size: 88.9 KB - Last synced at: 6 days ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 1

shadowcz007/comfyui-sd-prompt-mixlab

Language: JavaScript - Size: 30.1 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 12 - Forks: 0

thekevinscott/Contortionist

Control what LLMs can, and can't, say

Language: TypeScript - Size: 854 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

mddunlap924/LLM-Inference-Serving

This repository demonstrates LLM execution on CPUs using packages like llamafile, emphasizing low-latency, high-throughput, and cost-effective benefits for inference and serving.

Language: Jupyter Notebook - Size: 6.4 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 2 - Forks: 0

Related Keywords

llamafile 18 ai 8 llm 6 llama 5 openai 4 llamacpp 4 ollama 3 gpt-4 3 llm-inference 2 groq 2 docker 2 claude 2 python 2 gpt-3 2 deployment 2 docker-compose 1 apache-spark 1 models 1 carpentries-incubator 1 english 1 stable-diffusion-webui 1 stable-diffusion 1 stable 1 openai-api 1 google-search-api 1 duckduckgo-search 1 duckduckgo-api 1 discord-py-bot 1 discord-py 1 aws-s3 1 vllm 1 llms 1 llm-serving 1 large-language-models 1 deepspeed 1 transformersjs 1 gbnf 1 comfyui 1 ggml 1 lambda 1 gemma2 1 gemma 1 aws-lambda 1 openshift 1 pre-alpha 1 llama-cpp 1 lesson 1 generative-ai 1 llamafile-builder 1 llama2 1 vectordb 1 image-g 1 embeddi 1 claude-ai 1 rust 1 prompt 1 ml 1 mistral 1 machine-learning 1 llama3 1 gpt-4o 1 gpt 1 cli 1 chatgpt 1 anthropic 1 rpm 1 redborder-ng 1 autodelivery 1 artificial-intelligence 1 python-packages 1 llmops 1 containerization 1 boilerplate 1 rag 1 local-development 1 langchian 1 azure 1 autogen 1 agent 1 langchain 1 huggingface 1 llamafile-server 1

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Repos