GitHub topics: localllama

Repositories

mostlygeek/llama-swap

Model swapping for llama.cpp (or any local OpenAPI compatible server)

Language: Go - Size: 938 KB - Last synced at: 3 days ago - Pushed at: 4 days ago - Stars: 757 - Forks: 38

sozercan/aikit

🏗️ Fine-tune, build, and deploy open-source LLMs easily!

Language: Go - Size: 4.26 MB - Last synced at: 3 days ago - Pushed at: 7 days ago - Stars: 451 - Forks: 39

yankeexe/llm-rag-with-reranker-demo

LLM RAG Application with Cross-Encoders Re-ranking for YouTube video 🎥

Language: Python - Size: 387 KB - Last synced at: 3 days ago - Pushed at: 4 months ago - Stars: 59 - Forks: 28

awaescher/OllamaSharp

The easiest way to use the Ollama API in .NET

Language: C# - Size: 26.6 MB - Last synced at: 5 days ago - Pushed at: 6 days ago - Stars: 963 - Forks: 131

sozercan/kubectl-ai

✨ Kubectl plugin to create manifests with LLMs

Language: Go - Size: 243 KB - Last synced at: 4 days ago - Pushed at: 4 months ago - Stars: 1,171 - Forks: 91

poloclub/wordflow

Social and customizable AI writing assistant! ✍️

Language: TypeScript - Size: 16.7 MB - Last synced at: 8 days ago - Pushed at: 11 months ago - Stars: 236 - Forks: 30

seyf1elislam/LocalLLM_OneClick_Colab

Run gguf LLM models in Latest Version TextGen-webui

Language: Jupyter Notebook - Size: 102 KB - Last synced at: 6 days ago - Pushed at: 7 months ago - Stars: 10 - Forks: 0

BrutalCoding/aub.ai

AubAI brings you on-device gen-AI capabilities, including offline text generation and more, directly within your app.

Language: Dart - Size: 119 MB - Last synced at: 7 days ago - Pushed at: about 1 year ago - Stars: 283 - Forks: 25

SqueezeAILab/KVQuant

[NeurIPS 2024] KVQuant: Towards 10 Million Context Length LLM Inference with KV Cache Quantization

Language: Python - Size: 19.8 MB - Last synced at: 11 days ago - Pushed at: 9 months ago - Stars: 348 - Forks: 30

moritztng/fltr

Like grep but for natural language questions. Based on Mistral 7B or Mixtral 8x7B.

Language: Rust - Size: 63.5 MB - Last synced at: 8 days ago - Pushed at: about 1 year ago - Stars: 376 - Forks: 8

hathibelagal-dev/llamashell

A powerful shell that's powered by a locally running LLM (ideally Llama 3.x or Qwen 2.5)

Language: Python - Size: 55.7 KB - Last synced at: 42 minutes ago - Pushed at: about 1 month ago - Stars: 0 - Forks: 0

awaescher/OllamaSharpConsole 📦

Full featured demo application for OllamaSharp

Language: C# - Size: 137 KB - Last synced at: 7 days ago - Pushed at: 7 months ago - Stars: 12 - Forks: 4

Related Keywords

localllama 17 llm 8 ai 7 llama 6 localllm 5 ollama 5 llamacpp 4 gpt 4 mistral 3 openai 3 large-language-models 3 cli 3 kubernetes 2 gpt-4 2 mixtral 2 ollama-api 2 chatgpt 2 gemini 2 python 2 openai-api 2 open-llm 2 rust 1 operating-system 1 mixtral-8x7b 1 agentic-ai 1 llm-integration 1 shell 1 terminal-based 1 transformers 1 ollama-app 1 qwen2 1 llama-2 1 grep-like 1 grep 1 transformer 1 text-generation 1 small-models 1 quantization 1 natural-language-processing 1 model-compression 1 efficient-model 1 efficient-inference 1 compression 1 pubdev 1 gemini-pro 1 local-llama 1 llm-inference 1 llama3-meta-ai 1 search-engine 1 search-algorithm 1 copilot 1 codecompletion 1 chatbot 1 xttsv2 1 voice-assistant 1 vad 1 tts 1 text-to-speech 1 speech-to-text 1 speech 1 pytorch 1 llamacpp-python 1 kokoro-tts 1 deep-learning 1 bot 1 assistant 1 ollamasharp 1 ollama-ui 1 ollama-gui 1 ollama-client 1 kubectl-plugins 1 kubectl 1 k8s 1 hacktoberfest 1 streaming 1 microsoft-extensions-ai 1 ichatclient 1 streamlit 1 retrieval-augmented-generation 1 re-ranking 1 rag 1 langchain 1 cross-encoders 1 awesome 1 open-source-llm 1 nvidia 1 inference 1 gemma 1 finetuning 1 fine-tuning 1 docker 1 buildkit 1 vllm 1 golang 1 on-device-ai 1 on-device 1 nlp 1 native-apps 1 mistral-7b 1 macos 1

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Repos