Topic: "llmops"
vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
Language: Python - Size: 79.7 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 57,561 - Forks: 9,999

pathwaycom/llm-app
Ready-to-run cloud templates for RAG, AI pipelines, and enterprise search with live data. 🐳Docker-friendly.⚡Always in sync with Sharepoint, Google Drive, S3, Kafka, PostgreSQL, real-time data APIs, and more.
Language: Jupyter Notebook - Size: 59.8 MB - Last synced at: 6 days ago - Pushed at: about 1 month ago - Stars: 33,130 - Forks: 900

BerriAI/litellm
Python SDK, Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sagemaker, HuggingFace, Replicate, Groq]
Language: Python - Size: 536 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 28,609 - Forks: 4,067

ComposioHQ/composio
Composio equips your AI agents & LLMs with 100+ high-quality integrations via function calling
Language: TypeScript - Size: 929 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 25,708 - Forks: 4,357

mlflow/mlflow
The open source developer platform to build AI/LLM applications and models with confidence. Enhance your AI applications with end-to-end tracking, observability, and evaluations, all in one integrated platform.
Language: Python - Size: 857 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 21,971 - Forks: 4,807

jina-ai/serve
☁️ Build multimodal AI applications with cloud-native stack
Language: Python - Size: 1.57 GB - Last synced at: 5 days ago - Pushed at: 6 months ago - Stars: 21,731 - Forks: 2,233

liguodongiot/llm-action
本项目旨在分享大模型相关技术原理以及实战经验(大模型工程化、大模型应用落地)
Language: HTML - Size: 23.9 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 19,248 - Forks: 2,295

TransformerOptimus/SuperAGI
<⚡️> SuperAGI - A dev-first open source autonomous AI agent framework. Enabling developers to build, manage & run useful autonomous agents quickly and reliably.
Language: Python - Size: 60.5 MB - Last synced at: 4 months ago - Pushed at: 8 months ago - Stars: 16,311 - Forks: 1,968

raga-ai-hub/RagaAI-Catalyst
Python SDK for Agent AI Observability, Monitoring and Evaluation Framework. Includes features like agent, llm and tools tracing, debugging multi-agentic system, self-hosted dashboard and advanced analytics with timeline and execution graph view
Language: Python - Size: 55.8 MB - Last synced at: 11 days ago - Pushed at: 30 days ago - Stars: 16,050 - Forks: 3,708

langfuse/langfuse
🪢 Open source LLM engineering platform: LLM Observability, metrics, evals, prompt management, playground, datasets. Integrates with OpenTelemetry, Langchain, OpenAI SDK, LiteLLM, and more. 🍊YC W23
Language: TypeScript - Size: 41.8 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 15,892 - Forks: 1,479

comet-ml/opik
Debug, evaluate, and monitor your LLM applications, RAG systems, and agentic workflows with comprehensive tracing, automated evaluations, and production-ready dashboards.
Language: Python - Size: 292 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 13,486 - Forks: 946

bentoml/OpenLLM
Run any open-source LLMs, such as DeepSeek and Llama, as OpenAI compatible API endpoint in the cloud.
Language: Python - Size: 41.1 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 11,753 - Forks: 765

explodinggradients/ragas
Supercharge Your LLM Application Evaluations 🚀
Language: Python - Size: 43.7 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 10,631 - Forks: 1,063

tensorzero/tensorzero
TensorZero is an open-source stack for industrial-grade LLM applications. It unifies an LLM gateway, observability, optimization, evaluation, and experimentation.
Language: Rust - Size: 112 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 10,258 - Forks: 680

dataelement/bisheng
BISHENG is an open LLM devops platform for next generation Enterprise AI applications. Powerful and comprehensive features include: GenAI workflow, RAG, Agent, Unified model management, Evaluation, SFT, Dataset Management, Enterprise-level System Management, Observability and more.
Language: TypeScript - Size: 76.7 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 9,597 - Forks: 1,563

Netflix/metaflow
Build, Manage and Deploy AI/ML Systems
Language: Python - Size: 44.6 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 9,455 - Forks: 876

Portkey-AI/gateway
A blazing fast AI Gateway with integrated guardrails. Route to 200+ LLMs, 50+ AI Guardrails with 1 fast & friendly API.
Language: TypeScript - Size: 62.4 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 9,316 - Forks: 707

promptfoo/promptfoo
Test your prompts, agents, and RAGs. AI Red teaming, pentesting, and vulnerability scanning for LLMs. Compare performance of GPT, Claude, Gemini, Llama, and more. Simple declarative configs with command line and CI/CD integration.
Language: TypeScript - Size: 285 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 8,334 - Forks: 691

bentoml/BentoML
The easiest way to serve AI apps and models - Build Model Inference APIs, Job queues, LLM apps, Multi-model pipelines, and more!
Language: Python - Size: 98.3 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 8,053 - Forks: 870

Arize-ai/phoenix
AI Observability & Evaluation
Language: Jupyter Notebook - Size: 355 MB - Last synced at: about 10 hours ago - Pushed at: about 12 hours ago - Stars: 6,933 - Forks: 569

evidentlyai/evidently
Evidently is an open-source ML and LLM observability framework. Evaluate, test, and monitor any AI-powered system or data pipeline. From tabular data to Gen AI. 100+ metrics.
Language: Jupyter Notebook - Size: 320 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 6,578 - Forks: 722

traceloop/openllmetry
Open-source observability for your LLM application, based on OpenTelemetry
Language: Python - Size: 39.7 MB - Last synced at: 1 day ago - Pushed at: 2 days ago - Stars: 6,381 - Forks: 796

tensorchord/Awesome-LLMOps
An awesome & curated list of best LLMOps tools for developers
Language: Shell - Size: 190 KB - Last synced at: 2 days ago - Pushed at: about 1 month ago - Stars: 5,281 - Forks: 505

superduper-io/superduper
Superduper: End-to-end framework for building custom AI applications and agents.
Language: Python - Size: 73.8 MB - Last synced at: 7 days ago - Pushed at: 11 days ago - Stars: 5,173 - Forks: 520

coze-dev/coze-loop
Next-generation AI Agent Optimization Platform: Cozeloop addresses challenges in AI agent development by providing full-lifecycle management capabilities from development, debugging, and evaluation to monitoring.
Language: Go - Size: 11 MB - Last synced at: about 5 hours ago - Pushed at: about 5 hours ago - Stars: 4,873 - Forks: 637

zenml-io/zenml
ZenML 🙏: MLOps for Reliable AI: from Classical AI to Agents. https://zenml.io.
Language: Python - Size: 696 MB - Last synced at: about 6 hours ago - Pushed at: about 6 hours ago - Stars: 4,872 - Forks: 538

Giskard-AI/giskard-oss
🐢 Open-Source Evaluation & Testing library for LLM Agents
Language: Python - Size: 176 MB - Last synced at: about 20 hours ago - Pushed at: about 23 hours ago - Stars: 4,850 - Forks: 354

tencentmusic/cube-studio
cube studio开源云原生一站式机器学习/深度学习/大模型AI平台,mlops算法链路全流程,算力租赁平台,notebook在线开发,拖拉拽任务流pipeline编排,多机多卡分布式训练,超参搜索,推理服务VGPU虚拟化,边缘计算,标注平台自动化标注,deepseek等大模型sft微调/奖励模型/强化学习训练,vllm/ollama/mindie大模型多机推理,私有知识库,AI模型市场,支持国产cpu/gpu/npu 昇腾生态,支持RDMA,支持pytorch/tf/mxnet/deepspeed/paddle/colossalai/horovod/ray/volcano等分布式
Language: Jupyter Notebook - Size: 149 MB - Last synced at: 12 days ago - Pushed at: 13 days ago - Stars: 4,532 - Forks: 793

Helicone/helicone
🧊 Open source LLM observability platform. One line of code to monitor, evaluate, and experiment. YC W23 🍓
Language: TypeScript - Size: 711 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 4,482 - Forks: 432

0xPlaygrounds/rig
⚙️🦀 Build modular and scalable LLM Applications in Rust
Language: Rust - Size: 16 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 4,381 - Forks: 476

truefoundry/cognita
RAG (Retrieval Augmented Generation) Framework for building modular, open source applications for production by TrueFoundry
Language: Python - Size: 50.4 MB - Last synced at: 11 days ago - Pushed at: 12 days ago - Stars: 4,195 - Forks: 348

decodingml/llm-twin-course
🤖 𝗟𝗲𝗮𝗿𝗻 for 𝗳𝗿𝗲𝗲 how to 𝗯𝘂𝗶𝗹𝗱 an end-to-end 𝗽𝗿𝗼𝗱𝘂𝗰𝘁𝗶𝗼𝗻-𝗿𝗲𝗮𝗱𝘆 𝗟𝗟𝗠 & 𝗥𝗔𝗚 𝘀𝘆𝘀𝘁𝗲𝗺 using 𝗟𝗟𝗠𝗢𝗽𝘀 best practices: ~ 𝘴𝘰𝘶𝘳𝘤𝘦 𝘤𝘰𝘥𝘦 + 12 𝘩𝘢𝘯𝘥𝘴-𝘰𝘯 𝘭𝘦𝘴𝘴𝘰𝘯𝘴
Language: Python - Size: 9.78 MB - Last synced at: 4 months ago - Pushed at: 5 months ago - Stars: 3,887 - Forks: 645

PacktPublishing/LLM-Engineers-Handbook
The LLM's practical guide: From the fundamentals to deploying advanced LLM and RAG apps to AWS using LLMOps best practices
Language: Python - Size: 4.48 MB - Last synced at: about 2 months ago - Pushed at: 6 months ago - Stars: 3,753 - Forks: 848

katanemo/archgw
The smart edge and AI gateway for agents. Arch is a high-performance proxy server that handles the low-level work in building agents: like applying guardrails, routing prompts to the right agent, and unifying access to LLMs, etc. Natively designed to process prompts, it's framework-agnostic and helps you build agents faster.
Language: Rust - Size: 23.3 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 3,597 - Forks: 199

predibase/lorax
Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs
Language: Python - Size: 6.62 MB - Last synced at: 4 days ago - Pushed at: 4 months ago - Stars: 3,409 - Forks: 263

iusztinpaul/hands-on-llms 📦
🦖 𝗟𝗲𝗮𝗿𝗻 about 𝗟𝗟𝗠𝘀, 𝗟𝗟𝗠𝗢𝗽𝘀, and 𝘃𝗲𝗰𝘁𝗼𝗿 𝗗𝗕𝘀 for free by designing, training, and deploying a real-time financial advisor LLM system ~ 𝘴𝘰𝘶𝘳𝘤𝘦 𝘤𝘰𝘥𝘦 + 𝘷𝘪𝘥𝘦𝘰 & 𝘳𝘦𝘢𝘥𝘪𝘯𝘨 𝘮𝘢𝘵𝘦𝘳𝘪𝘢𝘭𝘴
Language: Jupyter Notebook - Size: 25.7 MB - Last synced at: about 1 month ago - Pushed at: 9 months ago - Stars: 3,329 - Forks: 534

Josh-XT/AGiXT
AGiXT is a dynamic AI Agent Automation Platform that seamlessly orchestrates instruction management and complex task execution across diverse AI providers. Combining adaptive memory, smart features, and a versatile plugin system, AGiXT delivers efficient and comprehensive AI solutions.
Language: Python - Size: 168 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 3,075 - Forks: 428

pezzolabs/pezzo
🕹️ Open-source, developer-first LLMOps platform designed to streamline prompt design, version management, instant delivery, collaboration, troubleshooting, observability and more.
Language: TypeScript - Size: 26.7 MB - Last synced at: 1 day ago - Pushed at: 3 months ago - Stars: 3,055 - Forks: 258

truera/trulens
Evaluation and Tracking for LLM Experiments and AI Agents
Language: Python - Size: 344 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 2,764 - Forks: 220

OpenPipe/OpenPipe
Turn expensive prompts into cheap fine-tuned models
Language: TypeScript - Size: 11.6 MB - Last synced at: about 17 hours ago - Pushed at: over 1 year ago - Stars: 2,730 - Forks: 160

ianarawjo/ChainForge
An open-source visual programming environment for battle-testing prompts to LLMs.
Language: TypeScript - Size: 184 MB - Last synced at: 27 days ago - Pushed at: 28 days ago - Stars: 2,721 - Forks: 224

GoogleCloudPlatform/agent-starter-pack
A collection of production-ready Generative AI Agent templates built for Google Cloud. It accelerates development by providing a holistic, production-ready solution, addressing common challenges (Deployment & Operations, Evaluation, Customization, Observability) in building and deploying GenAI agents.
Language: Python - Size: 21.1 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 2,464 - Forks: 757

langwatch/langwatch
The open LLM Ops platform - Traces, Analytics, Evaluations, Datasets and Prompt Optimization ✨
Language: TypeScript - Size: 32.3 MB - Last synced at: 1 day ago - Pushed at: 2 days ago - Stars: 2,458 - Forks: 226

uptrain-ai/uptrain
UpTrain is an open-source unified platform to evaluate and improve Generative AI applications. We provide grades for 20+ preconfigured checks (covering language, code, embedding use-cases), perform root cause analysis on failure cases and give insights on how to resolve them.
Language: Python - Size: 36.9 MB - Last synced at: 17 days ago - Pushed at: about 1 year ago - Stars: 2,316 - Forks: 198

dot-agent/nextpy
🤖Self-Modifying Framework from the Future 🔮 World's First AMS
Language: Python - Size: 56.6 MB - Last synced at: 5 days ago - Pushed at: over 1 year ago - Stars: 2,316 - Forks: 178

lmnr-ai/lmnr
Laminar - open-source all-in-one platform for engineering AI products. Create data flywheel for your AI app. Traces, Evals, Datasets, Labels. YC S24.
Language: TypeScript - Size: 37.3 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 2,259 - Forks: 135

apache/hamilton
Apache Hamilton helps data scientists and engineers define testable, modular, self-documenting dataflows, that encode lineage/tracing and metadata. Runs and scales everywhere python does.
Language: Jupyter Notebook - Size: 103 MB - Last synced at: about 13 hours ago - Pushed at: 3 days ago - Stars: 2,257 - Forks: 158

bionic-gpt/bionic-gpt
Bionic is an on-premise replacement for ChatGPT, offering the advantages of Generative AI while maintaining strict data confidentiality
Language: Rust - Size: 121 MB - Last synced at: 13 days ago - Pushed at: about 1 month ago - Stars: 2,234 - Forks: 226

tensorchord/envd
🏕️ Reproducible development environment
Language: Go - Size: 3.81 MB - Last synced at: 5 days ago - Pushed at: 11 days ago - Stars: 2,139 - Forks: 161

microsoft/aici
AICI: Prompts as (Wasm) Programs
Language: Rust - Size: 9.71 MB - Last synced at: 4 months ago - Pushed at: 8 months ago - Stars: 2,027 - Forks: 83

trypromptly/LLMStack
No-code multi-agent framework to build LLM Agents, workflows and applications with your data
Language: Python - Size: 103 MB - Last synced at: 8 days ago - Pushed at: 9 months ago - Stars: 2,026 - Forks: 298

protectai/llm-guard
The Security Toolkit for LLM Interactions
Language: Python - Size: 4.13 MB - Last synced at: 12 days ago - Pushed at: 18 days ago - Stars: 2,018 - Forks: 271

openlit/openlit
Open source platform for AI Engineering: OpenTelemetry-native LLM Observability, GPU Monitoring, Guardrails, Evaluations, Prompt Management, Vault, Playground. 🚀💻 Integrates with 50+ LLM Providers, VectorDBs, Agent Frameworks and GPUs.
Language: Python - Size: 49 MB - Last synced at: 10 days ago - Pushed at: 10 days ago - Stars: 1,852 - Forks: 179

apache/burr
Build applications that make decisions (chatbots, agents, simulations, etc...). Monitor, trace, persist, and execute on your own infrastructure.
Language: Python - Size: 39.6 MB - Last synced at: about 13 hours ago - Pushed at: 4 days ago - Stars: 1,787 - Forks: 93

genieincodebottle/generative-ai
Comprehensive resources on Generative AI, including a detailed roadmap, projects, use cases, interview preparation, and coding preparation.
Language: Jupyter Notebook - Size: 92.7 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 1,446 - Forks: 354

AgentEra/Agently
[GenAI Application Development Framework] 🚀 Build GenAI application quick and easy 💬 Easy to interact with GenAI agent in code using structure data and chained-calls syntax 🧩 Use Agently Workflow to manage complex GenAI working logic 🔀 Switch to any model without rewrite application code
Language: Python - Size: 29.3 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 1,418 - Forks: 160

protectai/rebuff 📦
LLM Prompt Injection Detector
Language: TypeScript - Size: 6.99 MB - Last synced at: 3 days ago - Pushed at: about 1 year ago - Stars: 1,344 - Forks: 113

ThousandBirdsInc/chidori
A reactive runtime for building durable AI agents
Language: Rust - Size: 39.4 MB - Last synced at: 2 days ago - Pushed at: 8 months ago - Stars: 1,323 - Forks: 52

decodingml/second-brain-ai-assistant-course
Learn to build your Second Brain AI assistant with LLMs, agents, RAG, fine-tuning, LLMOps and AI systems techniques.
Language: Jupyter Notebook - Size: 166 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 1,199 - Forks: 190

KusionStack/kusion
Declarative Intent Driven Platform Orchestrator for Internal Developer Platform (IDP).
Language: Go - Size: 10.2 MB - Last synced at: 6 days ago - Pushed at: 15 days ago - Stars: 1,164 - Forks: 92

vllm-project/vllm-ascend
Community maintained hardware plugin for vLLM on Ascend
Language: Python - Size: 6.03 MB - Last synced at: about 21 hours ago - Pushed at: 1 day ago - Stars: 1,099 - Forks: 426

plurai-ai/intellagent
A framework for comprehensive diagnosis and optimization of agents using simulated, realistic synthetic interactions
Language: Python - Size: 14.2 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 1,071 - Forks: 133

tensorchord/VectorChord
Scalable, fast, and disk-friendly vector search in Postgres, the successor of pgvecto.rs.
Language: Rust - Size: 1.28 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 1,063 - Forks: 35

intentee/paddler
Open-source LLMOps platform for hosting and scaling AI in your own infrastructure 🏓🦙
Language: Rust - Size: 5.54 MB - Last synced at: 28 days ago - Pushed at: 28 days ago - Stars: 1,056 - Forks: 48

datadreamer-dev/DataDreamer
DataDreamer: Prompt. Generate Synthetic Data. Train & Align Models. 🤖💤
Language: Python - Size: 895 KB - Last synced at: 16 days ago - Pushed at: 7 months ago - Stars: 1,052 - Forks: 53

e2b-dev/awesome-ai-sdks
A database of SDKs, frameworks, libraries, and tools for creating, monitoring, debugging and deploying autonomous AI agents
Size: 7.08 MB - Last synced at: 5 days ago - Pushed at: 7 months ago - Stars: 1,024 - Forks: 87

dillionverma/llm.report 📦
📊 llm.report is an open-source logging and analytics platform for OpenAI: Log your ChatGPT API requests, analyze costs, and improve your prompts.
Language: TypeScript - Size: 23.5 MB - Last synced at: 15 days ago - Pushed at: over 1 year ago - Stars: 1,013 - Forks: 89

ajndkr/lanarky
The web framework for building LLM microservices [deprecated]
Language: Python - Size: 22.9 MB - Last synced at: 25 days ago - Pushed at: about 1 year ago - Stars: 994 - Forks: 78

prometheus-eval/prometheus-eval
Evaluate your LLM's response with Prometheus and GPT4 💯
Language: Python - Size: 15.1 MB - Last synced at: 3 months ago - Pushed at: 5 months ago - Stars: 953 - Forks: 55

Scale3-Labs/langtrace
Langtrace 🔍 is an open-source, Open Telemetry based end-to-end observability tool for LLM applications, providing real-time tracing, evaluations and metrics for popular LLMs, LLM frameworks, vectorDBs and more.. Integrate using Typescript, Python. 🚀💻📊
Language: TypeScript - Size: 3.69 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 933 - Forks: 89

msoedov/langcorn
⛓️ Serving LangChain LLM apps and agents automagically with FastApi. LLMops
Language: Python - Size: 850 KB - Last synced at: about 2 months ago - Pushed at: about 1 year ago - Stars: 931 - Forks: 74

dynamiq-ai/dynamiq
Dynamiq is an orchestration framework for agentic AI and LLM applications
Language: Python - Size: 5.71 MB - Last synced at: about 1 hour ago - Pushed at: about 2 hours ago - Stars: 927 - Forks: 105

getmetal/motorhead
🧠 Motorhead is a memory and information retrieval server for LLMs.
Language: Rust - Size: 323 KB - Last synced at: 6 months ago - Pushed at: 10 months ago - Stars: 868 - Forks: 82

NeumTry/NeumAI
Neum AI is a best-in-class framework to manage the creation and synchronization of vector embeddings at large scale.
Language: Python - Size: 3.83 MB - Last synced at: 12 days ago - Pushed at: over 1 year ago - Stars: 860 - Forks: 48

alibaba/rtp-llm
RTP-LLM: Alibaba's high-performance LLM inference engine for diverse applications.
Language: C++ - Size: 307 MB - Last synced at: 4 days ago - Pushed at: about 2 months ago - Stars: 847 - Forks: 71

onlyphantom/llm-python
Large Language Models (LLMs) tutorials & sample scripts, ft. langchain, openai, llamaindex, gpt, chromadb & pinecone
Language: Jupyter Notebook - Size: 2.13 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 835 - Forks: 283

morsoli/llm-books
利用LLM构建应用实践笔记
Language: Python - Size: 6.06 MB - Last synced at: 9 days ago - Pushed at: 10 months ago - Stars: 738 - Forks: 48

Azure-Samples/contoso-chat
This sample has the full End2End process of creating RAG application with Prompty and Azure AI Foundry. It includes GPT-4 LLM application code, evaluations, deployment automation with AZD CLI, GitHub actions for evaluation and deployment and intent mapping for multiple LLM task mapping.
Language: Bicep - Size: 234 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 706 - Forks: 4,041

stoyan-stoyanov/llmflows
LLMFlows - Simple, Explicit and Transparent LLM Apps
Language: Python - Size: 36.1 MB - Last synced at: 11 days ago - Pushed at: 7 months ago - Stars: 699 - Forks: 34

Arize-ai/openinference
OpenTelemetry Instrumentation for AI Observability
Language: Python - Size: 11.2 MB - Last synced at: about 21 hours ago - Pushed at: about 23 hours ago - Stars: 588 - Forks: 135

liguodongiot/llm-resource
LLM全栈优质资源汇总
Language: Shell - Size: 65.4 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 584 - Forks: 68

SmythOS/sre
The Operating System for Agents
Language: TypeScript - Size: 24 MB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 556 - Forks: 59

bosun-ai/swiftide
Fast, streaming indexing, query, and agentic LLM applications in Rust
Language: Rust - Size: 5.2 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 552 - Forks: 39

adaline/gateway
The only fully local production-grade Super SDK that provides a simple, unified, and powerful interface for calling more than 200+ LLMs.
Language: TypeScript - Size: 1.3 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 535 - Forks: 20

relari-ai/continuous-eval
Data-Driven Evaluation for LLM-Powered Applications
Language: Python - Size: 1.92 MB - Last synced at: 21 days ago - Pushed at: 8 months ago - Stars: 503 - Forks: 36

superlinked/VectorHub
VectorHub is a free, open-source learning website for people (software developers to senior ML architects) interested in adding vector retrieval to their ML stack.
Language: Jupyter Notebook - Size: 156 MB - Last synced at: 18 days ago - Pushed at: 18 days ago - Stars: 492 - Forks: 123

Writesonic/GPTRouter
Smoothly Manage Multiple LLMs (OpenAI, Anthropic, Azure) and Image Models (Dall-E, SDXL), Speed Up Responses, and Ensure Non-Stop Reliability.
Language: TypeScript - Size: 1.28 MB - Last synced at: 7 days ago - Pushed at: over 1 year ago - Stars: 454 - Forks: 41

operand/agency
A fast and minimal framework for building agentic systems
Language: Python - Size: 2.9 MB - Last synced at: 19 days ago - Pushed at: 19 days ago - Stars: 440 - Forks: 24

Kenza-AI/sagify
LLMs and Machine Learning done easily
Language: Python - Size: 36.1 MB - Last synced at: 14 days ago - Pushed at: about 1 month ago - Stars: 440 - Forks: 69

phospho-app/text-analytics-legacy
Text analytics for LLM apps. Cluster messages to detect use cases, outliers, power users. Detect intents and run evals with LLM (OpenAI, MistralAI, Ollama, etc.)
Language: Python - Size: 37.3 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 436 - Forks: 34

paulpierre/markdown-crawler
A multithreaded 🕸️ web crawler that recursively crawls a website and creates a 🔽 markdown file for each page, designed for LLM RAG
Language: Python - Size: 1.14 MB - Last synced at: 5 days ago - Pushed at: about 1 year ago - Stars: 400 - Forks: 49

deadbits/vigil-llm
⚡ Vigil ⚡ Detect prompt injections, jailbreaks, and other potentially risky Large Language Model (LLM) inputs
Language: Python - Size: 548 KB - Last synced at: 4 months ago - Pushed at: over 1 year ago - Stars: 385 - Forks: 39

GURPREETKAURJETHRA/END-TO-END-GENERATIVE-AI-PROJECTS
End to End Generative AI Industry Projects on LLM Models with Deployment_Awesome LLM Projects
Size: 196 KB - Last synced at: 4 days ago - Pushed at: 8 months ago - Stars: 382 - Forks: 112

AI-Hypercomputer/JetStream
JetStream is a throughput and memory optimized engine for LLM inference on XLA devices, starting with TPUs (and GPUs in future -- PRs welcome).
Language: Python - Size: 6.32 MB - Last synced at: 7 days ago - Pushed at: 3 months ago - Stars: 374 - Forks: 51

neurocult/agency
🕵️♂️ Library designed for developers eager to explore the potential of Large Language Models (LLMs) and other generative AI through a clean, effective, and Go-idiomatic approach.
Language: Go - Size: 894 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 373 - Forks: 19

traceloop/openllmetry-js
Sister project to OpenLLMetry, but in Typescript. Open-source observability for your LLM application, based on OpenTelemetry
Language: TypeScript - Size: 14.4 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 356 - Forks: 42

zmedelis/bosquet
Tooling to build LLM applications: prompt templating and composition, agents, LLM memory, and other instruments for builders of AI applications.
Language: Clojure - Size: 2.56 MB - Last synced at: 2 days ago - Pushed at: 12 days ago - Stars: 347 - Forks: 27

TensorOpsAI/LLMstudio
Framework to bring LLM applications to production
Language: Python - Size: 38.1 MB - Last synced at: about 2 months ago - Pushed at: 3 months ago - Stars: 345 - Forks: 38

microsoft/genaiops-promptflow-template
GenAIOps with Prompt Flow is a "GenAIOps template and guidance" to help you build LLM-infused apps using Prompt Flow. It offers a range of features including Centralized Code Hosting, Lifecycle Management, Variant and Hyperparameter Experimentation, A/B Deployment, reporting for all runs and experiments and so on.
Language: Python - Size: 6.78 MB - Last synced at: 4 days ago - Pushed at: 5 months ago - Stars: 341 - Forks: 272

TonicAI/tonic_validate
Metrics to evaluate the quality of responses of your Retrieval Augmented Generation (RAG) applications.
Language: Python - Size: 5.73 MB - Last synced at: 19 days ago - Pushed at: 2 months ago - Stars: 317 - Forks: 31
