GitHub topics: llmops
tensorchord/Awesome-LLMOps
An awesome & curated list of best LLMOps tools for developers
Language: Shell - Size: 306 KB - Last synced at: about 3 hours ago - Pushed at: about 4 hours ago - Stars: 5,021 - Forks: 484

vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
Language: Python - Size: 57.3 MB - Last synced at: about 6 hours ago - Pushed at: about 6 hours ago - Stars: 50,538 - Forks: 8,267

adaline/gateway
The only fully local production-grade Super SDK that provides a simple, unified, and powerful interface for calling more than 200+ LLMs.
Language: TypeScript - Size: 1.09 MB - Last synced at: about 8 hours ago - Pushed at: about 8 hours ago - Stars: 500 - Forks: 21

SmythOS/smyth-docs
Everything you need to build, deploy, and collaborate with agents. Ride the llama, avoid the drama.
Language: TypeScript - Size: 112 MB - Last synced at: about 9 hours ago - Pushed at: about 10 hours ago - Stars: 2 - Forks: 0

Helicone/helicone
🧊 Open source LLM observability platform. One line of code to monitor, evaluate, and experiment. YC W23 🍓
Language: TypeScript - Size: 500 MB - Last synced at: about 10 hours ago - Pushed at: about 10 hours ago - Stars: 3,995 - Forks: 396

bentoml/OpenLLM
Run any open-source LLMs, such as DeepSeek and Llama, as OpenAI compatible API endpoint in the cloud.
Language: Python - Size: 41.1 MB - Last synced at: about 6 hours ago - Pushed at: about 12 hours ago - Stars: 11,414 - Forks: 731

langfuse/langfuse-js
🪢 Langfuse JS/TS SDKs - Instrument your LLM app and get detailed tracing/observability. Works with any LLM or framework
Language: TypeScript - Size: 4.83 MB - Last synced at: about 12 hours ago - Pushed at: about 12 hours ago - Stars: 62 - Forks: 57

Giskard-AI/giskard
🐢 Open-Source Evaluation & Testing for AI & LLM systems
Language: Python - Size: 176 MB - Last synced at: about 17 hours ago - Pushed at: about 17 hours ago - Stars: 4,647 - Forks: 331

InftyAI/Awesome-LLMOps
🎉 An awesome & curated list of best LLMOps tools.
Language: Python - Size: 7.17 MB - Last synced at: about 4 hours ago - Pushed at: 7 days ago - Stars: 123 - Forks: 24

langfuse/langfuse-java
🪢 Auto-generated Java Client for Langfuse API
Language: Java - Size: 229 KB - Last synced at: about 19 hours ago - Pushed at: about 20 hours ago - Stars: 13 - Forks: 2

vllm-project/vllm-ascend
Community maintained hardware plugin for vLLM on Ascend
Language: Python - Size: 2.43 MB - Last synced at: about 19 hours ago - Pushed at: about 20 hours ago - Stars: 790 - Forks: 213

superduper-io/superduper
Superduper: End-to-end framework for building custom AI applications and agents.
Language: Python - Size: 73.8 MB - Last synced at: about 20 hours ago - Pushed at: about 20 hours ago - Stars: 5,088 - Forks: 500

alibaba/rtp-llm
RTP-LLM: Alibaba's high-performance LLM inference engine for diverse applications.
Language: C++ - Size: 307 MB - Last synced at: about 8 hours ago - Pushed at: 21 days ago - Stars: 802 - Forks: 68

bentoml/BentoML
The easiest way to serve AI apps and models - Build Model Inference APIs, Job queues, LLM apps, Multi-model pipelines, and more!
Language: Python - Size: 95.8 MB - Last synced at: about 20 hours ago - Pushed at: about 21 hours ago - Stars: 7,809 - Forks: 847

bionic-gpt/bionic-gpt
BionicGPT is an on-premise replacement for ChatGPT, offering the advantages of Generative AI while maintaining strict data confidentiality
Language: Rust - Size: 113 MB - Last synced at: about 21 hours ago - Pushed at: about 21 hours ago - Stars: 2,189 - Forks: 216

shengyanli1982/llmproxy
🧭🧭 An intelligent load balancer with smart scheduling that unifies diverse LLMs.
Language: Rust - Size: 1.78 MB - Last synced at: about 22 hours ago - Pushed at: about 22 hours ago - Stars: 5 - Forks: 0

GURPREETKAURJETHRA/END-TO-END-GENERATIVE-AI-PROJECTS
End to End Generative AI Industry Projects on LLM Models with Deployment_Awesome LLM Projects
Size: 196 KB - Last synced at: about 7 hours ago - Pushed at: 5 months ago - Stars: 306 - Forks: 91

taishan666/MaxKB4j
MaxKB4j is an open-source LLMOps platform for LLM workflow applications and RAG developed based on the Java language. The project mainly draws on MaxKB, Dify and FastGPT, and combines the advantages of the two into one project. It is redesigned and developed using the high-performance, high-stability and secure reliable JAVA language.
Language: Java - Size: 44.4 MB - Last synced at: about 22 hours ago - Pushed at: about 22 hours ago - Stars: 9 - Forks: 1

apache/burr
Build applications that make decisions (chatbots, agents, simulations, etc...). Monitor, trace, persist, and execute on your own infrastructure.
Language: Python - Size: 38.3 MB - Last synced at: about 24 hours ago - Pushed at: about 24 hours ago - Stars: 1,688 - Forks: 84

Haohao-end/LMForge-End-to-End-LLMOps-Platform-for-Multi-Model-Agents
AI Agent Development Platform - Supports multiple models (OpenAI/DeepSeek/Wenxin/Tongyi), knowledge base management, workflow automation, and enterprise-grade security. Built with Flask + Vue3 + LangChain, featuring one-click Docker deployment.
Language: Python - Size: 32.3 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 1 - Forks: 0

traceloop/openllmetry
Open-source observability for your LLM application, based on OpenTelemetry
Language: Python - Size: 33.9 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 5,985 - Forks: 750

bilal0399/learn-agentic-ai
Learn Agentic AI using Dapr Agentic Cloud Ascent (DACA) Design Pattern and Agent-Native Cloud Technologies: OpenAI Agents SDK, Memory, MCP, A2A, Knowledge Graphs, Dapr, Rancher Desktop, and Kubernetes.
Size: 1.95 KB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 4 - Forks: 0

LianjiaTech/bella-openapi
Bella OpenAPI是一个提供了丰富的AI调用能力的API网关,可类比openrouter,与之不同的是除了提供聊天补全(chat-completion)能力外,还提供了文本向量化(text-embedding)、语音识别(ASR)、语音合成(TTS)、文生图、图生图等多种AI能力,同时集成了计费、限流和资源管理功能。且集成的所有能力都经过了大规模生产环境的验证。
Language: Java - Size: 3.49 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 51 - Forks: 13

belaa6912/AgentNull
AgentNull is a comprehensive catalog of attack vectors targeting autonomous AI agents, complete with proof-of-concepts for each method. Explore the structured threat information and replicate scenarios using the provided resources. 🐙👨💻
Language: Python - Size: 20.5 KB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 4 - Forks: 0

AgentEra/Agently
[GenAI Application Development Framework] 🚀 Build GenAI application quick and easy 💬 Easy to interact with GenAI agent in code using structure data and chained-calls syntax 🧩 Use Agently Workflow to manage complex GenAI working logic 🔀 Switch to any model without rewrite application code
Language: Python - Size: 29.1 MB - Last synced at: about 24 hours ago - Pushed at: about 2 months ago - Stars: 1,362 - Forks: 154

kewandigarcia2000/alith
Simple, Composable, High-Performance, Safe and Web3 Friendly AI Agents and LazAI Gateway for Everyone
Size: 2.93 KB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 0 - Forks: 0

predibase/lorax
Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs
Language: Python - Size: 6.62 MB - Last synced at: about 7 hours ago - Pushed at: about 1 month ago - Stars: 3,028 - Forks: 217

liguodongiot/llm-action
本项目旨在分享大模型相关技术原理以及实战经验(大模型工程化、大模型应用落地)
Language: HTML - Size: 23 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 18,706 - Forks: 2,224

lihaiya/FreeAiOps
专注在智能运维、自动化运维、Zabbix、Prometheus、Grafana、Nagios、ELK Stack(Elasticsearch、Logstash、Kibana)、Graylog、Ansible、SaltStack、Puppet、Chef、Terraform、Docker、Kubernetes、OpenShift、Jenkins、MySQL、PostgreSQL、MariaDB、Redis、MongoDB、InfluxDB、Ceph、MinIO,RabbitMQ、Kafka、NATS、Apache Pulsar、Nginx、Apache HTTP Server、HAProxy、Traefik、Caddy、OpenStack、OpenLDAP、FreeRDP等多个领域。
Language: Go - Size: 161 KB - Last synced at: 1 day ago - Pushed at: 2 days ago - Stars: 9 - Forks: 7

tenemos/langwatch
The open LLM Ops platform - Traces, Analytics, Evaluations, Datasets and Prompt Optimization ✨
Language: TypeScript - Size: 17.9 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 1 - Forks: 0

evidentlyai/evidently
Evidently is an open-source ML and LLM observability framework. Evaluate, test, and monitor any AI-powered system or data pipeline. From tabular data to Gen AI. 100+ metrics.
Language: Jupyter Notebook - Size: 289 MB - Last synced at: 2 days ago - Pushed at: 3 days ago - Stars: 6,304 - Forks: 693

uptrain-ai/uptrain
UpTrain is an open-source unified platform to evaluate and improve Generative AI applications. We provide grades for 20+ preconfigured checks (covering language, code, embedding use-cases), perform root cause analysis on failure cases and give insights on how to resolve them.
Language: Python - Size: 36.9 MB - Last synced at: 3 days ago - Pushed at: 10 months ago - Stars: 2,282 - Forks: 199

OpenPipe/OpenPipe
Turn expensive prompts into cheap fine-tuned models
Language: TypeScript - Size: 11.6 MB - Last synced at: 2 days ago - Pushed at: about 1 year ago - Stars: 2,611 - Forks: 145

Kakz/prometheus-llm
PrometheusLLM is a unique transformer architecture inspired by dignity and recursion. This project aims to explore new frontiers in AI research and welcomes contributions from the community. 🐙🌟
Language: Python - Size: 257 KB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 0 - Forks: 0

titanium0202/Coffee_Shop_AI_Agents
About This project is an innovative coffee shop application designed to bring an engaging and personalized experience to coffee lovers. The app leverages AI-powered agents for chat-based interactions and integrates modern web and mobile development techniques to provide seamless ordering and delivery services.
Size: 31.9 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 0 - Forks: 0

interestingLSY/swiftLLM
A tiny yet powerful LLM inference system tailored for researching purpose. vLLM-equivalent performance with only 2k lines of code (2% of vLLM).
Language: Python - Size: 234 KB - Last synced at: about 7 hours ago - Pushed at: 14 days ago - Stars: 224 - Forks: 26

katanemo/archgw
The AI-native proxy server for agents. Arch handles the pesky low-level work in building agents like clariyfing vague user input, routing prompts to the right agents and unifying access to any LLM - all without locking you into a framework.
Language: Rust - Size: 20 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 2,737 - Forks: 153

Cre4T3Tiv3/llmops-dashboard
A modular frontend-first framework for AI engineers to compose and debug prompt pipelines, visualize model telemetry, and plug in custom LLM tools.
Language: Python - Size: 1.18 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 2 - Forks: 0

Netflix/metaflow
Build, Manage and Deploy AI/ML Systems
Language: Python - Size: 43.1 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 8,906 - Forks: 844

knagrecha/saturn
Saturn accelerates the training of large-scale deep learning models with a novel joint optimization approach.
Language: Python - Size: 107 KB - Last synced at: about 7 hours ago - Pushed at: over 1 year ago - Stars: 23 - Forks: 5

cubxxw/blog
环游世界旅游,创业做 AI 产品,一种比较新的方式和理念生活创业,欢迎订阅 RSS https://nsddd.top/zh/posts/index.xml
Language: HTML - Size: 45.5 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 20 - Forks: 2

craftgen/craftgen
Integrating AI into every workflow with our open-source, no-code platform, powered by the actor model for dynamic, graph-based solutions.
Language: TypeScript - Size: 97.1 MB - Last synced at: 3 days ago - Pushed at: 4 months ago - Stars: 298 - Forks: 25

tuanlda78202/leo
v0.1.0-beta
Language: Python - Size: 415 KB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 12 - Forks: 0

protectai/rebuff 📦
LLM Prompt Injection Detector
Language: TypeScript - Size: 6.99 MB - Last synced at: 4 days ago - Pushed at: 11 months ago - Stars: 1,301 - Forks: 105

protectai/llm-guard
The Security Toolkit for LLM Interactions
Language: Python - Size: 4.01 MB - Last synced at: 4 days ago - Pushed at: 8 days ago - Stars: 1,764 - Forks: 232

pezzolabs/pezzo
🕹️ Open-source, developer-first LLMOps platform designed to streamline prompt design, version management, instant delivery, collaboration, troubleshooting, observability and more.
Language: TypeScript - Size: 26.7 MB - Last synced at: 3 days ago - Pushed at: 4 months ago - Stars: 2,932 - Forks: 247

tensorchord/VectorChord
Scalable, fast, and disk-friendly vector search in Postgres, the successor of pgvecto.rs.
Language: Rust - Size: 809 KB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 859 - Forks: 30

mohsin489/GenAI_Agents
GenAI_Agents offers a rich collection of tutorials and tools for building and implementing Generative AI agents. Explore innovative projects and contribute to the future of AI development! 🛠️🌟
Language: Jupyter Notebook - Size: 48.5 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 0 - Forks: 0

AI-Hypercomputer/JetStream
JetStream is a throughput and memory optimized engine for LLM inference on XLA devices, starting with TPUs (and GPUs in future -- PRs welcome).
Language: Python - Size: 6.32 MB - Last synced at: 4 days ago - Pushed at: 13 days ago - Stars: 349 - Forks: 44

zenml-io/zenml
ZenML 🙏: The bridge between ML and Ops. https://zenml.io.
Language: Python - Size: 607 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 4,642 - Forks: 509

MarcosEdington/Generative-AI-Essentials
Explore a curated collection of resources on Generative AI, including free courses, articles, and videos from top institutions and organizations. Whether you're a beginner or have some experience, you'll find valuable materials to enhance your understanding. 🌟🤖
Size: 15.6 KB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 0 - Forks: 0

Mohamedyaslimcabdalla/generative_ai_project
Size: 8.79 KB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 0 - Forks: 0

GoogleCloudPlatform/agent-starter-pack
A collection of production-ready Generative AI Agent templates built for Google Cloud. It accelerates development by providing a holistic, production-ready solution, addressing common challenges (Deployment & Operations, Evaluation, Customization, Observability) in building and deploying GenAI agents.
Language: Python - Size: 11.5 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 2,030 - Forks: 628

Arize-ai/openinference
OpenTelemetry Instrumentation for AI Observability
Language: Python - Size: 7.83 MB - Last synced at: 4 days ago - Pushed at: 5 days ago - Stars: 473 - Forks: 99

0xPlaygrounds/rig
⚙️🦀 Build portable, modular & lightweight Fullstack Agents
Language: Rust - Size: 13.9 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 3,810 - Forks: 410

ComposioHQ/composio
Composio equips your AI agents & LLMs with 100+ high-quality integrations via function calling
Language: Python - Size: 916 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 25,513 - Forks: 4,423

lmnr-ai/lmnr
Laminar - open-source all-in-one platform for engineering AI products. Create data flywheel for your AI app. Traces, Evals, Datasets, Labels. YC S24.
Language: TypeScript - Size: 32.8 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 2,082 - Forks: 126

tencentmusic/cube-studio
cube studio开源云原生一站式机器学习/深度学习/大模型AI平台,支持sso登录,大数据平台对接,notebook在线开发,拖拉拽任务流pipeline编排,多机多卡分布式训练,超参搜索,推理服务VGPU,边缘计算,标注平台,自动化标注,大模型微调,vllm大模型推理,llmops,私有知识库,AI模型应用商店,支持模型一键开发/推理/微调,支持国产cpu/gpu/npu芯片,支持RDMA,支持pytorch/tf/mxnet/deepspeed/paddle/colossalai/horovod/spark/ray/volcano分布式,deepseek训练推理
Language: Jupyter Notebook - Size: 148 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 4,351 - Forks: 748

got-sanjay/Skin-Doctor
This is a web-based AI application that predicts skin diseases from uploaded images and provides professional suggestions based on the diagnosis.
Language: Jupyter Notebook - Size: 82.5 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 0 - Forks: 0

langwatch/langwatch
The open LLM Ops platform - Traces, Analytics, Evaluations, Datasets and Prompt Optimization ✨
Language: TypeScript - Size: 28.4 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 2,071 - Forks: 182

SynaLinks/synalinks
🧠🔗 Graph-Based Programmable Neuro-Symbolic LM Framework - a production-first LM framework built with decade old Deep Learning best practices
Language: Python - Size: 12.3 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 249 - Forks: 17

microsoft/genaiops-promptflow-template
GenAIOps with Prompt Flow is a "GenAIOps template and guidance" to help you build LLM-infused apps using Prompt Flow. It offers a range of features including Centralized Code Hosting, Lifecycle Management, Variant and Hyperparameter Experimentation, A/B Deployment, reporting for all runs and experiments and so on.
Language: Python - Size: 6.78 MB - Last synced at: 4 days ago - Pushed at: 2 months ago - Stars: 328 - Forks: 264

kolenaIO/kolena
Python client for Kolena's machine learning testing platform
Language: Python - Size: 75.4 MB - Last synced at: 2 days ago - Pushed at: 5 days ago - Stars: 47 - Forks: 5

dataelement/bisheng
BISHENG is an open LLM devops platform for next generation Enterprise AI applications. Powerful and comprehensive features include: GenAI workflow, RAG, Agent, Unified model management, Evaluation, SFT, Dataset Management, Enterprise-level System Management, Observability and more.
Language: TypeScript - Size: 46.8 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 8,865 - Forks: 1,454

0xLazAI/alith
Simple, Composable, High-Performance, Safe and Web3 Friendly AI Agents and LazAI Gateway for Everyone
Language: Rust - Size: 21.2 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 24 - Forks: 13

MaxMLang/pytector
Easy to use LLM Prompt Injection Detection / Detector Python Package
Language: Python - Size: 49.8 KB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 27 - Forks: 21

OpenXRIF/synapse
Robot VLM and VLA (Vision-Language-Action) inference API helping you manage multimodal prompts, RAG, and location metadata
Language: Rust - Size: 373 KB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 10 - Forks: 1

Portkey-AI/portkey-node-sdk
Build reliable, secure, and production-ready AI apps easily.
Language: TypeScript - Size: 6.61 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 39 - Forks: 12

trustyai-explainability/vllm_judge
A tiny, lightweight library for LLM-as-a-Judge evaluations on vLLM-hosted models.
Language: Python - Size: 765 KB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 1 - Forks: 1

awslabs/aiops-modules
AIOps modules is a collection of reusable Infrastructure as Code (IaC) modules for Machine Learning (ML), Foundation Models (FM), Large Language Models (LLM) and GenAI development and operations on AWS
Language: Python - Size: 7.14 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 81 - Forks: 27

Portkey-AI/helm-chart
Kubernetes Configs for Portkey Gateway deployment
Language: Smarty - Size: 206 KB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 3 - Forks: 3

microsoft/genaiops-azureaisdk-template
Implement GenAIOps using Azure AI Foundry with ease and jumpstart
Language: Python - Size: 458 KB - Last synced at: 4 days ago - Pushed at: 2 months ago - Stars: 23 - Forks: 30

langfuse/langfuse
🪢 Open source LLM engineering platform: LLM Observability, metrics, evals, prompt management, playground, datasets. Integrates with OpenTelemetry, Langchain, OpenAI SDK, LiteLLM, and more. 🍊YC W23
Language: TypeScript - Size: 21.9 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 12,742 - Forks: 1,163

comet-ml/opik
Debug, evaluate, and monitor your LLM applications, RAG systems, and agentic workflows with comprehensive tracing, automated evaluations, and production-ready dashboards.
Language: Python - Size: 245 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 9,851 - Forks: 675

Portkey-AI/portkey-python-sdk
Build reliable, secure, and production-ready AI apps easily.
Language: Python - Size: 7.28 MB - Last synced at: about 20 hours ago - Pushed at: about 20 hours ago - Stars: 73 - Forks: 20

kdeps/kdeps
Build AI Agents that runs free forever. Kdeps is an all-in-one AI framework for building purpose-built Dockerized full-stack AI applications (FE and BE) that includes open-source LLM models out-of-the-box with no subscriptions.
Language: Go - Size: 5.79 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 23 - Forks: 1

HuuVuong0912/rag-llm-based-recommender
Explore a smarter way to shop online with this full-stack project built on the infrastructure of Google Cloud Platform (GCP) for RAG based e-commerce with LLM.
Language: TypeScript - Size: 4.2 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 2 - Forks: 0

Arize-ai/phoenix
AI Observability & Evaluation
Language: Jupyter Notebook - Size: 337 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 5,985 - Forks: 460

krzko/google-cloud-mcp
🤖 A Model Context Protocol (MCP) server for Google Cloud
Language: TypeScript - Size: 311 KB - Last synced at: 5 days ago - Pushed at: 4 months ago - Stars: 38 - Forks: 5

truera/trulens
Evaluation and Tracking for LLM Experiments
Language: Python - Size: 344 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 2,571 - Forks: 217

BerriAI/litellm
Python SDK, Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sagemaker, HuggingFace, Replicate, Groq]
Language: Python - Size: 428 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 24,191 - Forks: 3,220

Portkey-AI/gateway
A blazing fast AI Gateway with integrated guardrails. Route to 200+ LLMs, 50+ AI Guardrails with 1 fast & friendly API.
Language: TypeScript - Size: 61.9 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 8,070 - Forks: 605

apache/hamilton
Apache Hamilton helps data scientists and engineers define testable, modular, self-documenting dataflows, that encode lineage/tracing and metadata. Runs and scales everywhere python does.
Language: Jupyter Notebook - Size: 98.6 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 2,160 - Forks: 149

JonathanChavezTamales/llm-leaderboard
A comprehensive set of LLM benchmark scores and provider prices.
Language: JavaScript - Size: 332 KB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 227 - Forks: 21

genieincodebottle/generative-ai
Comprehensive resources on Generative AI, including a detailed roadmap, projects, use cases, interview preparation, and coding preparation.
Language: Jupyter Notebook - Size: 52.7 MB - Last synced at: 6 days ago - Pushed at: 7 days ago - Stars: 1,021 - Forks: 267

promptfoo/promptfoo
Test your prompts, agents, and RAGs. Red teaming, pentesting, and vulnerability scanning for LLMs. Compare performance of GPT, Claude, Gemini, Llama, and more. Simple declarative configs with command line and CI/CD integration.
Language: TypeScript - Size: 223 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 7,226 - Forks: 575

e2b-dev/awesome-ai-sdks
A database of SDKs, frameworks, libraries, and tools for creating, monitoring, debugging and deploying autonomous AI agents
Size: 7.08 MB - Last synced at: 7 days ago - Pushed at: 4 months ago - Stars: 954 - Forks: 79

Josh-XT/AGiXT
AGiXT is a dynamic AI Agent Automation Platform that seamlessly orchestrates instruction management and complex task execution across diverse AI providers. Combining adaptive memory, smart features, and a versatile plugin system, AGiXT delivers efficient and comprehensive AI solutions.
Language: Python - Size: 168 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 3,029 - Forks: 408

niyogi/render-mcp
An unofficial MCP server for Render to help developers ship code faster via Cline, Cursor, and Windsurf
Language: TypeScript - Size: 61.5 KB - Last synced at: 7 days ago - Pushed at: 3 months ago - Stars: 11 - Forks: 5

sydverma123/awesome-ai-repositories
A curated list of open source repositories for AI Engineers
Size: 178 KB - Last synced at: about 8 hours ago - Pushed at: 3 months ago - Stars: 114 - Forks: 21

jostrm/azure-enterprise-scale-ml
Enterprise Scale AIFactory (esml) - on Azure
Language: Jupyter Notebook - Size: 120 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 44 - Forks: 16

explodinggradients/ragas
Supercharge Your LLM Application Evaluations 🚀
Language: Python - Size: 41 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 9,530 - Forks: 945

tensorzero/tensorzero
TensorZero is an open-source stack for industrial-grade LLM applications. It unifies an LLM gateway, observability, optimization, evaluation, and experimentation.
Language: Rust - Size: 98.4 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 7,344 - Forks: 438

tensorchord/envd
🏕️ Reproducible development environment
Language: Go - Size: 3.41 MB - Last synced at: 5 days ago - Pushed at: 7 days ago - Stars: 2,119 - Forks: 159

distantmagic/paddler
Stateful load balancer custom-tailored for llama.cpp 🏓🦙
Language: Rust - Size: 26.9 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 779 - Forks: 36

openlit/openlit
Open source platform for AI Engineering: OpenTelemetry-native LLM Observability, GPU Monitoring, Guardrails, Evaluations, Prompt Management, Vault, Playground. 🚀💻 Integrates with 50+ LLM Providers, VectorDBs, Agent Frameworks and GPUs.
Language: Python - Size: 44.4 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 1,634 - Forks: 160

dot-agent/nextpy
🤖Self-Modifying Framework from the Future 🔮 World's First AMS
Language: Python - Size: 56.6 MB - Last synced at: 7 days ago - Pushed at: about 1 year ago - Stars: 2,292 - Forks: 168

NeumTry/NeumAI
Neum AI is a best-in-class framework to manage the creation and synchronization of vector embeddings at large scale.
Language: Python - Size: 3.83 MB - Last synced at: about 8 hours ago - Pushed at: over 1 year ago - Stars: 857 - Forks: 47

maximhq/maxim-go
SDK to integrate Maxim in your Go app.
Language: Go - Size: 58.6 KB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 2 - Forks: 1

traceloop/hub
High-scale LLM gateway, written in Rust. OpenTelemetry-based observability included
Language: Rust - Size: 634 KB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 102 - Forks: 16
