An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: llmops

tensorchord/Awesome-LLMOps

An awesome & curated list of best LLMOps tools for developers

Language: Shell - Size: 306 KB - Last synced at: about 3 hours ago - Pushed at: about 4 hours ago - Stars: 5,021 - Forks: 484

vllm-project/vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Language: Python - Size: 57.3 MB - Last synced at: about 6 hours ago - Pushed at: about 6 hours ago - Stars: 50,538 - Forks: 8,267

adaline/gateway

The only fully local production-grade Super SDK that provides a simple, unified, and powerful interface for calling more than 200+ LLMs.

Language: TypeScript - Size: 1.09 MB - Last synced at: about 8 hours ago - Pushed at: about 8 hours ago - Stars: 500 - Forks: 21

SmythOS/smyth-docs

Everything you need to build, deploy, and collaborate with agents. Ride the llama, avoid the drama.

Language: TypeScript - Size: 112 MB - Last synced at: about 9 hours ago - Pushed at: about 10 hours ago - Stars: 2 - Forks: 0

Helicone/helicone

🧊 Open source LLM observability platform. One line of code to monitor, evaluate, and experiment. YC W23 🍓

Language: TypeScript - Size: 500 MB - Last synced at: about 10 hours ago - Pushed at: about 10 hours ago - Stars: 3,995 - Forks: 396

bentoml/OpenLLM

Run any open-source LLMs, such as DeepSeek and Llama, as OpenAI compatible API endpoint in the cloud.

Language: Python - Size: 41.1 MB - Last synced at: about 6 hours ago - Pushed at: about 12 hours ago - Stars: 11,414 - Forks: 731

langfuse/langfuse-js

🪢 Langfuse JS/TS SDKs - Instrument your LLM app and get detailed tracing/observability. Works with any LLM or framework

Language: TypeScript - Size: 4.83 MB - Last synced at: about 12 hours ago - Pushed at: about 12 hours ago - Stars: 62 - Forks: 57

Giskard-AI/giskard

🐢 Open-Source Evaluation & Testing for AI & LLM systems

Language: Python - Size: 176 MB - Last synced at: about 17 hours ago - Pushed at: about 17 hours ago - Stars: 4,647 - Forks: 331

InftyAI/Awesome-LLMOps

🎉 An awesome & curated list of best LLMOps tools.

Language: Python - Size: 7.17 MB - Last synced at: about 4 hours ago - Pushed at: 7 days ago - Stars: 123 - Forks: 24

langfuse/langfuse-java

🪢 Auto-generated Java Client for Langfuse API

Language: Java - Size: 229 KB - Last synced at: about 19 hours ago - Pushed at: about 20 hours ago - Stars: 13 - Forks: 2

vllm-project/vllm-ascend

Community maintained hardware plugin for vLLM on Ascend

Language: Python - Size: 2.43 MB - Last synced at: about 19 hours ago - Pushed at: about 20 hours ago - Stars: 790 - Forks: 213

superduper-io/superduper

Superduper: End-to-end framework for building custom AI applications and agents.

Language: Python - Size: 73.8 MB - Last synced at: about 20 hours ago - Pushed at: about 20 hours ago - Stars: 5,088 - Forks: 500

alibaba/rtp-llm

RTP-LLM: Alibaba's high-performance LLM inference engine for diverse applications.

Language: C++ - Size: 307 MB - Last synced at: about 8 hours ago - Pushed at: 21 days ago - Stars: 802 - Forks: 68

bentoml/BentoML

The easiest way to serve AI apps and models - Build Model Inference APIs, Job queues, LLM apps, Multi-model pipelines, and more!

Language: Python - Size: 95.8 MB - Last synced at: about 20 hours ago - Pushed at: about 21 hours ago - Stars: 7,809 - Forks: 847

bionic-gpt/bionic-gpt

BionicGPT is an on-premise replacement for ChatGPT, offering the advantages of Generative AI while maintaining strict data confidentiality

Language: Rust - Size: 113 MB - Last synced at: about 21 hours ago - Pushed at: about 21 hours ago - Stars: 2,189 - Forks: 216

shengyanli1982/llmproxy

🧭🧭 An intelligent load balancer with smart scheduling that unifies diverse LLMs.

Language: Rust - Size: 1.78 MB - Last synced at: about 22 hours ago - Pushed at: about 22 hours ago - Stars: 5 - Forks: 0

GURPREETKAURJETHRA/END-TO-END-GENERATIVE-AI-PROJECTS

End to End Generative AI Industry Projects on LLM Models with Deployment_Awesome LLM Projects

Size: 196 KB - Last synced at: about 7 hours ago - Pushed at: 5 months ago - Stars: 306 - Forks: 91

taishan666/MaxKB4j

MaxKB4j is an open-source LLMOps platform for LLM workflow applications and RAG developed based on the Java language. The project mainly draws on MaxKB, Dify and FastGPT, and combines the advantages of the two into one project. It is redesigned and developed using the high-performance, high-stability and secure reliable JAVA language.

Language: Java - Size: 44.4 MB - Last synced at: about 22 hours ago - Pushed at: about 22 hours ago - Stars: 9 - Forks: 1

apache/burr

Build applications that make decisions (chatbots, agents, simulations, etc...). Monitor, trace, persist, and execute on your own infrastructure.

Language: Python - Size: 38.3 MB - Last synced at: about 24 hours ago - Pushed at: about 24 hours ago - Stars: 1,688 - Forks: 84

Haohao-end/LMForge-End-to-End-LLMOps-Platform-for-Multi-Model-Agents

AI Agent Development Platform - Supports multiple models (OpenAI/DeepSeek/Wenxin/Tongyi), knowledge base management, workflow automation, and enterprise-grade security. Built with Flask + Vue3 + LangChain, featuring one-click Docker deployment.

Language: Python - Size: 32.3 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 1 - Forks: 0

traceloop/openllmetry

Open-source observability for your LLM application, based on OpenTelemetry

Language: Python - Size: 33.9 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 5,985 - Forks: 750

bilal0399/learn-agentic-ai

Learn Agentic AI using Dapr Agentic Cloud Ascent (DACA) Design Pattern and Agent-Native Cloud Technologies: OpenAI Agents SDK, Memory, MCP, A2A, Knowledge Graphs, Dapr, Rancher Desktop, and Kubernetes.

Size: 1.95 KB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 4 - Forks: 0

LianjiaTech/bella-openapi

Bella OpenAPI是一个提供了丰富的AI调用能力的API网关,可类比openrouter,与之不同的是除了提供聊天补全(chat-completion)能力外,还提供了文本向量化(text-embedding)、语音识别(ASR)、语音合成(TTS)、文生图、图生图等多种AI能力,同时集成了计费、限流和资源管理功能。且集成的所有能力都经过了大规模生产环境的验证。

Language: Java - Size: 3.49 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 51 - Forks: 13

belaa6912/AgentNull

AgentNull is a comprehensive catalog of attack vectors targeting autonomous AI agents, complete with proof-of-concepts for each method. Explore the structured threat information and replicate scenarios using the provided resources. 🐙👨💻

Language: Python - Size: 20.5 KB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 4 - Forks: 0

AgentEra/Agently

[GenAI Application Development Framework] 🚀 Build GenAI application quick and easy 💬 Easy to interact with GenAI agent in code using structure data and chained-calls syntax 🧩 Use Agently Workflow to manage complex GenAI working logic 🔀 Switch to any model without rewrite application code

Language: Python - Size: 29.1 MB - Last synced at: about 24 hours ago - Pushed at: about 2 months ago - Stars: 1,362 - Forks: 154

kewandigarcia2000/alith

Simple, Composable, High-Performance, Safe and Web3 Friendly AI Agents and LazAI Gateway for Everyone

Size: 2.93 KB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 0 - Forks: 0

predibase/lorax

Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs

Language: Python - Size: 6.62 MB - Last synced at: about 7 hours ago - Pushed at: about 1 month ago - Stars: 3,028 - Forks: 217

liguodongiot/llm-action

本项目旨在分享大模型相关技术原理以及实战经验(大模型工程化、大模型应用落地)

Language: HTML - Size: 23 MB - Last synced at: 1 day ago - Pushed at: 1 day ago - Stars: 18,706 - Forks: 2,224

lihaiya/FreeAiOps

专注在智能运维、自动化运维、Zabbix、Prometheus、Grafana、Nagios、ELK Stack(Elasticsearch、Logstash、Kibana)、Graylog、Ansible、SaltStack、Puppet、Chef、Terraform、Docker、Kubernetes、OpenShift、Jenkins、MySQL、PostgreSQL、MariaDB、Redis、MongoDB、InfluxDB、Ceph、MinIO,RabbitMQ、Kafka、NATS、Apache Pulsar、Nginx、Apache HTTP Server、HAProxy、Traefik、Caddy、OpenStack、OpenLDAP、FreeRDP等多个领域。

Language: Go - Size: 161 KB - Last synced at: 1 day ago - Pushed at: 2 days ago - Stars: 9 - Forks: 7

tenemos/langwatch

The open LLM Ops platform - Traces, Analytics, Evaluations, Datasets and Prompt Optimization ✨

Language: TypeScript - Size: 17.9 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 1 - Forks: 0

evidentlyai/evidently

Evidently is ​​an open-source ML and LLM observability framework. Evaluate, test, and monitor any AI-powered system or data pipeline. From tabular data to Gen AI. 100+ metrics.

Language: Jupyter Notebook - Size: 289 MB - Last synced at: 2 days ago - Pushed at: 3 days ago - Stars: 6,304 - Forks: 693

uptrain-ai/uptrain

UpTrain is an open-source unified platform to evaluate and improve Generative AI applications. We provide grades for 20+ preconfigured checks (covering language, code, embedding use-cases), perform root cause analysis on failure cases and give insights on how to resolve them.

Language: Python - Size: 36.9 MB - Last synced at: 3 days ago - Pushed at: 10 months ago - Stars: 2,282 - Forks: 199

OpenPipe/OpenPipe

Turn expensive prompts into cheap fine-tuned models

Language: TypeScript - Size: 11.6 MB - Last synced at: 2 days ago - Pushed at: about 1 year ago - Stars: 2,611 - Forks: 145

Kakz/prometheus-llm

PrometheusLLM is a unique transformer architecture inspired by dignity and recursion. This project aims to explore new frontiers in AI research and welcomes contributions from the community. 🐙🌟

Language: Python - Size: 257 KB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 0 - Forks: 0

titanium0202/Coffee_Shop_AI_Agents

About This project is an innovative coffee shop application designed to bring an engaging and personalized experience to coffee lovers. The app leverages AI-powered agents for chat-based interactions and integrates modern web and mobile development techniques to provide seamless ordering and delivery services.

Size: 31.9 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 0 - Forks: 0

interestingLSY/swiftLLM

A tiny yet powerful LLM inference system tailored for researching purpose. vLLM-equivalent performance with only 2k lines of code (2% of vLLM).

Language: Python - Size: 234 KB - Last synced at: about 7 hours ago - Pushed at: 14 days ago - Stars: 224 - Forks: 26

katanemo/archgw

The AI-native proxy server for agents. Arch handles the pesky low-level work in building agents like clariyfing vague user input, routing prompts to the right agents and unifying access to any LLM - all without locking you into a framework.

Language: Rust - Size: 20 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 2,737 - Forks: 153

Cre4T3Tiv3/llmops-dashboard

A modular frontend-first framework for AI engineers to compose and debug prompt pipelines, visualize model telemetry, and plug in custom LLM tools.

Language: Python - Size: 1.18 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 2 - Forks: 0

Netflix/metaflow

Build, Manage and Deploy AI/ML Systems

Language: Python - Size: 43.1 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 8,906 - Forks: 844

knagrecha/saturn

Saturn accelerates the training of large-scale deep learning models with a novel joint optimization approach.

Language: Python - Size: 107 KB - Last synced at: about 7 hours ago - Pushed at: over 1 year ago - Stars: 23 - Forks: 5

cubxxw/blog

环游世界旅游,创业做 AI 产品,一种比较新的方式和理念生活创业,欢迎订阅 RSS https://nsddd.top/zh/posts/index.xml

Language: HTML - Size: 45.5 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 20 - Forks: 2

craftgen/craftgen

Integrating AI into every workflow with our open-source, no-code platform, powered by the actor model for dynamic, graph-based solutions.

Language: TypeScript - Size: 97.1 MB - Last synced at: 3 days ago - Pushed at: 4 months ago - Stars: 298 - Forks: 25

tuanlda78202/leo

v0.1.0-beta

Language: Python - Size: 415 KB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 12 - Forks: 0

protectai/rebuff 📦

LLM Prompt Injection Detector

Language: TypeScript - Size: 6.99 MB - Last synced at: 4 days ago - Pushed at: 11 months ago - Stars: 1,301 - Forks: 105

protectai/llm-guard

The Security Toolkit for LLM Interactions

Language: Python - Size: 4.01 MB - Last synced at: 4 days ago - Pushed at: 8 days ago - Stars: 1,764 - Forks: 232

pezzolabs/pezzo

🕹️ Open-source, developer-first LLMOps platform designed to streamline prompt design, version management, instant delivery, collaboration, troubleshooting, observability and more.

Language: TypeScript - Size: 26.7 MB - Last synced at: 3 days ago - Pushed at: 4 months ago - Stars: 2,932 - Forks: 247

tensorchord/VectorChord

Scalable, fast, and disk-friendly vector search in Postgres, the successor of pgvecto.rs.

Language: Rust - Size: 809 KB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 859 - Forks: 30

mohsin489/GenAI_Agents

GenAI_Agents offers a rich collection of tutorials and tools for building and implementing Generative AI agents. Explore innovative projects and contribute to the future of AI development! 🛠️🌟

Language: Jupyter Notebook - Size: 48.5 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 0 - Forks: 0

AI-Hypercomputer/JetStream

JetStream is a throughput and memory optimized engine for LLM inference on XLA devices, starting with TPUs (and GPUs in future -- PRs welcome).

Language: Python - Size: 6.32 MB - Last synced at: 4 days ago - Pushed at: 13 days ago - Stars: 349 - Forks: 44

zenml-io/zenml

ZenML 🙏: The bridge between ML and Ops. https://zenml.io.

Language: Python - Size: 607 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 4,642 - Forks: 509

MarcosEdington/Generative-AI-Essentials

Explore a curated collection of resources on Generative AI, including free courses, articles, and videos from top institutions and organizations. Whether you're a beginner or have some experience, you'll find valuable materials to enhance your understanding. 🌟🤖

Size: 15.6 KB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 0 - Forks: 0

Mohamedyaslimcabdalla/generative_ai_project

Size: 8.79 KB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 0 - Forks: 0

GoogleCloudPlatform/agent-starter-pack

A collection of production-ready Generative AI Agent templates built for Google Cloud. It accelerates development by providing a holistic, production-ready solution, addressing common challenges (Deployment & Operations, Evaluation, Customization, Observability) in building and deploying GenAI agents.

Language: Python - Size: 11.5 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 2,030 - Forks: 628

Arize-ai/openinference

OpenTelemetry Instrumentation for AI Observability

Language: Python - Size: 7.83 MB - Last synced at: 4 days ago - Pushed at: 5 days ago - Stars: 473 - Forks: 99

0xPlaygrounds/rig

⚙️🦀 Build portable, modular & lightweight Fullstack Agents

Language: Rust - Size: 13.9 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 3,810 - Forks: 410

ComposioHQ/composio

Composio equips your AI agents & LLMs with 100+ high-quality integrations via function calling

Language: Python - Size: 916 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 25,513 - Forks: 4,423

lmnr-ai/lmnr

Laminar - open-source all-in-one platform for engineering AI products. Create data flywheel for your AI app. Traces, Evals, Datasets, Labels. YC S24.

Language: TypeScript - Size: 32.8 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 2,082 - Forks: 126

tencentmusic/cube-studio

cube studio开源云原生一站式机器学习/深度学习/大模型AI平台,支持sso登录,大数据平台对接,notebook在线开发,拖拉拽任务流pipeline编排,多机多卡分布式训练,超参搜索,推理服务VGPU,边缘计算,标注平台,自动化标注,大模型微调,vllm大模型推理,llmops,私有知识库,AI模型应用商店,支持模型一键开发/推理/微调,支持国产cpu/gpu/npu芯片,支持RDMA,支持pytorch/tf/mxnet/deepspeed/paddle/colossalai/horovod/spark/ray/volcano分布式,deepseek训练推理

Language: Jupyter Notebook - Size: 148 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 4,351 - Forks: 748

got-sanjay/Skin-Doctor

This is a web-based AI application that predicts skin diseases from uploaded images and provides professional suggestions based on the diagnosis.

Language: Jupyter Notebook - Size: 82.5 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 0 - Forks: 0

langwatch/langwatch

The open LLM Ops platform - Traces, Analytics, Evaluations, Datasets and Prompt Optimization ✨

Language: TypeScript - Size: 28.4 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 2,071 - Forks: 182

SynaLinks/synalinks

🧠🔗 Graph-Based Programmable Neuro-Symbolic LM Framework - a production-first LM framework built with decade old Deep Learning best practices

Language: Python - Size: 12.3 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 249 - Forks: 17

microsoft/genaiops-promptflow-template

GenAIOps with Prompt Flow is a "GenAIOps template and guidance" to help you build LLM-infused apps using Prompt Flow. It offers a range of features including Centralized Code Hosting, Lifecycle Management, Variant and Hyperparameter Experimentation, A/B Deployment, reporting for all runs and experiments and so on.

Language: Python - Size: 6.78 MB - Last synced at: 4 days ago - Pushed at: 2 months ago - Stars: 328 - Forks: 264

kolenaIO/kolena

Python client for Kolena's machine learning testing platform

Language: Python - Size: 75.4 MB - Last synced at: 2 days ago - Pushed at: 5 days ago - Stars: 47 - Forks: 5

dataelement/bisheng

BISHENG is an open LLM devops platform for next generation Enterprise AI applications. Powerful and comprehensive features include: GenAI workflow, RAG, Agent, Unified model management, Evaluation, SFT, Dataset Management, Enterprise-level System Management, Observability and more.

Language: TypeScript - Size: 46.8 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 8,865 - Forks: 1,454

0xLazAI/alith

Simple, Composable, High-Performance, Safe and Web3 Friendly AI Agents and LazAI Gateway for Everyone

Language: Rust - Size: 21.2 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 24 - Forks: 13

MaxMLang/pytector

Easy to use LLM Prompt Injection Detection / Detector Python Package

Language: Python - Size: 49.8 KB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 27 - Forks: 21

OpenXRIF/synapse

Robot VLM and VLA (Vision-Language-Action) inference API helping you manage multimodal prompts, RAG, and location metadata

Language: Rust - Size: 373 KB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 10 - Forks: 1

Portkey-AI/portkey-node-sdk

Build reliable, secure, and production-ready AI apps easily.

Language: TypeScript - Size: 6.61 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 39 - Forks: 12

trustyai-explainability/vllm_judge

A tiny, lightweight library for LLM-as-a-Judge evaluations on vLLM-hosted models.

Language: Python - Size: 765 KB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 1 - Forks: 1

awslabs/aiops-modules

AIOps modules is a collection of reusable Infrastructure as Code (IaC) modules for Machine Learning (ML), Foundation Models (FM), Large Language Models (LLM) and GenAI development and operations on AWS

Language: Python - Size: 7.14 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 81 - Forks: 27

Portkey-AI/helm-chart

Kubernetes Configs for Portkey Gateway deployment

Language: Smarty - Size: 206 KB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 3 - Forks: 3

microsoft/genaiops-azureaisdk-template

Implement GenAIOps using Azure AI Foundry with ease and jumpstart

Language: Python - Size: 458 KB - Last synced at: 4 days ago - Pushed at: 2 months ago - Stars: 23 - Forks: 30

langfuse/langfuse

🪢 Open source LLM engineering platform: LLM Observability, metrics, evals, prompt management, playground, datasets. Integrates with OpenTelemetry, Langchain, OpenAI SDK, LiteLLM, and more. 🍊YC W23

Language: TypeScript - Size: 21.9 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 12,742 - Forks: 1,163

comet-ml/opik

Debug, evaluate, and monitor your LLM applications, RAG systems, and agentic workflows with comprehensive tracing, automated evaluations, and production-ready dashboards.

Language: Python - Size: 245 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 9,851 - Forks: 675

Portkey-AI/portkey-python-sdk

Build reliable, secure, and production-ready AI apps easily.

Language: Python - Size: 7.28 MB - Last synced at: about 20 hours ago - Pushed at: about 20 hours ago - Stars: 73 - Forks: 20

kdeps/kdeps

Build AI Agents that runs free forever. Kdeps is an all-in-one AI framework for building purpose-built Dockerized full-stack AI applications (FE and BE) that includes open-source LLM models out-of-the-box with no subscriptions.

Language: Go - Size: 5.79 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 23 - Forks: 1

HuuVuong0912/rag-llm-based-recommender

Explore a smarter way to shop online with this full-stack project built on the infrastructure of Google Cloud Platform (GCP) for RAG based e-commerce with LLM.

Language: TypeScript - Size: 4.2 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 2 - Forks: 0

Arize-ai/phoenix

AI Observability & Evaluation

Language: Jupyter Notebook - Size: 337 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 5,985 - Forks: 460

krzko/google-cloud-mcp

🤖 A Model Context Protocol (MCP) server for Google Cloud

Language: TypeScript - Size: 311 KB - Last synced at: 5 days ago - Pushed at: 4 months ago - Stars: 38 - Forks: 5

truera/trulens

Evaluation and Tracking for LLM Experiments

Language: Python - Size: 344 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 2,571 - Forks: 217

BerriAI/litellm

Python SDK, Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sagemaker, HuggingFace, Replicate, Groq]

Language: Python - Size: 428 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 24,191 - Forks: 3,220

Portkey-AI/gateway

A blazing fast AI Gateway with integrated guardrails. Route to 200+ LLMs, 50+ AI Guardrails with 1 fast & friendly API.

Language: TypeScript - Size: 61.9 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 8,070 - Forks: 605

apache/hamilton

Apache Hamilton helps data scientists and engineers define testable, modular, self-documenting dataflows, that encode lineage/tracing and metadata. Runs and scales everywhere python does.

Language: Jupyter Notebook - Size: 98.6 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 2,160 - Forks: 149

JonathanChavezTamales/llm-leaderboard

A comprehensive set of LLM benchmark scores and provider prices.

Language: JavaScript - Size: 332 KB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 227 - Forks: 21

genieincodebottle/generative-ai

Comprehensive resources on Generative AI, including a detailed roadmap, projects, use cases, interview preparation, and coding preparation.

Language: Jupyter Notebook - Size: 52.7 MB - Last synced at: 6 days ago - Pushed at: 7 days ago - Stars: 1,021 - Forks: 267

promptfoo/promptfoo

Test your prompts, agents, and RAGs. Red teaming, pentesting, and vulnerability scanning for LLMs. Compare performance of GPT, Claude, Gemini, Llama, and more. Simple declarative configs with command line and CI/CD integration.

Language: TypeScript - Size: 223 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 7,226 - Forks: 575

e2b-dev/awesome-ai-sdks

A database of SDKs, frameworks, libraries, and tools for creating, monitoring, debugging and deploying autonomous AI agents

Size: 7.08 MB - Last synced at: 7 days ago - Pushed at: 4 months ago - Stars: 954 - Forks: 79

Josh-XT/AGiXT

AGiXT is a dynamic AI Agent Automation Platform that seamlessly orchestrates instruction management and complex task execution across diverse AI providers. Combining adaptive memory, smart features, and a versatile plugin system, AGiXT delivers efficient and comprehensive AI solutions.

Language: Python - Size: 168 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 3,029 - Forks: 408

niyogi/render-mcp

An unofficial MCP server for Render to help developers ship code faster via Cline, Cursor, and Windsurf

Language: TypeScript - Size: 61.5 KB - Last synced at: 7 days ago - Pushed at: 3 months ago - Stars: 11 - Forks: 5

sydverma123/awesome-ai-repositories

A curated list of open source repositories for AI Engineers

Size: 178 KB - Last synced at: about 8 hours ago - Pushed at: 3 months ago - Stars: 114 - Forks: 21

jostrm/azure-enterprise-scale-ml

Enterprise Scale AIFactory (esml) - on Azure

Language: Jupyter Notebook - Size: 120 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 44 - Forks: 16

explodinggradients/ragas

Supercharge Your LLM Application Evaluations 🚀

Language: Python - Size: 41 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 9,530 - Forks: 945

tensorzero/tensorzero

TensorZero is an open-source stack for industrial-grade LLM applications. It unifies an LLM gateway, observability, optimization, evaluation, and experimentation.

Language: Rust - Size: 98.4 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 7,344 - Forks: 438

tensorchord/envd

🏕️ Reproducible development environment

Language: Go - Size: 3.41 MB - Last synced at: 5 days ago - Pushed at: 7 days ago - Stars: 2,119 - Forks: 159

distantmagic/paddler

Stateful load balancer custom-tailored for llama.cpp 🏓🦙

Language: Rust - Size: 26.9 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 779 - Forks: 36

openlit/openlit

Open source platform for AI Engineering: OpenTelemetry-native LLM Observability, GPU Monitoring, Guardrails, Evaluations, Prompt Management, Vault, Playground. 🚀💻 Integrates with 50+ LLM Providers, VectorDBs, Agent Frameworks and GPUs.

Language: Python - Size: 44.4 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 1,634 - Forks: 160

dot-agent/nextpy

🤖Self-Modifying Framework from the Future 🔮 World's First AMS

Language: Python - Size: 56.6 MB - Last synced at: 7 days ago - Pushed at: about 1 year ago - Stars: 2,292 - Forks: 168

NeumTry/NeumAI

Neum AI is a best-in-class framework to manage the creation and synchronization of vector embeddings at large scale.

Language: Python - Size: 3.83 MB - Last synced at: about 8 hours ago - Pushed at: over 1 year ago - Stars: 857 - Forks: 47

maximhq/maxim-go

SDK to integrate Maxim in your Go app.

Language: Go - Size: 58.6 KB - Last synced at: 8 days ago - Pushed at: 8 days ago - Stars: 2 - Forks: 1

traceloop/hub

High-scale LLM gateway, written in Rust. OpenTelemetry-based observability included

Language: Rust - Size: 634 KB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 102 - Forks: 16