An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: ai-evaluation-tools

raga-ai-hub/RagaAI-Catalyst

Python SDK for an agent AI observability, monitoring, and evaluation framework. Features include agent, LLM, and tool tracing, debugging of multi-agent systems, a self-hosted dashboard, and advanced analytics with timeline and execution graph views.

Language: Python - Size: 55.6 MB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 16,161 - Forks: 3,781

petmal/MindTrial

MindTrial: Evaluate and compare AI language models (LLMs) on text-based tasks with optional file/image attachments. Supports multiple providers (OpenAI, Google, Anthropic, DeepSeek), custom tasks in YAML, and HTML/CSV reports.

Language: Go - Size: 143 KB - Last synced at: 11 days ago - Pushed at: 11 days ago - Stars: 0 - Forks: 0