An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: llm-evals

kevinschaul/llm-evals

Because we should all have our own set of LLM evals.

Language: Python - Size: 11.1 MB - Last synced at: 1 day ago - Pushed at: 2 days ago - Stars: 1 - Forks: 1

The-Swarm-Corporation/StatisticalModelEvaluator

An implementation of Anthropic's paper and essay, "A statistical approach to model evaluations"

Language: Python - Size: 2.32 MB - Last synced at: about 18 hours ago - Pushed at: 26 days ago - Stars: 16 - Forks: 1

pyladiesams/eval-llm-based-apps-jan2025

Create an evaluation framework for your LLM-based app. Incorporate it into your test suite. Lay the monitoring foundation.

Language: Jupyter Notebook - Size: 11.6 MB - Last synced at: 23 days ago - Pushed at: 4 months ago - Stars: 7 - Forks: 5