An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: multi-agent-eval

The-Swarm-Corporation/StatisticalModelEvaluator

An implementation of the Anthropic's paper and essay on "A statistical approach to model evaluations"

Language: Python - Size: 2.32 MB - Last synced at: 2 days ago - Pushed at: 27 days ago - Stars: 16 - Forks: 1