Topic: "multi-agent-eval"
The-Swarm-Corporation/StatisticalModelEvaluator
An implementation of Anthropic's paper and essay on "A statistical approach to model evaluations"
Language: Python - Size: 2.32 MB - Last synced at: 6 days ago - Pushed at: about 1 month ago - Stars: 16 - Forks: 1
