GitHub topics: multi-agent-eval
The-Swarm-Corporation/StatisticalModelEvaluator
An implementation of the Anthropic's paper and essay on "A statistical approach to model evaluations"
Language: Python - Size: 2.32 MB - Last synced at: 2 days ago - Pushed at: 27 days ago - Stars: 16 - Forks: 1

Related Keywords