GitHub topics: llm-evals

Because we should all have our own set of LLM evals.

Language: Python - Size: 13.9 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 1 - Forks: 1

Create an evaluation framework for your LLM based app. Incorporate it into your test suite. Lay the monitoring foundation.

Language: Jupyter Notebook - Size: 11.6 MB - Last synced at: about 1 month ago - Pushed at: about 2 months ago - Stars: 7 - Forks: 5

An implementation of the Anthropic's paper and essay on "A statistical approach to model evaluations"

Language: Python - Size: 2.32 MB - Last synced at: 21 days ago - Pushed at: 3 months ago - Stars: 16 - Forks: 1

Related Keywords

ecosyste.ms