GitHub topics: autoevaluation
uptrain-ai/uptrain
UpTrain is an open-source unified platform to evaluate and improve Generative AI applications. We provide grades for 20+ preconfigured checks (covering language, code, embedding use-cases), perform root cause analysis on failure cases and give insights on how to resolve them.
Language: Python - Size: 36.9 MB - Last synced at: 6 days ago - Pushed at: 10 months ago - Stars: 2,282 - Forks: 199

amazon-science/BeyondCorrelation
Implementation of the paper: Beyond Correlation: The impact of human uncertainty in measuring the effectiveness of automatic evaluation and LLM-as-a-judge
Language: Python - Size: 691 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 7 - Forks: 1

PremierLangage/premierlangage
Server for auto-evaluating exercices
Language: Python - Size: 178 MB - Last synced at: 4 months ago - Pushed at: about 1 year ago - Stars: 18 - Forks: 9

vicgalle/autocrit-likert-gpt
Automatic and zero-shot critique of outputs using the OpenAI API with json outputs
Language: Python - Size: 12.7 KB - Last synced at: about 1 year ago - Pushed at: about 2 years ago - Stars: 4 - Forks: 1
