ecosyste.ms

Repos

An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: autoevaluation

Repositories

uptrain-ai/uptrain

UpTrain is an open-source unified platform to evaluate and improve Generative AI applications. We provide grades for 20+ preconfigured checks (covering language, code, embedding use-cases), perform root cause analysis on failure cases and give insights on how to resolve them.

Language: Python - Size: 36.9 MB - Last synced at: 6 days ago - Pushed at: 10 months ago - Stars: 2,282 - Forks: 199

amazon-science/BeyondCorrelation

Implementation of the paper: Beyond Correlation: The impact of human uncertainty in measuring the effectiveness of automatic evaluation and LLM-as-a-judge

Language: Python - Size: 691 KB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 7 - Forks: 1

PremierLangage/premierlangage

Server for auto-evaluating exercices

Language: Python - Size: 178 MB - Last synced at: 4 months ago - Pushed at: about 1 year ago - Stars: 18 - Forks: 9

vicgalle/autocrit-likert-gpt

Automatic and zero-shot critique of outputs using the OpenAI API with json outputs

Language: Python - Size: 12.7 KB - Last synced at: about 1 year ago - Pushed at: about 2 years ago - Stars: 4 - Forks: 1

Related Keywords

autoevaluation 4 evaluation 2 experimentation 1 hallucination-detection 1 jailbreak-detection 1 llm-eval 1 llm-prompting 1 llm-test 1 llmops 1 machine-learning 1 monitoring 1 openai-evals 1 prompt-engineering 1 root-cause-analysis 1 correlation 1 llm 1 computer-science 1 exercice 1 exercise 1 lms 1 maths 1 quiz 1 students 1 critique 1 gpt-3-5-turbo 1 gpt-4 1 helpfulness 1 likert 1 openai-api 1