An open API service providing repository metadata for many open source software ecosystems.

Topic: "judge-model"

IAAR-Shanghai/xFinder

[ICLR 2025] xFinder: Large Language Models as Automated Evaluators for Reliable Evaluation

Language: Python - Size: 1.36 MB - Last synced at: 13 days ago - Pushed at: 3 months ago - Stars: 169 - Forks: 7

IAAR-Shanghai/xVerify

xVerify: Efficient Answer Verifier for Reasoning Model Evaluations

Language: Python - Size: 826 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 71 - Forks: 5

Abhisang3/xVerify

xVerify: Efficient Answer Verifier for Large Language Model Evaluations

Language: Python - Size: 806 KB - Last synced at: 3 days ago - Pushed at: 3 days ago - Stars: 0 - Forks: 0