GitHub / aigc-apps / PertEval
[NeurIPS '24 Spotlight] PertEval: Unveiling Real Knowledge Capacity of LLMs via Knowledge-invariant Perturbations
JSON API: http://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/aigc-apps%2FPertEval
Stars: 12
Forks: 2
Open issues: 0
License: apache-2.0
Language: Jupyter Notebook
Size: 12.9 MB
Dependencies parsed at: Pending
Created at: 9 months ago
Updated at: about 2 months ago
Pushed at: 7 months ago
Last synced at: 4 days ago
Topics: evaluation-framework, evaluation-metrics, large-language-models, llm-evaluation, machine-learning, trustworthy-ai