GitHub topics: human-evaluation-scores
avnCode/Topics_in_AI
We propose a novel evaluation technique for LLMs which surpasses BeRT based evaluation scores in terms of correlation with human evaluation scores
Language: Jupyter Notebook - Size: 169 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0
