GitHub topics: multi-choice
X-PLUG/CValues
面向中文大模型价值观的评估与对齐研究
Language: Python - Size: 4.2 MB - Last synced at: about 1 month ago - Pushed at: about 2 years ago - Stars: 524 - Forks: 20

nl4opt/ORQA
[AAAI 2025] ORQA is a new QA benchmark designed to assess the reasoning capabilities of LLMs in a specialized technical domain of Operations Research. The benchmark evaluates whether LLMs can emulate the knowledge and reasoning skills of OR experts when presented with complex optimization modeling tasks.
Language: Python - Size: 2.49 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 36 - Forks: 0

MaitisamY/flashcard-quiz
Flashcard Quiz is a web application designed to help users practice and test their knowledge on various topics using flashcards. It offers a user-friendly interface and multiple-choice answer selection.
Language: JavaScript - Size: 339 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0
