An open API service providing repository metadata for many open source software ecosystems.

GitHub topics: llm4or

nl4opt/ORQA

[AAAI 2025] ORQA is a new QA benchmark designed to assess the reasoning capabilities of LLMs in a specialized technical domain of Operations Research. The benchmark evaluates whether LLMs can emulate the knowledge and reasoning skills of OR experts when presented with complex optimization modeling tasks.

Size: 2.48 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 37 - Forks: 0