Topic: "mllm-reasoning"
yaotingwangofficial/Awesome-MCoT
Multimodal Chain-of-Thought Reasoning: A Comprehensive Survey
Size: 4.58 MB - Last synced at: 13 days ago - Pushed at: 14 days ago - Stars: 465 - Forks: 9

ritzz-ai/GUI-R1
Official implementation of GUI-R1 : A Generalist R1-Style Vision-Language Action Model For GUI Agents
Language: Python - Size: 958 KB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 52 - Forks: 1

Wild-Cooperation-Hub/Awesome-MLLM-Reasoning-Benchmarks
A Comprehensive Survey on Evaluating Reasoning Capabilities in Multimodal Large Language Models.
Size: 89.8 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 26 - Forks: 2

vulab-AI/YESBUT-v2
We introduce the YesBut-v2, a benchmark for assessing AI's ability to interpret juxtaposed comic panels with contradictory narratives. Unlike existing benchmarks, it emphasizes visual understanding, comparative reasoning, and social knowledge.
Language: JavaScript - Size: 22.3 MB - Last synced at: 25 days ago - Pushed at: 25 days ago - Stars: 1 - Forks: 0
