GitHub topics: mllm-reasoning
falonss703/Awesome-Uncertainty-based-Reinforcement-Learning
π₯π₯π₯Latest Papers, Codes on Uncertainty-based RL
Size: 8.79 KB - Last synced at: about 22 hours ago - Pushed at: about 22 hours ago - Stars: 9 - Forks: 0

XxabueloxX/Vision-Matters
Vision Matters explores how simple visual changes can enhance multimodal math reasoning. Join the discussion and contribute to the project! π©π»π¨π»
Language: Python - Size: 15.9 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 0 - Forks: 0

ritzz-ai/GUI-R1
Official implementation of GUI-R1 : A Generalist R1-Style Vision-Language Action Model For GUI Agents
Language: Python - Size: 974 KB - Last synced at: 4 days ago - Pushed at: about 1 month ago - Stars: 112 - Forks: 11

yaotingwangofficial/Awesome-MCoT
Multimodal Chain-of-Thought Reasoning: A Comprehensive Survey
Size: 4.63 MB - Last synced at: 30 days ago - Pushed at: 30 days ago - Stars: 576 - Forks: 15

vulab-AI/YESBUT-v2
We introduce the YesBut-v2, a benchmark for assessing AI's ability to interpret juxtaposed comic panels with contradictory narratives. Unlike existing benchmarks, it emphasizes visual understanding, comparative reasoning, and social knowledge.
Language: JavaScript - Size: 22.3 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 1 - Forks: 0

Wild-Cooperation-Hub/Awesome-MLLM-Reasoning-Benchmarks
A Comprehensive Survey on Evaluating Reasoning Capabilities in Multimodal Large Language Models.
Size: 89.8 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 26 - Forks: 2
