GitHub topics: mllm-reasoning

Repositories

falonss703/Awesome-Uncertainty-based-Reinforcement-Learning

🔥🔥🔥Latest Papers, Codes on Uncertainty-based RL

Size: 8.79 KB - Last synced at: about 22 hours ago - Pushed at: about 22 hours ago - Stars: 9 - Forks: 0

XxabueloxX/Vision-Matters

Vision Matters explores how simple visual changes can enhance multimodal math reasoning. Join the discussion and contribute to the project! 👩💻👨💻

Language: Python - Size: 15.9 MB - Last synced at: 4 days ago - Pushed at: 4 days ago - Stars: 0 - Forks: 0

ritzz-ai/GUI-R1

Official implementation of GUI-R1 : A Generalist R1-Style Vision-Language Action Model For GUI Agents

Language: Python - Size: 974 KB - Last synced at: 4 days ago - Pushed at: about 1 month ago - Stars: 112 - Forks: 11

yaotingwangofficial/Awesome-MCoT

Multimodal Chain-of-Thought Reasoning: A Comprehensive Survey

Size: 4.63 MB - Last synced at: 30 days ago - Pushed at: 30 days ago - Stars: 576 - Forks: 15

We introduce the YesBut-v2, a benchmark for assessing AI's ability to interpret juxtaposed comic panels with contradictory narratives. Unlike existing benchmarks, it emphasizes visual understanding, comparative reasoning, and social knowledge.

Language: JavaScript - Size: 22.3 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 1 - Forks: 0

Wild-Cooperation-Hub/Awesome-MLLM-Reasoning-Benchmarks

A Comprehensive Survey on Evaluating Reasoning Capabilities in Multimodal Large Language Models.

Size: 89.8 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 26 - Forks: 2

Related Keywords

mllm-reasoning 6 multimodal 3 multimodal-large-language-models 3 reasoning 2 llm-reasoning 1 o1 1 r1 1 chain-of-thought 1 cot 1 deepseek-r1 1 instruction-tuning 1 large-vision-language-model 1 mcts 1 multimodal-chain-of-thought 1 openai-o1 1 slow-thinking 1 survey 1 system-2 1 benchmark 1 mllm-evaluation 1 vlm 1 yesbut 1 yesbut-v2 1 multimodal-reasoning 1 multimodal-reasoning-benchmarks 1 reasoning-language-models 1 rainforcement-learning 1 uncertainty-analysis 1 unsupervised-learning 1 3d-scene-understanding 1 ai 1 autonomous-agents 1 deep-learning 1 gpt4 1 html-css-javascript 1 imagenet 1 lv-vit 1 mllm 1 multi-modal-learning 1 python 1 pytorch 1 segmentation 1 transformer 1 vision 1 vision-language-learning 1 vision-language-model 1 deep-reinforcement-learning 1 grpo 1 gui-agent 1 large-multimodal-models 1

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Repos