mllm-reasoning | Topic | Ecosyste.ms: Repos

Topic: "mllm-reasoning"

yaotingwangofficial/Awesome-MCoT

Multimodal Chain-of-Thought Reasoning: A Comprehensive Survey

Size: 4.63 MB - Last synced at: 5 days ago - Pushed at: 5 days ago - Stars: 576 - Forks: 15

ritzz-ai/GUI-R1

Official implementation of GUI-R1: A Generalist R1-Style Vision-Language Action Model For GUI Agents

Language: Python - Size: 974 KB - Last synced at: 19 days ago - Pushed at: 19 days ago - Stars: 70 - Forks: 5

Wild-Cooperation-Hub/Awesome-MLLM-Reasoning-Benchmarks

A Comprehensive Survey on Evaluating Reasoning Capabilities in Multimodal Large Language Models.

Size: 89.8 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 26 - Forks: 2

vulab-AI/YESBUT-v2

We introduce YesBut-v2, a benchmark for assessing AI's ability to interpret juxtaposed comic panels with contradictory narratives. Unlike existing benchmarks, it emphasizes visual understanding, comparative reasoning, and social knowledge.

Language: JavaScript - Size: 22.3 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 1 - Forks: 0

vulab-AI/YESBUT-v2