YESBUT-v2

We introduce the YesBut-v2, a benchmark for assessing AI's ability to interpret juxtaposed comic panels with contradictory narratives. Unlike existing benchmarks, it emphasizes visual understanding, comparative reasoning, and social knowledge.

JSON API: http://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/vulab-AI%2FYESBUT-v2
PURL: pkg:github/vulab-AI/YESBUT-v2

Stars: 1
Forks: 0
Open issues: 0

License: mit
Language: JavaScript
Size: 22.3 MB
Dependencies parsed at: Pending

Created at: 4 months ago
Updated at: 4 months ago
Pushed at: 4 months ago
Last synced at: 4 months ago

Topics: benchmark, mllm-evaluation, mllm-reasoning, vlm, yesbut, yesbut-v2

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Repos

GitHub / vulab-AI / YESBUT-v2