Topic: "vision-language-dataset"
Q-Future/Q-Bench
①[ICLR2024 Spotlight] (GPT-4V/Gemini-Pro/Qwen-VL-Plus+16 OS MLLMs) A benchmark for multi-modality LLMs (MLLMs) on low-level vision and visual quality assessment.
Language: Jupyter Notebook - Size: 29.2 MB - Last synced at: 14 days ago - Pushed at: 9 months ago - Stars: 264 - Forks: 13

oakink/OakInk2
🌴[CVPR 2024] OakInk2: A Dataset of Bimanual Hands-Object Manipulation in Complex Task Completion
Language: Python - Size: 11.6 MB - Last synced at: 16 days ago - Pushed at: 17 days ago - Stars: 38 - Forks: 0

unitaryai/VTC-dataset
Language: Python - Size: 38.1 KB - Last synced at: 8 months ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0
