An open API service providing repository metadata for many open source software ecosystems.

Topic: "vision-language-dataset"

Q-Future/Q-Bench

①[ICLR 2024 Spotlight] (GPT-4V/Gemini-Pro/Qwen-VL-Plus + 16 open-source MLLMs) A benchmark for multi-modality LLMs (MLLMs) on low-level vision and visual quality assessment.

Language: Jupyter Notebook - Size: 29.2 MB - Last synced at: 14 days ago - Pushed at: 9 months ago - Stars: 264 - Forks: 13

oakink/OakInk2

🌴[CVPR 2024] OakInk2: A Dataset of Bimanual Hands-Object Manipulation in Complex Task Completion

Language: Python - Size: 11.6 MB - Last synced at: 16 days ago - Pushed at: 17 days ago - Stars: 38 - Forks: 0

unitaryai/VTC-dataset

Language: Python - Size: 38.1 KB - Last synced at: 8 months ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0