GitHub topics: vqa-dataset
google-research-datasets/maverics 📦
MAVERICS (Manually-vAlidated Vq^2a Examples fRom Image-Caption datasetS) is a suite of test-only benchmarks for visual question answering (VQA).
Size: 2.18 MB - Last synced at: 6 days ago - Pushed at: about 2 years ago - Stars: 13 - Forks: 1

abachaa/VQA-Med-2021
VQA-Med 2021
Language: Python - Size: 46.7 MB - Last synced at: 18 days ago - Pushed at: almost 3 years ago - Stars: 19 - Forks: 3

badripatro/awesome-vqg
Visual Question Generation reading list
Size: 18.6 KB - Last synced at: 10 days ago - Pushed at: over 4 years ago - Stars: 29 - Forks: 4

yanx27/CLEVR3D
CLEVR3D Dataset: Comprehensive Visual Question Answering on Point Clouds through Compositional Scene Manipulation
Language: Python - Size: 5.22 MB - Last synced at: 16 days ago - Pushed at: about 1 year ago - Stars: 17 - Forks: 1

vzhou842/easy-VQA
The Easy Visual Question Answering dataset.
Language: Python - Size: 9.5 MB - Last synced at: 4 days ago - Pushed at: over 1 year ago - Stars: 33 - Forks: 11

shreshthsaini/CHUG
CHUG: Crowdsourced User-Generated HDR Video Quality Dataset
Language: JavaScript - Size: 0 Bytes - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

chakravarthi589/Video-Question-Answering_Resources
Video Question Answering | Video QA | VQA
Size: 546 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 9 - Forks: 5

sutdcv/SUTD-TrafficQA
[CVPR2021] SUTD-TrafficQA: A Question Answering Benchmark and an Efficient Network for Video Reasoning over Traffic Events
Language: JavaScript - Size: 6 MB - Last synced at: about 1 month ago - Pushed at: 8 months ago - Stars: 53 - Forks: 2

fraction-ai/GAP
Gamified Adversarial Prompting (GAP): Crowdsourcing AI-weakness-targeting data through gamification. Boost model performance with community-driven, strategic data collection
Language: Python - Size: 8.92 MB - Last synced at: 5 months ago - Pushed at: 6 months ago - Stars: 4 - Forks: 0

csebuetnlp/IllusionVQA
This repository contains the data and code of the paper titled "IllusionVQA: A Challenging Optical Illusion Dataset for Vision Language Models"
Language: Jupyter Notebook - Size: 87.2 MB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 8 - Forks: 1

CAMMA-public/SSG-VQA
SSG-VQA is a Visual Question Answering (VQA) dataset on laparoscopic videos providing diverse, geometrically grounded, unbiased and surgical action-oriented queries generated using scene graphs.
Language: Python - Size: 2.39 MB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 28 - Forks: 1

abdur75648/MedicalGPT
Medical Report Generation And VQA (Adapting XrayGPT to Any Modality)
Language: Python - Size: 24.9 MB - Last synced at: 22 days ago - Pushed at: 10 months ago - Stars: 1 - Forks: 0

Cloud-CV/VQA
CloudCV Visual Question Answering Demo
Language: Lua - Size: 4.75 MB - Last synced at: 2 days ago - Pushed at: over 2 years ago - Stars: 66 - Forks: 24

gutbash/lmm-graph-vision
How well do the GPT-4V, Gemini Pro Vision, and Claude 3 Opus models perform zero-shot vision tasks on data structures?
Language: Python - Size: 186 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 2 - Forks: 1

findalexli/SciGraphQA
SciGraphQA
Language: Jupyter Notebook - Size: 16.7 MB - Last synced at: 12 months ago - Pushed at: over 1 year ago - Stars: 32 - Forks: 2

chandrakanthm/visual-question-generator
Language: Python - Size: 1.36 MB - Last synced at: about 1 year ago - Pushed at: about 8 years ago - Stars: 1 - Forks: 0

radonys/CFB-VQA
VQA Challenge - hosted on Hasura using Flask
Language: Python - Size: 49.5 MB - Last synced at: about 1 year ago - Pushed at: almost 7 years ago - Stars: 2 - Forks: 0

thatAverageGuy/EarlyFusion-on-EasyVQA
Streamlit app for demonstrating multi-modal(vision+language) modelling in Pytorch.
Language: Python - Size: 2.74 MB - Last synced at: about 1 year ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

dinesh-kumar-mr/MediVQA
Part of our final year project work involving complex NLP tasks along with experimentation on various datasets and different LLMs
Language: HTML - Size: 1.98 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

abachaa/VQA-Med-2019
Visual Question Answering in the Medical Domain VQA-Med 2019
Size: 20 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 65 - Forks: 26

Letian2003/C-VQA
Counterfactual Reasoning VQA Dataset
Language: Python - Size: 271 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 14 - Forks: 2

ghazaleh-mahmoodi/lxmert_compression
B.Sc. Final Project: LXMERT Model Compression for Visual Question Answering.
Language: Python - Size: 11.6 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 2 - Forks: 1

juletx/egunean-behin-vqa
Egunean Behin Visual Question Answering Dataset
Language: Jupyter Notebook - Size: 12.9 MB - Last synced at: over 1 year ago - Pushed at: about 3 years ago - Stars: 2 - Forks: 0

manoja328/vqatools
API for VQA , visual 7w dataset
Language: Jupyter Notebook - Size: 520 KB - Last synced at: over 1 year ago - Pushed at: over 7 years ago - Stars: 3 - Forks: 1

yousefkotp/Visual-Question-Answering
A Light weight deep learning model with with a web application to answer image-based questions with a non-generative approach for the VizWiz grand challenge 2023 by carefully curating the answer vocabulary and adding linear layer on top of Open AI's CLIP model as image and text encoder
Language: Jupyter Notebook - Size: 15.9 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 1

vztu/BVQA_Benchmark
A resource list and performance benchmark for blind video quality assessment (BVQA) models on user-generated content (UGC) datasets. [IEEE TIP'2021] "UGC-VQA: Benchmarking Blind Video Quality Assessment for User Generated Content", Zhengzhong Tu, Yilin Wang, Neil Birkbeck, Balu Adsumilli, Alan C. Bovik
Language: Python - Size: 872 KB - Last synced at: about 2 years ago - Pushed at: about 3 years ago - Stars: 96 - Forks: 15

AnshDesai/visual-question-answering
Deep Learning Web app that responds to any question about an image.
Language: Python - Size: 105 KB - Last synced at: over 1 year ago - Pushed at: almost 5 years ago - Stars: 0 - Forks: 0

zeryabmoussaoui/Real-time-VQA
A real-time Visual Question Answering Framework
Language: Jupyter Notebook - Size: 797 KB - Last synced at: about 2 years ago - Pushed at: over 5 years ago - Stars: 3 - Forks: 2

badripatro/MDN-VQG
Size: 3.4 MB - Last synced at: almost 2 years ago - Pushed at: over 5 years ago - Stars: 9 - Forks: 3

rentainhe/TRAR-Feature-Extraction Fork of facebookresearch/grid-feats-vqa
Grid features extraction for ICCV 2021 paper "TRAR: Routing the Attention Spans in Transformers for Visual Question Answering"
Language: Python - Size: 73.2 KB - Last synced at: about 2 years ago - Pushed at: over 3 years ago - Stars: 2 - Forks: 1

IAmS4n/Visual-Question-Answering
Investigation on VQA dataset. TensorFlow is utilized for the implementation of a solution based on CNN and RNN architectures plus some ideas such as Attention and Positional features.
Language: Python - Size: 2.76 MB - Last synced at: about 2 years ago - Pushed at: over 4 years ago - Stars: 2 - Forks: 1

cserajdeep/Visual-Question-Answering-VQA
Visual Question Answering (VQA)
Language: Python - Size: 18.6 KB - Last synced at: 25 days ago - Pushed at: over 3 years ago - Stars: 0 - Forks: 0

jiayi-wei/vqa-tf2
Language: Python - Size: 146 KB - Last synced at: almost 2 years ago - Pushed at: almost 5 years ago - Stars: 1 - Forks: 0

VibhuJawa/vqa-2018
This repo implements attention networks for visual question answering
Language: Python - Size: 112 MB - Last synced at: 16 days ago - Pushed at: over 6 years ago - Stars: 4 - Forks: 1

zeryabmoussaoui/VQA-dataset-Generator
Language: Jupyter Notebook - Size: 457 KB - Last synced at: about 2 years ago - Pushed at: over 5 years ago - Stars: 1 - Forks: 2

nishitmehta1/Deep-Image-Understanding-Visual-Question-Answering
Language: Python - Size: 168 KB - Last synced at: about 2 years ago - Pushed at: over 6 years ago - Stars: 1 - Forks: 0
