GitHub topics: vqa-dataset

Repositories

yanx27/CLEVR3D

CLEVR3D Dataset: Comprehensive Visual Question Answering on Point Clouds through Compositional Scene Manipulation

Language: Python - Size: 5.22 MB - Last synced at: 9 days ago - Pushed at: over 1 year ago - Stars: 19 - Forks: 1

brightvqa/BrightVQ

Repository for New HDR-UGC dataset BrightVQ and a new SOTA model BrightRate

Language: Python - Size: 425 MB - Last synced at: 16 days ago - Pushed at: 17 days ago - Stars: 0 - Forks: 1

abdur75648/MedicalGPT

Medical Report Generation And VQA (Adapting XrayGPT to Any Modality)

Language: Python - Size: 24.9 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 6 - Forks: 4

hacheyz/FlowchartQA

Create flowchart QA datasets using Python and Mermaid, free of AIGC.

Language: Python - Size: 457 KB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 1 - Forks: 0

chakravarthi589/Video-Question-Answering_Resources

Video Question Answering | Video QA | VQA

Size: 580 KB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 14 - Forks: 6

csebuetnlp/IllusionVQA

This repository contains the data and code of the paper titled "IllusionVQA: A Challenging Optical Illusion Dataset for Vision Language Models"

Language: Jupyter Notebook - Size: 87.7 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 13 - Forks: 2

google-research-datasets/maverics 📦

MAVERICS (Manually-vAlidated Vq^2a Examples fRom Image-Caption datasetS) is a suite of test-only benchmarks for visual question answering (VQA).

Size: 2.18 MB - Last synced at: 3 months ago - Pushed at: over 2 years ago - Stars: 13 - Forks: 1

abachaa/VQA-Med-2021

VQA-Med 2021

Language: Python - Size: 46.7 MB - Last synced at: 4 months ago - Pushed at: about 3 years ago - Stars: 19 - Forks: 3

badripatro/awesome-vqg

Visual Question Generation reading list

Size: 18.6 KB - Last synced at: 13 days ago - Pushed at: almost 5 years ago - Stars: 29 - Forks: 4

vzhou842/easy-VQA

The Easy Visual Question Answering dataset.

Language: Python - Size: 9.5 MB - Last synced at: 10 days ago - Pushed at: almost 2 years ago - Stars: 33 - Forks: 11

shreshthsaini/CHUG

CHUG: Crowdsourced User-Generated HDR Video Quality Dataset

Language: JavaScript - Size: 0 Bytes - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 0 - Forks: 0

sutdcv/SUTD-TrafficQA

[CVPR2021] SUTD-TrafficQA: A Question Answering Benchmark and an Efficient Network for Video Reasoning over Traffic Events

Language: JavaScript - Size: 6 MB - Last synced at: 4 months ago - Pushed at: 11 months ago - Stars: 53 - Forks: 2

fraction-ai/GAP

Gamified Adversarial Prompting (GAP): Crowdsourcing AI-weakness-targeting data through gamification. Boost model performance with community-driven, strategic data collection

Language: Python - Size: 8.92 MB - Last synced at: 9 months ago - Pushed at: 10 months ago - Stars: 4 - Forks: 0

CAMMA-public/SSG-VQA

SSG-VQA is a Visual Question Answering (VQA) dataset on laparoscopic videos providing diverse, geometrically grounded, unbiased and surgical action-oriented queries generated using scene graphs.

Language: Python - Size: 2.39 MB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 28 - Forks: 1

Cloud-CV/VQA

CloudCV Visual Question Answering Demo

Language: Lua - Size: 4.75 MB - Last synced at: about 2 months ago - Pushed at: over 2 years ago - Stars: 66 - Forks: 24

gutbash/lmm-graph-vision

How well do the GPT-4V, Gemini Pro Vision, and Claude 3 Opus models perform zero-shot vision tasks on data structures?

Language: Python - Size: 186 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 2 - Forks: 1

findalexli/SciGraphQA

SciGraphQA

Language: Jupyter Notebook - Size: 16.7 MB - Last synced at: over 1 year ago - Pushed at: almost 2 years ago - Stars: 32 - Forks: 2

chandrakanthm/visual-question-generator

Language: Python - Size: 1.36 MB - Last synced at: over 1 year ago - Pushed at: over 8 years ago - Stars: 1 - Forks: 0

radonys/CFB-VQA

VQA Challenge - hosted on Hasura using Flask

Language: Python - Size: 49.5 MB - Last synced at: over 1 year ago - Pushed at: about 7 years ago - Stars: 2 - Forks: 0

thatAverageGuy/EarlyFusion-on-EasyVQA

Streamlit app for demonstrating multi-modal(vision+language) modelling in Pytorch.

Language: Python - Size: 2.74 MB - Last synced at: over 1 year ago - Pushed at: almost 3 years ago - Stars: 1 - Forks: 0

dinesh-kumar-mr/MediVQA

Part of our final year project work involving complex NLP tasks along with experimentation on various datasets and different LLMs

Language: HTML - Size: 1.98 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

abachaa/VQA-Med-2019

Visual Question Answering in the Medical Domain VQA-Med 2019

Size: 20 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 65 - Forks: 26

Letian2003/C-VQA

Counterfactual Reasoning VQA Dataset

Language: Python - Size: 271 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 14 - Forks: 2

ghazaleh-mahmoodi/lxmert_compression

B.Sc. Final Project: LXMERT Model Compression for Visual Question Answering.

Language: Python - Size: 11.6 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 2 - Forks: 1

juletx/egunean-behin-vqa

Egunean Behin Visual Question Answering Dataset

Language: Jupyter Notebook - Size: 12.9 MB - Last synced at: almost 2 years ago - Pushed at: over 3 years ago - Stars: 2 - Forks: 0

manoja328/vqatools

API for VQA , visual 7w dataset

Language: Jupyter Notebook - Size: 520 KB - Last synced at: almost 2 years ago - Pushed at: almost 8 years ago - Stars: 3 - Forks: 1

yousefkotp/Visual-Question-Answering

A Light weight deep learning model with with a web application to answer image-based questions with a non-generative approach for the VizWiz grand challenge 2023 by carefully curating the answer vocabulary and adding linear layer on top of Open AI's CLIP model as image and text encoder

Language: Jupyter Notebook - Size: 15.9 MB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 1

vztu/BVQA_Benchmark

A resource list and performance benchmark for blind video quality assessment (BVQA) models on user-generated content (UGC) datasets. [IEEE TIP'2021] "UGC-VQA: Benchmarking Blind Video Quality Assessment for User Generated Content", Zhengzhong Tu, Yilin Wang, Neil Birkbeck, Balu Adsumilli, Alan C. Bovik

Language: Python - Size: 872 KB - Last synced at: over 2 years ago - Pushed at: over 3 years ago - Stars: 96 - Forks: 15

vqa-dataset 38 vqa 30 visual-question-answering 11 deep-learning 4 pytorch 4 machine-learning 4 llm 4 question-answering 4 dataset 4 tensorflow 4 python 3 computer-vision 3 medical-imaging 3 visual-question-generation 3 datasets 2 nlp 2 vqg 2 artificial-intelligence 2 multimodal 2 cvpr 2 radiology 2 vqa-med 2 llms 2 vgg16 2 video-quality-assessment 2 multimodal-deep-learning 2 hdr-video 2 domain-adaptation 2 scene-graph 2 visual7w 1 qa 1 clip 1 clip-model 1 image-and-text 1 image-encoding 1 open-ai-clip 1 text-encoding 1 visual-question-anwsering 1 vizwiz 1 hackathon-project 1 hasura 1 keras-models 1 keras-tensorflow 1 lstm 1 early-fusion 1 streamlit 1 transformers 1 llms-benchmarking 1 medical-application 1 vqa-med-2018 1 imageclef 1 benchmark 1 counterfactual 1 reasoning 1 symbolic 1 pruning 1 egunean-behin 1 question-generation 1 triplet 1 triplet-loss 1 extract-features 1 iccv2021 1 vqav2 1 cnn 1 rnn 1 flask 1 keras 1 san 1 stacked-attention-networks 1 tensorflow2 1 visual-q 1 attention-model 1 indoor-scenes 1 keyword-text 1 cnn-model 1 deep 1 deeplearning 1 vizwiz-vqa 1 vqa-2023 1 bvqa-benchmark 1 bvqa-models 1 image-quality-assessment 1 performance-benchmark 1 picture-quality 1 ugc-datasets 1 ugc-vqa 1 youtube-dataset 1 spacy 1 dialog 1 multi-modal 1 phythia 1 visual 1 awesome-vqg 1 classification-model 1 emnlp-2018 1 multimodel 1 multimodel-network 1 point-cloud 1 acm 1 arxiv-papers 1

GitHub topics: vqa-dataset

google-research-datasets/maverics 📦

rentainhe/TRAR-Feature-Extraction Fork of facebookresearch/grid-feats-vqa