GitHub topics: vqa-2023

Repositories

yousefkotp/Visual-Question-Answering

A lightweight deep learning model with a web application that answers image-based questions using a non-generative approach for the VizWiz Grand Challenge 2023, built by carefully curating the answer vocabulary and adding a linear layer on top of OpenAI's CLIP model, which serves as both image and text encoder

Language: Jupyter Notebook - Size: 15.9 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 1
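
The non-generative approach described above can be read as classification over a fixed answer vocabulary: a linear layer scores each candidate answer from the encoder outputs. The following is a minimal sketch of that idea, not the repository's actual code. It assumes the image and text embeddings are concatenated before the linear layer, uses random placeholder vectors instead of real CLIP embeddings, and the names `answer`, `VOCAB`, and `EMB_DIM` are hypothetical.

```python
import math
import random


def linear(x, weights, bias):
    """Apply one linear layer: one logit per row of `weights`."""
    return [sum(w * v for w, v in zip(row, x)) + b for row, b in zip(weights, bias)]


def softmax(logits):
    """Turn logits into probabilities (numerically stable form)."""
    m = max(logits)
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]


def answer(image_emb, text_emb, weights, bias, vocab):
    """Pick the highest-scoring answer from a fixed vocabulary."""
    # Assumption: the two embeddings are fused by concatenation.
    features = image_emb + text_emb
    probs = softmax(linear(features, weights, bias))
    best = max(range(len(vocab)), key=probs.__getitem__)
    return vocab[best], probs


rng = random.Random(0)
EMB_DIM = 4  # toy size; real CLIP embeddings are much larger
VOCAB = ["yes", "no", "unanswerable"]  # hypothetical curated answers

# Placeholders standing in for CLIP image and question embeddings.
image_emb = [rng.uniform(-1, 1) for _ in range(EMB_DIM)]
text_emb = [rng.uniform(-1, 1) for _ in range(EMB_DIM)]

# Untrained linear head: one weight row per vocabulary entry.
weights = [[rng.gauss(0, 0.1) for _ in range(2 * EMB_DIM)] for _ in VOCAB]
bias = [0.0] * len(VOCAB)

prediction, probs = answer(image_emb, text_emb, weights, bias, VOCAB)
print(prediction, [round(p, 3) for p in probs])
```

Because the output is restricted to the vocabulary, the model can never produce an answer outside it, which is why the choice of vocabulary entries matters for this kind of design.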

Related Keywords

clip 1 clip-model 1 deep-learning 1 image-and-text 1 image-encoding 1 machine-learning 1 open-ai-clip 1 text-encoding 1 visual-question-answering 1 visual-question-anwsering 1 vizwiz 1 vizwiz-vqa 1 vqa 1 vqa-2023 1 vqa-dataset 1
