Topic: "video-qa"
sutdcv/SUTD-TrafficQA
[CVPR2021] SUTD-TrafficQA: A Question Answering Benchmark and an Efficient Network for Video Reasoning over Traffic Events
Language: JavaScript - Size: 6 MB - Last synced at: about 2 months ago - Pushed at: 9 months ago - Stars: 53 - Forks: 2

RenShuhuai-Andy/TESTA
[EMNLP 2023] TESTA: Temporal-Spatial Token Aggregation for Long-form Video-Language Understanding
Language: Python - Size: 835 KB - Last synced at: 12 months ago - Pushed at: over 1 year ago - Stars: 40 - Forks: 3

TXH-mercury/COSA
[ICLR2024] Codes and Models for COSA: Concatenated Sample Pretrained Vision-Language Foundation Model
Language: Python - Size: 84.2 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 39 - Forks: 3

Kyung-Min/Deep-Embedded-Memory-Networks
https://arxiv.org/abs/1707.00836
Language: Jupyter Notebook - Size: 21.5 KB - Last synced at: over 1 year ago - Pushed at: over 7 years ago - Stars: 22 - Forks: 6

ZJULearning/videoqa
Unifying the Video and Question Attentions for Open-Ended Video Question Answering
Language: Python - Size: 17.4 MB - Last synced at: over 1 year ago - Pushed at: almost 6 years ago - Stars: 21 - Forks: 4
