video-qa | Topic | Ecosyste.ms: Repos

Topic: "video-qa"

[CVPR2021] SUTD-TrafficQA: A Question Answering Benchmark and an Efficient Network for Video Reasoning over Traffic Events

Language: JavaScript - Size: 6 MB - Last synced at: about 2 months ago - Pushed at: 9 months ago - Stars: 53 - Forks: 2

[EMNLP 2023] TESTA: Temporal-Spatial Token Aggregation for Long-form Video-Language Understanding

Language: Python - Size: 835 KB - Last synced at: 12 months ago - Pushed at: over 1 year ago - Stars: 40 - Forks: 3

[ICLR2024] Code and Models for COSA: Concatenated Sample Pretrained Vision-Language Foundation Model

Language: Python - Size: 84.2 MB - Last synced at: 4 months ago - Pushed at: 4 months ago - Stars: 39 - Forks: 3

https://arxiv.org/abs/1707.00836

Language: Jupyter Notebook - Size: 21.5 KB - Last synced at: over 1 year ago - Pushed at: over 7 years ago - Stars: 22 - Forks: 6

Unifying the Video and Question Attentions for Open-Ended Video Question Answering

Language: Python - Size: 17.4 MB - Last synced at: over 1 year ago - Pushed at: almost 6 years ago - Stars: 21 - Forks: 4