GitHub / TIGER-AI-Lab / VISTA

The code for "VISTA: Enhancing Long-Duration and High-Resolution Video Understanding by VIdeo SpatioTemporal Augmentation" [CVPR2025]

JSON API: http://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/TIGER-AI-Lab%2FVISTA
PURL: pkg:github/TIGER-AI-Lab/VISTA

Stars: 16
Forks: 0
Open issues: 0

License: None
Language: Python
Size: 9.62 MB
Dependencies parsed at: Pending

Created at: 9 months ago
Updated at: 3 months ago
Pushed at: 6 months ago
Last synced at: 3 months ago

Topics: lm, multimodal, vlm

Loading...