GitHub / TIGER-AI-Lab / VISTA
The code for "VISTA: Enhancing Long-Duration and High-Resolution Video Understanding by VIdeo SpatioTemporal Augmentation" [CVPR 2025]
JSON API: http://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/TIGER-AI-Lab%2FVISTA
PURL: pkg:github/TIGER-AI-Lab/VISTA
Stars: 16
Forks: 0
Open issues: 0
License: None
Language: Python
Size: 9.62 MB
Dependencies parsed at: Pending
Created at: 9 months ago
Updated at: 3 months ago
Pushed at: 6 months ago
Last synced at: 3 months ago
Topics: lm, multimodal, vlm
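
The JSON API URL and the PURL listed above identify the same repository, and the API URL carries the `owner/name` pair percent-encoded as a single path segment (`%2F` for the slash). A minimal Python sketch of that relationship follows. The helper name `purl_to_api_url` and the mapping from the PURL type `github` to the host segment `GitHub` are assumptions inferred from this one listing, not a documented rule of the service.

```python
from urllib.parse import quote

# Base path taken from the JSON API line of this listing.
API_BASE = "http://repos.ecosyste.ms/api/v1/hosts"

# Assumed mapping from PURL type to the host segment used in the API URL;
# only the "github" entry is supported by this listing.
HOST_BY_PURL_TYPE = {"github": "GitHub"}


def purl_to_api_url(purl: str) -> str:
    """Build the JSON API URL for a simple 'pkg:<type>/<owner>/<name>' PURL.

    Hypothetical helper: it ignores PURL versions, qualifiers, and subpaths.
    """
    scheme, _, rest = purl.partition(":")
    if scheme != "pkg":
        raise ValueError(f"not a PURL: {purl!r}")

    purl_type, _, full_name = rest.partition("/")
    if purl_type not in HOST_BY_PURL_TYPE or not full_name:
        raise ValueError(f"unsupported PURL: {purl!r}")

    # safe="" forces the slash between owner and name to be encoded as %2F,
    # so the full name stays a single URL path segment.
    encoded_name = quote(full_name, safe="")
    return f"{API_BASE}/{HOST_BY_PURL_TYPE[purl_type]}/repositories/{encoded_name}"


if __name__ == "__main__":
    print(purl_to_api_url("pkg:github/TIGER-AI-Lab/VISTA"))
```

Passing `safe=""` matters here: `quote` leaves `/` unescaped by default, which would split the repository name across two path segments.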