GitHub topics: video-transformer
fcakyon/video-transformers
Easiest way of fine-tuning HuggingFace video classification models
Language: Python - Size: 72.3 KB - Last synced at: about 2 months ago - Pushed at: about 2 years ago - Stars: 141 - Forks: 13

junchen14/Multi-Modal-Transformer
The repository collects many various multi-modal transformer architectures, including image transformer, video transformer, image-language transformer, video-language transformer and self-supervised learning models. Additionally, it also collects many useful tutorials and tools in these related domains.
Size: 354 KB - Last synced at: about 2 months ago - Pushed at: almost 3 years ago - Stars: 225 - Forks: 31

amazon-science/long-short-term-transformer
[NeurIPS 2021 Spotlight] Official implementation of Long Short-Term Transformer for Online Action Detection
Language: Python - Size: 142 KB - Last synced at: about 2 months ago - Pushed at: 10 months ago - Stars: 132 - Forks: 19

MCG-NJU/VideoMAE
[NeurIPS 2022 Spotlight] VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training
Language: Python - Size: 547 KB - Last synced at: 2 months ago - Pushed at: over 1 year ago - Stars: 1,451 - Forks: 142

aliemo/transfomers-silicon-research
Research and Materials on Hardware implementation of Transformer Model
Language: Jupyter Notebook - Size: 1.84 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 183 - Forks: 25

mdnuruzzamanKALLOL/VideoMAE_Tensorflow
VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training
Language: Python - Size: 337 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

mlvlab/vid-TLDR
Official implementation of CVPR 2024 paper "vid-TLDR: Training Free Token merging for Light-weight Video Transformer".
Language: Python - Size: 1.04 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 18 - Forks: 0

MCG-NJU/VideoMAE-Action-Detection
[NeurIPS 2022 Spotlight] VideoMAE for Action Detection
Language: Python - Size: 580 KB - Last synced at: about 1 year ago - Pushed at: over 2 years ago - Stars: 38 - Forks: 1

shotstack/mp4-to-mov-demo
A demo project showing how to convert an MP4 video to MOV format using the Shotstack Ingest API.
Language: JavaScript - Size: 130 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0
