Topic: "video-representation-learning"
MCG-NJU/VideoMAE
[NeurIPS 2022 Spotlight] VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training
Language: Python - Size: 547 KB - Last synced at: 2 months ago - Pushed at: over 1 year ago - Stars: 1,451 - Forks: 142

ttengwang/Awesome_Long_Form_Video_Understanding
Awesome papers & datasets specifically focused on long-term videos.
Size: 44.9 KB - Last synced at: 21 days ago - Pushed at: 7 months ago - Stars: 270 - Forks: 12

cvlab-columbia/hyperfuture
Code for the paper Learning the Predictability of the Future (CVPR 2021)
Language: Python - Size: 40 KB - Last synced at: about 2 months ago - Pushed at: almost 2 years ago - Stars: 166 - Forks: 26

xyzforever/BEVT
PyTorch implementation of BEVT (CVPR 2022) https://arxiv.org/abs/2112.01529
Language: Python - Size: 19.2 MB - Last synced at: 6 months ago - Pushed at: almost 3 years ago - Stars: 158 - Forks: 19

ruiwang2021/mvd
[CVPR2023] Masked Video Distillation: Rethinking Masked Feature Modeling for Self-supervised Video Representation Learning (https://arxiv.org/abs/2212.04500)
Language: Python - Size: 477 KB - Last synced at: about 1 year ago - Pushed at: about 2 years ago - Stars: 90 - Forks: 9

GV1028/videogan
Implementation of "Generating Videos with Scene Dynamics" in Tensorflow
Language: Python - Size: 731 KB - Last synced at: 6 months ago - Pushed at: about 7 years ago - Stars: 77 - Forks: 20

xiaojieli0903/MaskAgain
Official repository of the “Mask Again: Masked Knowledge Distillation for Masked Video Modeling” (ACM MM 2023)
Language: Python - Size: 8.05 MB - Last synced at: about 2 months ago - Pushed at: 11 months ago - Stars: 27 - Forks: 0

boschresearch/rince 📦
This is the code accompanying the AAAI 2022 paper "Ranking Info Noise Contrastive Estimation: Boosting Contrastive Learning via Ranked Positives" https://arxiv.org/abs/2201.11736 . The method allows you to use additional ranking information for representation learning.
Language: Python - Size: 13.7 MB - Last synced at: 21 days ago - Pushed at: almost 3 years ago - Stars: 25 - Forks: 4

xiaojieli0903/FGKVMemPred_video
Official repository of the "Fine-grained Key-Value Memory Enhanced Predictor for Video Representation Learning" (ACM MM 2023)
Language: Python - Size: 2.79 MB - Last synced at: about 2 months ago - Pushed at: 11 months ago - Stars: 23 - Forks: 0

ihaeyong/PFNR
Winning SubNetwork (WSN), Fourier Subneural Operator (FSO), Video-Incremental Learning (VIL), Sequential Neural Implicit Representation (NIR)
Language: Python - Size: 353 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 22 - Forks: 4

sunilhoho/EVEREST
Official Pytorch implementation of EVEREST: Efficient Masked Video Autoencoder by Removing Redundant Spatiotemporal Tokens [ICML2024].
Language: Python - Size: 2.77 MB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 19 - Forks: 1

oooolga/JEDi
👆PyTorch Implementation of JEDi Metric described in "Beyond FVD: Enhanced Evaluation Metrics for Video Generation Quality"
Language: Python - Size: 460 KB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 14 - Forks: 3

furushchev/chainervr
Chainer implementation of Networks for Learning Video Representations
Language: Python - Size: 2.68 MB - Last synced at: almost 2 years ago - Pushed at: almost 7 years ago - Stars: 8 - Forks: 1

UARK-AICV/Video_Representation
[Asilomar 2022] Contextual Explainable Video Representation: Human Perception-based Understanding
Size: 3.91 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 3 - Forks: 0

Mallory24/cae_modeling
The official repository for the IJCNLP-AACL 2023 paper: Implicit Affordance Acquisition via Causal Action–Effect Modeling in the Video Domain
Language: Python - Size: 620 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

XFeiF/ComputerVision_PaperNotes
📚 Paper Notes (Computer vision)
Size: 3.17 MB - Last synced at: 6 months ago - Pushed at: about 4 years ago - Stars: 1 - Forks: 0

mdnuruzzamanKALLOL/VideoMAE_Tensorflow
VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training
Language: Python - Size: 337 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

Video-MAC/VideoMAC
Official code for CVPR2024 “VideoMAC: Video Masked Autoencoders Meet ConvNets”
Language: Python - Size: 5 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 1
