video-representation-learning | Topic

Topic: "video-representation-learning"

MCG-NJU/VideoMAE

[NeurIPS 2022 Spotlight] VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training

Language: Python - Size: 547 KB - Last synced at: 2 months ago - Pushed at: over 1 year ago - Stars: 1,451 - Forks: 142

ttengwang/Awesome_Long_Form_Video_Understanding

Awesome papers & datasets specifically focused on long-term videos.

Size: 44.9 KB - Last synced at: 21 days ago - Pushed at: 7 months ago - Stars: 270 - Forks: 12

cvlab-columbia/hyperfuture

Code for the paper Learning the Predictability of the Future (CVPR 2021)

Language: Python - Size: 40 KB - Last synced at: about 2 months ago - Pushed at: almost 2 years ago - Stars: 166 - Forks: 26

xyzforever/BEVT

PyTorch implementation of BEVT (CVPR 2022) https://arxiv.org/abs/2112.01529

Language: Python - Size: 19.2 MB - Last synced at: 6 months ago - Pushed at: almost 3 years ago - Stars: 158 - Forks: 19

ruiwang2021/mvd

[CVPR2023] Masked Video Distillation: Rethinking Masked Feature Modeling for Self-supervised Video Representation Learning (https://arxiv.org/abs/2212.04500)

Language: Python - Size: 477 KB - Last synced at: about 1 year ago - Pushed at: about 2 years ago - Stars: 90 - Forks: 9

GV1028/videogan

Implementation of "Generating Videos with Scene Dynamics" in Tensorflow

Language: Python - Size: 731 KB - Last synced at: 6 months ago - Pushed at: about 7 years ago - Stars: 77 - Forks: 20

xiaojieli0903/MaskAgain

Official repository of the “Mask Again: Masked Knowledge Distillation for Masked Video Modeling” (ACM MM 2023)

Language: Python - Size: 8.05 MB - Last synced at: about 2 months ago - Pushed at: 11 months ago - Stars: 27 - Forks: 0

This is the code accompanying the AAAI 2022 paper "Ranking Info Noise Contrastive Estimation: Boosting Contrastive Learning via Ranked Positives" https://arxiv.org/abs/2201.11736 . The method allows you to use additional ranking information for representation learning.

Language: Python - Size: 13.7 MB - Last synced at: 21 days ago - Pushed at: almost 3 years ago - Stars: 25 - Forks: 4

xiaojieli0903/FGKVMemPred_video

Official repository of the "Fine-grained Key-Value Memory Enhanced Predictor for Video Representation Learning" (ACM MM 2023)

Language: Python - Size: 2.79 MB - Last synced at: about 2 months ago - Pushed at: 11 months ago - Stars: 23 - Forks: 0

ihaeyong/PFNR

Winning SubNetwork (WSN), Fourier Subneural Operator (FSO), Video-Incremental Learning (VIL), Sequential Neural Implicit Representation (NIR)

Language: Python - Size: 353 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 22 - Forks: 4

sunilhoho/EVEREST

Official Pytorch implementation of EVEREST: Efficient Masked Video Autoencoder by Removing Redundant Spatiotemporal Tokens [ICML2024].

Language: Python - Size: 2.77 MB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 19 - Forks: 1

oooolga/JEDi

👆PyTorch Implementation of JEDi Metric described in "Beyond FVD: Enhanced Evaluation Metrics for Video Generation Quality"

Language: Python - Size: 460 KB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 14 - Forks: 3

furushchev/chainervr

Chainer implementation of Networks for Learning Video Representations

Language: Python - Size: 2.68 MB - Last synced at: almost 2 years ago - Pushed at: almost 7 years ago - Stars: 8 - Forks: 1

UARK-AICV/Video_Representation

[Asilomar 2022] Contextual Explainable Video Representation: Human Perception-based Understanding

Size: 3.91 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 3 - Forks: 0

Mallory24/cae_modeling

The official repository for the IJCNLP-AACL 2023 paper: Implicit Affordance Acquisition via Causal Action–Effect Modeling in the Video Domain

Language: Python - Size: 620 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

XFeiF/ComputerVision_PaperNotes

📚 Paper Notes (Computer vision)

Size: 3.17 MB - Last synced at: 6 months ago - Pushed at: about 4 years ago - Stars: 1 - Forks: 0

mdnuruzzamanKALLOL/VideoMAE_Tensorflow

VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training

Language: Python - Size: 337 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

Video-MAC/VideoMAC

Official code for CVPR2024 “VideoMAC: Video Masked Autoencoders Meet ConvNets”

Language: Python - Size: 5 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 1

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Repos