Topic: "masked-autoencoder"
OpenGVLab/InternVideo
[ECCV2024] Video Foundation Models & Data for Multimodal Understanding
Language: Python - Size: 53.2 MB - Last synced at: 1 day ago - Pushed at: 14 days ago - Stars: 1,899 - Forks: 112

MCG-NJU/VideoMAE
[NeurIPS 2022 Spotlight] VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training
Language: Python - Size: 547 KB - Last synced at: 3 months ago - Pushed at: over 1 year ago - Stars: 1,451 - Forks: 142

keyu-tian/SparK
[ICLR'23 Spotlight🔥] The first successful BERT/MAE-style pretraining on any convolutional network; Pytorch impl. of "Designing BERT for Convolutional Networks: Sparse and Hierarchical Masked Modeling"
Language: Python - Size: 699 KB - Last synced at: 16 days ago - Pushed at: over 1 year ago - Stars: 1,343 - Forks: 82

EdisonLeeeee/Awesome-Masked-Autoencoders
A collection of literature after or concurrent with Masked Autoencoder (MAE) (Kaiming He et al.).
Size: 488 KB - Last synced at: about 1 month ago - Pushed at: 11 months ago - Stars: 823 - Forks: 52

Lupin1998/Awesome-MIM
[Survey] Masked Modeling for Self-supervised Representation Learning on Vision and Beyond (https://arxiv.org/abs/2401.00897)
Language: Python - Size: 6.7 MB - Last synced at: 13 days ago - Pushed at: about 2 months ago - Stars: 331 - Forks: 16

implus/UM-MAE
Official Codes for "Uniform Masking: Enabling MAE Pre-training for Pyramid-based Vision Transformers with Locality"
Language: Jupyter Notebook - Size: 5.72 MB - Last synced at: about 1 year ago - Pushed at: over 2 years ago - Stars: 231 - Forks: 20

uncbiag/SimpleClick
SimpleClick: Interactive Image Segmentation with Simple Vision Transformers (ICCV 2023)
Language: Python - Size: 40.2 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 225 - Forks: 38

xyzforever/BEVT
PyTorch implementation of BEVT (CVPR 2022) https://arxiv.org/abs/2112.01529
Language: Python - Size: 19.2 MB - Last synced at: 6 months ago - Pushed at: almost 3 years ago - Stars: 158 - Forks: 19

implus/mae_segmentation
Reproduction of semantic segmentation using masked autoencoder (MAE)
Language: Python - Size: 198 KB - Last synced at: over 1 year ago - Pushed at: over 3 years ago - Stars: 134 - Forks: 13

TonyLianLong/CrossMAE
Official Implementation of the CrossMAE paper: Rethinking Patch Dependence for Masked Autoencoders
Language: Python - Size: 1.5 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 107 - Forks: 6

zubair-irshad/NeRF-MAE
[ECCV 2024] Pytorch code for our ECCV'24 paper NeRF-MAE: Masked AutoEncoders for Self-Supervised 3D Representation Learning for Neural Radiance Fields
Language: Python - Size: 4.47 MB - Last synced at: 19 days ago - Pushed at: 3 months ago - Stars: 101 - Forks: 4

Haochen-Wang409/HPM
[CVPR'23 & TPAMI'25] Hard Patches Mining for Masked Image Modeling
Language: Python - Size: 1.67 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 92 - Forks: 7

ruiwang2021/mvd
[CVPR2023] Masked Video Distillation: Rethinking Masked Feature Modeling for Self-supervised Video Representation Learning (https://arxiv.org/abs/2212.04500)
Language: Python - Size: 477 KB - Last synced at: about 1 year ago - Pushed at: about 2 years ago - Stars: 90 - Forks: 9

nttcslab/msm-mae
Masked Spectrogram Modeling using Masked Autoencoders for Learning General-purpose Audio Representations
Language: Jupyter Notebook - Size: 10 MB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 84 - Forks: 8

habla-liaa/encodecmae
Codebase for the paper 'EncodecMAE: Leveraging neural codecs for universal audio representation learning'
Language: Python - Size: 97.7 KB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 78 - Forks: 4

rishikksh20/AudioMAE-pytorch
Unofficial PyTorch implementation of Masked Autoencoders that Listen
Language: Python - Size: 226 KB - Last synced at: about 1 year ago - Pushed at: almost 3 years ago - Stars: 60 - Forks: 6

recursionpharma/maes_microscopy
Official repo for Recursion's accepted spotlight paper at NeurIPS 2023 Generative AI & Biology workshop.
Language: Jupyter Notebook - Size: 4.86 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 59 - Forks: 13

nttcslab/m2d
Masked Modeling Duo: Towards a Universal Audio Pre-training Framework
Language: Jupyter Notebook - Size: 17.1 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 57 - Forks: 1

HKUDS/MAERec
[SIGIR'2023] "MAERec: Graph Masked Autoencoder for Sequential Recommendation"
Language: Python - Size: 80.3 MB - Last synced at: about 1 year ago - Pushed at: about 2 years ago - Stars: 50 - Forks: 5

lucidrains/LVMAE-pytorch
Implementation of the proposed LVMAE, from the paper, Extending Video Masked Autoencoders to 128 frames, in Pytorch
Language: Python - Size: 740 KB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 45 - Forks: 1

MCG-NJU/VideoMAE-Action-Detection
[NeurIPS 2022 Spotlight] VideoMAE for Action Detection
Language: Python - Size: 580 KB - Last synced at: about 1 year ago - Pushed at: over 2 years ago - Stars: 38 - Forks: 1

naver-ai/augsub
[CVPR 2025] Official PyTorch implementation of MaskSub "Masking meets Supervision: A Strong Learning Alliance"
Language: Python - Size: 251 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 34 - Forks: 1

Event-AHU/VFM-Det
VFM-Det: Towards High-Performance Vehicle Detection via Large Foundation Models
Language: Python - Size: 7.14 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 31 - Forks: 3

Westlake-AI/A2MIM
[ICML 2023] Architecture-Agnostic Masked Image Modeling -- From ViT back to CNN
Language: Python - Size: 174 KB - Last synced at: 25 days ago - Pushed at: 10 months ago - Stars: 27 - Forks: 4

shlokk/mae-contrastive
Official implementation of "A simple, efficient and scalable contrastive masked autoencoder for learning visual representations".
Language: Python - Size: 598 KB - Last synced at: about 1 year ago - Pushed at: about 2 years ago - Stars: 21 - Forks: 1

sunilhoho/EVEREST
Official Pytorch implementation of EVEREST: Efficient Masked Video Autoencoder by Removing Redundant Spatiotemporal Tokens [ICML2024].
Language: Python - Size: 2.77 MB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 19 - Forks: 1

liruiw/Dec-SSL
Understanding Self-Supervised Learning in a Decentralized Setting
Language: Python - Size: 662 KB - Last synced at: about 1 year ago - Pushed at: over 2 years ago - Stars: 19 - Forks: 1

jakhac/CSMAE
Cross-Sensor Masked Autoencoder for Content Based Image Retrieval in Remote Sensing
Language: Python - Size: 1.07 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 16 - Forks: 4

lyhkevin/MT-Net
Multi-scale Transformer Network for Cross-Modality MR Image Synthesis (IEEE TMI)
Language: Python - Size: 13.1 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 16 - Forks: 1

bayartsogt-ya/albert-mongolian
ALBERT trained on Mongolian text corpus
Language: Jupyter Notebook - Size: 263 KB - Last synced at: over 2 years ago - Pushed at: over 4 years ago - Stars: 15 - Forks: 2

stoneMo/DeepAVFusion
Official codebase for "Unveiling the Power of Audio-Visual Early Fusion Transformers with Dense Interactions through Masked Modeling".
Language: Python - Size: 26.4 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 12 - Forks: 0

naver-ai/lut
[ECCV 2024] Official PyTorch implementation of LUT "Learning with Unmasked Tokens Drives Stronger Vision Learners"
Language: Python - Size: 4.67 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 11 - Forks: 0

yifanzhang-pro/M-MAE
Official implementation of Matrix Variational Masked Autoencoder (M-MAE) for ICML paper "Information Flow in Self-Supervised Learning" (https://arxiv.org/abs/2309.17281)
Language: Python - Size: 74.2 KB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 11 - Forks: 3

JJLi0427/CNN_Masked_Autoencoder
A patch-based masked autoencoder built with a CNN
Language: Python - Size: 1.07 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 11 - Forks: 0

Ryan21wy/HSIMAE
HSIMAE: A Unified Masked Autoencoder with large-scale pretraining for Hyperspectral Image Classification
Language: Python - Size: 75.2 KB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 10 - Forks: 0

mkang315/PK-YOLO
[WACV'25] Official implementation of "PK-YOLO: Pretrained Knowledge Guided YOLO for Brain Tumor Detection in Multiplane MRI Slices".
Language: Python - Size: 792 KB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 8 - Forks: 3

MSA-LMC/MAE-SFER
MAE pre-training models (ViT and ConvNeXt) using AffectNet images for static facial expression recognition (SFER).
Language: Python - Size: 168 KB - Last synced at: 4 days ago - Pushed at: 5 days ago - Stars: 7 - Forks: 1

waldo-vision/models
Repository for model development and training
Language: Python - Size: 665 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 7 - Forks: 4

jonahanton/SSL_audio
Codebase for Imperial MSc AI Individual Project - Self-Supervised Learning for Audio Inference
Language: Python - Size: 59.3 MB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 6 - Forks: 0

danelpeng/RDMAE_Nav
An embodied navigation agent robust to various visual corruptions.
Language: Python - Size: 11.1 MB - Last synced at: about 2 months ago - Pushed at: 3 months ago - Stars: 5 - Forks: 0

EdisonLeeeee/lrGAE
A comprehensive (masked) graph autoencoders benchmark.
Language: Python - Size: 1.53 MB - Last synced at: about 1 month ago - Pushed at: 6 months ago - Stars: 5 - Forks: 0

samsad35/VQ-MAE-S-code
A Vector Quantized Masked AutoEncoder for speech emotion recognition
Language: Python - Size: 4.71 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 5 - Forks: 0

chris-santiago/met
Reproducing the MET framework with PyTorch
Language: Python - Size: 10.9 MB - Last synced at: 6 days ago - Pushed at: over 1 year ago - Stars: 4 - Forks: 1

mvrl/BirdSAT
A PyTorch implementation of "BirdSAT: Cross-View Contrastive Masked Autoencoders for Bird Species Classification and Mapping"
Language: Python - Size: 6.34 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 4 - Forks: 2

YunghuiHsu/ebird_project
Extraction of deep features/representation of birds by deep learning algorithms.
Language: Jupyter Notebook - Size: 8.42 MB - Last synced at: over 1 year ago - Pushed at: about 2 years ago - Stars: 4 - Forks: 1

asbjrnmunk/amaes
Masked Autoencoder Pretraining on 3D Brain MRI
Language: Python - Size: 187 KB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 3 - Forks: 0

tomouellette/autoencodersplz
Generative modeling and representation learning through reconstruction
Language: Python - Size: 27.8 MB - Last synced at: 28 days ago - Pushed at: over 1 year ago - Stars: 3 - Forks: 0

maple-research-lab/AdPE
code for "AdPE: Adversarial Positional Embeddings for Pretraining Vision Transformers via MAE+"
Language: Python - Size: 95.7 KB - Last synced at: over 1 year ago - Pushed at: about 2 years ago - Stars: 3 - Forks: 0

vahidzee/torchde
PyTorch wrapper for Deep Density Estimation Models
Language: Python - Size: 181 KB - Last synced at: about 2 months ago - Pushed at: almost 3 years ago - Stars: 3 - Forks: 1

Talented-Q/MSMAE
Medical supervised masked autoencoders: Crafting a better masking strategy and efficient fine-tuning schedule for medical image classification
Language: Python - Size: 20.3 MB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 2 - Forks: 0

dan-crdll/nn_project_dreamdiffusion
Re-implementation of the method proposed in "DreamDiffusion: Generating High-Quality Images from Brain EEG Signals" by Y. Bai, X. Wang et al. for Neural Network Course exam
Language: Jupyter Notebook - Size: 5.56 MB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 2 - Forks: 0

sinatayebati/R-MAE
R-MAE: Pre-training LiDAR Perception with Masked Autoencoders for Ultra-Efficient 3D Sensing
Language: Python - Size: 6.53 MB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 2 - Forks: 0

e-hulten/made
PyTorch implementation of MADE
Language: Python - Size: 1.16 MB - Last synced at: over 2 years ago - Pushed at: over 4 years ago - Stars: 2 - Forks: 0

samsad35/VQ-MAE-AudioVisual-code
A vector quantized masked autoencoder for audiovisual speech emotion recognition
Language: Python - Size: 21.7 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 1 - Forks: 0

samsad35/code-ancogen
AnCoGen: Analysis, Control and Generation of Speech with a Masked Autoencoder
Language: Python - Size: 2.49 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 1 - Forks: 0

s35lay/PMCF
"Adaptive Filter Attention for Multi-level Structure-Aware Graph Self-supervised Learning"
Language: Python - Size: 10.2 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 1 - Forks: 0

eminorhan/optimized-mae
An optimized implementation of masked autoencoders (MAEs)
Language: Python - Size: 69.3 KB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 1 - Forks: 1

thinh-re/mae
Train MAE on Kaggle 2 GPUs (T4x2), Log to Wandb
Language: Python - Size: 899 KB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

sathishkumar67/Masked-Autoencoders-Are-Scalable-Vision-Learners
Implementation of Masked AutoEncoder
Language: Python - Size: 9.51 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

max-kuk/bimae_seed_classification
BiMAE - A Bimodal Masked Autoencoder Architecture for Single-Label Hyperspectral Image Classification, CVPRW 2024
Language: Python - Size: 298 KB - Last synced at: about 2 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

jwmao1/MSMAE
Medical supervised masked autoencoders: Crafting a better masking strategy and efficient fine-tuning schedule for medical image classification
Language: Python - Size: 20.3 MB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 0 - Forks: 0

teddyld/mae-torch
A PyTorch implementation of Masked Autoencoders are Scalable Vision Learners in Jupyter Notebook
Language: Jupyter Notebook - Size: 261 KB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

turhancan97/Learning-by-Reconstruction-with-MAE
Enhancing Representation Learning in Masked Autoencoders by Focusing on Low-Variance Components
Language: Python - Size: 48.9 MB - Last synced at: 9 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

ZY-LIi/Diffusion_Enhanced_Masked_Autoencoder
Pre-training a Masked Autoencoder with the idea of Diffusion Models for Hyperspectral Image Classification.
Language: Python - Size: 14.2 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

ImRTon/TiMAE-Lightning
Unofficial implementation of Ti-MAE in PyTorch Lightning.
Language: Python - Size: 73.2 KB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

lambda-xyz-01/AGMAE
AG-MAE: Anatomically Guided Spatio-Temporal Masked Auto-Encoder for Online Hand Gesture Recognition
Language: Python - Size: 1.37 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

eminorhan/optimized-stmae
An optimized implementation of spatiotemporal masked autoencoders
Language: Python - Size: 152 KB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

RichardScottOZ/grid-mae
Investigate possibilities for Vision Transformers with multiscale grids
Language: Python - Size: 114 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

RichardScottOZ/torchgeo-cimae (fork of Modexus/torchgeo)
TorchGeo: datasets, transforms, and models for geospatial data
Language: Python - Size: 72 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

Video-MAC/VideoMAC
Official code for CVPR2024 “VideoMAC: Video Masked Autoencoders Meet ConvNets”
Language: Python - Size: 5 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 1

LorenzoBonanni/CVProject
Project for Computer Vision course @ MSc in Artificial Intelligence, UniVR
Language: TeX - Size: 5.45 MB - Last synced at: 7 months ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

sebastienmeyer2/masked-change-detection
Change detection on satellite images with masked autoencoders.
Language: Python - Size: 5.57 MB - Last synced at: over 1 year ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0
