Topic: "masked-autoencoder"
OpenGVLab/InternVideo
[ECCV2024] Video Foundation Models & Data for Multimodal Understanding
Language: Python - Size: 53.3 MB - Last synced at: 12 days ago - Pushed at: 4 months ago - Stars: 2,109 - Forks: 132
MCG-NJU/VideoMAE
[NeurIPS 2022 Spotlight] VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training
Language: Python - Size: 547 KB - Last synced at: 9 months ago - Pushed at: almost 2 years ago - Stars: 1,451 - Forks: 142
keyu-tian/SparK
[ICLR'23 Spotlightđ„] The first successful BERT/MAE-style pretraining on any convolutional network; Pytorch impl. of "Designing BERT for Convolutional Networks: Sparse and Hierarchical Masked Modeling"
Language: Python - Size: 699 KB - Last synced at: 6 months ago - Pushed at: almost 2 years ago - Stars: 1,343 - Forks: 82
EdisonLeeeee/Awesome-Masked-Autoencoders
A collection of literature after or concurrent with Masked Autoencoder (MAE) (Kaiming He el al.).
Size: 488 KB - Last synced at: 26 days ago - Pushed at: over 1 year ago - Stars: 856 - Forks: 54
Lupin1998/Awesome-MIM
[Survey] Masked Modeling for Self-supervised Representation Learning on Vision and Beyond (https://arxiv.org/abs/2401.00897)
Language: Python - Size: 6.7 MB - Last synced at: 1 day ago - Pushed at: 7 months ago - Stars: 350 - Forks: 17
uncbiag/SimpleClick
SimpleClick: Interactive Image Segmentation with Simple Vision Transformers (ICCV 2023)
Language: Python - Size: 40.2 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 239 - Forks: 40
implus/UM-MAE
Official Codes for "Uniform Masking: Enabling MAE Pre-training for Pyramid-based Vision Transformers with Locality"
Language: Jupyter Notebook - Size: 5.72 MB - Last synced at: over 1 year ago - Pushed at: almost 3 years ago - Stars: 231 - Forks: 20
xyzforever/BEVT
PyTorch implementation of BEVT (CVPR 2022) https://arxiv.org/abs/2112.01529
Language: Python - Size: 19.2 MB - Last synced at: 4 months ago - Pushed at: over 3 years ago - Stars: 162 - Forks: 19
implus/mae_segmentation
reproduction of semantic segmentation using masked autoencoder (mae)
Language: Python - Size: 198 KB - Last synced at: about 2 years ago - Pushed at: almost 4 years ago - Stars: 134 - Forks: 13
TonyLianLong/CrossMAE
Official Implementation of the CrossMAE paper: Rethinking Patch Dependence for Masked Autoencoders
Language: Python - Size: 1.5 MB - Last synced at: 2 months ago - Pushed at: 8 months ago - Stars: 117 - Forks: 7
nttcslab/m2d
Masked Modeling Duo: Towards a Universal Audio Pre-training Framework
Language: Jupyter Notebook - Size: 17.1 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 112 - Forks: 6
zubair-irshad/NeRF-MAE
[ECCV 2024] Pytorch code for our ECCV'24 paper NeRF-MAE: Masked AutoEncoders for Self-Supervised 3D Representation Learning for Neural Radiance Fields
Language: Python - Size: 4.47 MB - Last synced at: 6 months ago - Pushed at: 8 months ago - Stars: 101 - Forks: 4
Haochen-Wang409/HPM
[CVPR'23 & TPAMI'25] Hard Patches Mining for Masked Image Modeling
Language: Python - Size: 1.67 MB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 92 - Forks: 7
ruiwang2021/mvd
[CVPR2023] Masked Video Distillation: Rethinking Masked Feature Modeling for Self-supervised Video Representation Learning (https://arxiv.org/abs/2212.04500)
Language: Python - Size: 477 KB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 90 - Forks: 9
nttcslab/msm-mae
Masked Spectrogram Modeling using Masked Autoencoders for Learning General-purpose Audio Representations
Language: Jupyter Notebook - Size: 10 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 84 - Forks: 8
habla-liaa/encodecmae
Codebase for the paper 'EncodecMAE: Leveraging neural codecs for universal audio representation learning'
Language: Python - Size: 97.7 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 78 - Forks: 4
HKUDS/MAERec
[SIGIR'2023] "MAERec: Graph Masked Autoencoder for Sequential Recommendation"
Language: Python - Size: 80.3 MB - Last synced at: 5 months ago - Pushed at: over 2 years ago - Stars: 61 - Forks: 5
rishikksh20/AudioMAE-pytorch
Unofficial PyTorch implementation of Masked Autoencoders that Listen
Language: Python - Size: 226 KB - Last synced at: over 1 year ago - Pushed at: over 3 years ago - Stars: 60 - Forks: 6
recursionpharma/maes_microscopy
Official repo for Recursion's accepted spotlight paper at NeurIPS 2023 Generative AI & Biology workshop.
Language: Jupyter Notebook - Size: 4.86 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 59 - Forks: 13
lucidrains/LVMAE-pytorch
Implementation of the proposed LVMAE, from the paper, Extending Video Masked Autoencoders to 128 frames, in Pytorch
Language: Python - Size: 740 KB - Last synced at: 12 months ago - Pushed at: about 1 year ago - Stars: 45 - Forks: 1
MCG-NJU/VideoMAE-Action-Detection
[NeurIPS 2022 Spotlight] VideoMAE for Action Detection
Language: Python - Size: 580 KB - Last synced at: over 1 year ago - Pushed at: almost 3 years ago - Stars: 38 - Forks: 1
Event-AHU/VFM-Det
VFM-Det: Towards High-Performance Vehicle Detection via Large Foundation Models
Language: Python - Size: 7.15 MB - Last synced at: 2 months ago - Pushed at: 8 months ago - Stars: 36 - Forks: 3
naver-ai/augsub
[CVPR 2025] Official PyTorch implementation of MaskSub "Masking meets Supervision: A Strong Learning Alliance"
Language: Python - Size: 251 KB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 34 - Forks: 1
Westlake-AI/A2MIM
[ICML 2023] Architecture-Agnostic Masked Image Modeling -- From ViT back to CNN
Language: Python - Size: 174 KB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 28 - Forks: 3
shlokk/mae-contrastive
Official implementation of "A simple, efficient and scalable contrastive masked autoencoder for learning visual representations".
Language: Python - Size: 598 KB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 21 - Forks: 1
FengheTan9/Hi-End-MAE
[MedIA 2025] Hi-End-MAE: Hierarchical encoder-driven masked autoencoders are stronger vision learners for medical image segmentation
Language: Python - Size: 2.29 MB - Last synced at: 28 days ago - Pushed at: 28 days ago - Stars: 19 - Forks: 4
sunilhoho/EVEREST
Official Pytorch implementation of EVEREST: Efficient Masked Video Autoencoder by Removing Redundant Spatiotemporal Tokens [ICML2024].
Language: Python - Size: 2.77 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 19 - Forks: 1
liruiw/Dec-SSL
Understanding Self-Supervised Learning in a Decentralized Setting
Language: Python - Size: 662 KB - Last synced at: over 1 year ago - Pushed at: about 3 years ago - Stars: 19 - Forks: 1
jakhac/CSMAE
Cross-Sensor Masked Autoencoder for Content Based Image Retrieval in Remote Sensing
Language: Python - Size: 1.07 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 16 - Forks: 4
lyhkevin/MT-Net
Multi-scale Transformer Network for Cross-Modality MR Image Synthesis (IEEE TMI)
Language: Python - Size: 13.1 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 16 - Forks: 1
bayartsogt-ya/albert-mongolian
ALBERT trained on Mongolian text corpus
Language: Jupyter Notebook - Size: 263 KB - Last synced at: over 2 years ago - Pushed at: almost 5 years ago - Stars: 15 - Forks: 2
mkang315/PK-YOLO
[WACV'25] Official implementation of "PK-YOLO: Pretrained Knowledge Guided YOLO for Brain Tumor Detection in Multiplane MRI Slices".
Language: Python - Size: 754 KB - Last synced at: 15 days ago - Pushed at: 15 days ago - Stars: 14 - Forks: 5
stoneMo/DeepAVFusion
Official codebase for "Unveiling the Power of Audio-Visual Early Fusion Transformers with Dense Interactions through Masked Modeling".
Language: Python - Size: 26.4 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 12 - Forks: 0
naver-ai/lut
[ECCV 2024] Official PyTorch implementation of LUT "Learning with Unmasked Tokens Drives Stronger Vision Learners"
Language: Python - Size: 4.67 MB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 11 - Forks: 0
yifanzhang-pro/M-MAE
Official implementation of Matrix Variational Masked Autoencoder (M-MAE) for ICML paper "Information Flow in Self-Supervised Learning" (https://arxiv.org/abs/2309.17281)
Language: Python - Size: 74.2 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 11 - Forks: 3
JJLi0427/CNN_Masked_Autoencoder
Design a patches masked autoencoder by CNN
Language: Python - Size: 1.07 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 11 - Forks: 0
Ryan21wy/HSIMAE
HSIMAE: A Unified Masked Autoencoder with large-scale pretraining for Hyperspectral Image Classification
Language: Python - Size: 75.2 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 10 - Forks: 0
Yusen-Peng/CascadeFormer
[arXiv preprint] đCascadeFormer: A Family of Two-stage Cascading Transformers for Skeleton-based Human Action Recognition
Language: Python - Size: 232 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 7 - Forks: 1
yc-cui/PEMAE
[TGRS 2024] PEMAE: Pixel-Wise Ensembled Masked Autoencoder for Multispectral Pan-Sharpening
Language: Python - Size: 326 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 7 - Forks: 1
MSA-LMC/MAE-SFER
MAE pre-training models (ViT and ConvNeXt) using AffectNet images for static facial expression recognition (SFER).
Language: Python - Size: 168 KB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 7 - Forks: 1
waldo-vision/models
Repository for model development and training
Language: Python - Size: 665 KB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 7 - Forks: 4
danelpeng/RDMAE_Nav
A robust embodied navigation agent to various visual corruptions.
Language: Python - Size: 11.1 MB - Last synced at: 6 months ago - Pushed at: 8 months ago - Stars: 6 - Forks: 0
jonahanton/SSL_audio
Codebase for Imperial MSc AI Individual Project - Self-Supervised Learning for Audio Inference
Language: Python - Size: 59.3 MB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 6 - Forks: 0
EdisonLeeeee/lrGAE
A comprehensive (masked) graph autoencoders benchmark.
Language: Python - Size: 1.53 MB - Last synced at: 4 months ago - Pushed at: 12 months ago - Stars: 5 - Forks: 0
samsad35/VQ-MAE-S-code
A Vector Quantized Masked AutoEncoder for speech emotion recognition
Language: Python - Size: 4.71 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 5 - Forks: 0
chris-santiago/met
Reproducing the MET framework with PyTorch
Language: Python - Size: 10.9 MB - Last synced at: 28 days ago - Pushed at: almost 2 years ago - Stars: 5 - Forks: 1
mvrl/BirdSAT
A PyTorch implementation of "BirdSAT: Cross-View Contrastive Masked Autoencoders for Bird Species Classification and Mapping"
Language: Python - Size: 6.34 MB - Last synced at: almost 2 years ago - Pushed at: about 2 years ago - Stars: 4 - Forks: 2
YunghuiHsu/ebird_project
Extraction of deep features/representation of birds by deep learning algorithms.
Language: Jupyter Notebook - Size: 8.42 MB - Last synced at: almost 2 years ago - Pushed at: over 2 years ago - Stars: 4 - Forks: 1
isno0907/ldmae
Latent Diffusion Models with Masked AutoEncoders (LDMAE) official code
Language: Python - Size: 15.6 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 3 - Forks: 0
asbjrnmunk/amaes
Masked Autoencoder Pretraining on 3D Brain MRI
Language: Python - Size: 187 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 3 - Forks: 0
maple-research-lab/AdPE
code for "AdPE: Adversarial Positional Embeddings for Pretraining Vision Transformers via MAE+"
Language: Python - Size: 95.7 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 3 - Forks: 0
vahidzee/torchde
PyTorch wrapper for Deep Density Estimation Models
Language: Python - Size: 181 KB - Last synced at: 3 months ago - Pushed at: over 3 years ago - Stars: 3 - Forks: 1
Talented-Q/MSMAE
Medical supervised masked autoencoders: Crafting a better masking strategy and efficient fine-tuning schedule for medical image classification
Language: Python - Size: 20.3 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 2 - Forks: 0
dan-crdll/nn_project_dreamdiffusion
Re-implementation of the method proposed in ''DreamDiffusion: Generating High-Quality Images from Brain EEG Signals'' by Y. Bai, X. Wang et al. for Neural Network Course exam Topics
Language: Jupyter Notebook - Size: 5.56 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 2 - Forks: 0
sinatayebati/R-MAE
R-MAE: Pre-training LiDAR Perception with Masked Autoencoders for Ultra-Efficient 3D Sensing
Language: Python - Size: 6.53 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 2 - Forks: 0
e-hulten/made
PyTorch implementation of MADE
Language: Python - Size: 1.16 MB - Last synced at: over 2 years ago - Pushed at: about 5 years ago - Stars: 2 - Forks: 0
AlexMaks02/CGD-MAE
CLIP Distillation-Driven pre-training framework for vehicle re-identification
Language: Python - Size: 1.5 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 1 - Forks: 0
samsad35/VQ-MAE-AudioVisual-code
A vector quantized masked autoencoder for audiovisual speech emotion recognition
Language: Python - Size: 21.7 MB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 1 - Forks: 0
samsad35/code-ancogen
AnCoGen: Analysis, Control and Generation of Speech with a Masked Autoencoder
Language: Python - Size: 2.49 MB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 1 - Forks: 0
s35lay/PMCF
"Adaptive Filter Attention for Multi-level Structure-Aware Graph Self-supervised Learning"
Language: Python - Size: 10.2 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0
eminorhan/optimized-mae
An optimized implementation of masked autoencoders (MAEs)
Language: Python - Size: 69.3 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 1
thinh-re/mae
Train MAE on Kaggle 2 GPUs (T4x2), Log to Wandb
Language: Python - Size: 899 KB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 1 - Forks: 0
cell-observatory/cell_observatory_platform
Training backend and some self-supervised pretraining methods for Cell Observatory models
Language: Python - Size: 1.31 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 0 - Forks: 1
AlbughdadiM/geo-moe-mae
Mixture-of-Experts Masked AutoEncoder for Earth Observation.
Language: Jupyter Notebook - Size: 33.2 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0
sathishkumar67/Masked-Autoencoders-Are-Scalable-Vision-Learners
Implementation of Masked AutoEncoder
Language: Python - Size: 9.51 MB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0
max-kuk/bimae_seed_classification
BiMAE - A Bimodal Masked Autoencoder Architecture for Single-Label Hyperspectral Image Classification, CVPRW 2024
Language: Python - Size: 298 KB - Last synced at: 3 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0
jwmao1/MSMAE
Medical supervised masked autoencoders: Crafting a better masking strategy and efficient fine-tuning schedule for medical image classification
Language: Python - Size: 20.3 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0
teddyld/mae-torch
A PyTorch implementation of Masked Autoencoders are Scalable Vision Learners in Jupyter Notebook
Language: Jupyter Notebook - Size: 261 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0
turhancan97/Learning-by-Reconstruction-with-MAE
Enhancing Representation Learning in Masked Autoencoders by Focusing on Low-Variance Components
Language: Python - Size: 48.9 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0
ZY-LIi/Diffusion_Enhanced_Masked_Autoencoder
Pre-training a Masked Autoencoder with the idea of Diffusion Models for Hyperspectral Image Classification.
Language: Python - Size: 14.2 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0
ImRTon/TiMAE-Lightning
Unofficial implementation of Ti-MAE in PyTorch Lightning.
Language: Python - Size: 73.2 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0
lambda-xyz-01/AGMAE
AG-MAE: Anatomically Guided Spatio-Temporal Masked Auto-Encoder for Online Hand Gesture Recognition
Language: Python - Size: 1.37 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0
eminorhan/optimized-stmae
An optimized implementation of spatiotemporal masked autoencoders
Language: Python - Size: 152 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0
RichardScottOZ/grid-mae
Investigate possibilities for Vision Transformers with multiscale grids
Language: Python - Size: 114 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0
RichardScottOZ/torchgeo-cimae Fork of Modexus/torchgeo
TorchGeo: datasets, transforms, and models for geospatial data
Language: Python - Size: 72 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0
Video-MAC/VideoMAC
Official code for CVPR2024 âVideoMAC: Video Masked Autoencoders Meet ConvNetsâ
Language: Python - Size: 5 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 1
LorenzoBonanni/CVProject
Project for Computer Vision course @ MSc in Artificial Intelligence, UniVR
Language: TeX - Size: 5.45 MB - Last synced at: about 1 year ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0
sebastienmeyer2/masked-change-detection
Change detection on satellite images with masked autoencoders.
Language: Python - Size: 5.57 MB - Last synced at: almost 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0