An open API service providing repository metadata for many open source software ecosystems.

Topic: "masked-autoencoder"

OpenGVLab/InternVideo

[ECCV2024] Video Foundation Models & Data for Multimodal Understanding

Language: Python - Size: 53.3 MB - Last synced at: 12 days ago - Pushed at: 4 months ago - Stars: 2,109 - Forks: 132

MCG-NJU/VideoMAE

[NeurIPS 2022 Spotlight] VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training

Language: Python - Size: 547 KB - Last synced at: 9 months ago - Pushed at: almost 2 years ago - Stars: 1,451 - Forks: 142

keyu-tian/SparK

[ICLR'23 SpotlightđŸ”„] The first successful BERT/MAE-style pretraining on any convolutional network; Pytorch impl. of "Designing BERT for Convolutional Networks: Sparse and Hierarchical Masked Modeling"

Language: Python - Size: 699 KB - Last synced at: 6 months ago - Pushed at: almost 2 years ago - Stars: 1,343 - Forks: 82

EdisonLeeeee/Awesome-Masked-Autoencoders

A collection of literature after or concurrent with Masked Autoencoder (MAE) (Kaiming He el al.).

Size: 488 KB - Last synced at: 26 days ago - Pushed at: over 1 year ago - Stars: 856 - Forks: 54

Lupin1998/Awesome-MIM

[Survey] Masked Modeling for Self-supervised Representation Learning on Vision and Beyond (https://arxiv.org/abs/2401.00897)

Language: Python - Size: 6.7 MB - Last synced at: 1 day ago - Pushed at: 7 months ago - Stars: 350 - Forks: 17

uncbiag/SimpleClick

SimpleClick: Interactive Image Segmentation with Simple Vision Transformers (ICCV 2023)

Language: Python - Size: 40.2 MB - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 239 - Forks: 40

implus/UM-MAE

Official Codes for "Uniform Masking: Enabling MAE Pre-training for Pyramid-based Vision Transformers with Locality"

Language: Jupyter Notebook - Size: 5.72 MB - Last synced at: over 1 year ago - Pushed at: almost 3 years ago - Stars: 231 - Forks: 20

xyzforever/BEVT

PyTorch implementation of BEVT (CVPR 2022) https://arxiv.org/abs/2112.01529

Language: Python - Size: 19.2 MB - Last synced at: 4 months ago - Pushed at: over 3 years ago - Stars: 162 - Forks: 19

implus/mae_segmentation

reproduction of semantic segmentation using masked autoencoder (mae)

Language: Python - Size: 198 KB - Last synced at: about 2 years ago - Pushed at: almost 4 years ago - Stars: 134 - Forks: 13

TonyLianLong/CrossMAE

Official Implementation of the CrossMAE paper: Rethinking Patch Dependence for Masked Autoencoders

Language: Python - Size: 1.5 MB - Last synced at: 2 months ago - Pushed at: 8 months ago - Stars: 117 - Forks: 7

nttcslab/m2d

Masked Modeling Duo: Towards a Universal Audio Pre-training Framework

Language: Jupyter Notebook - Size: 17.1 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 112 - Forks: 6

zubair-irshad/NeRF-MAE

[ECCV 2024] Pytorch code for our ECCV'24 paper NeRF-MAE: Masked AutoEncoders for Self-Supervised 3D Representation Learning for Neural Radiance Fields

Language: Python - Size: 4.47 MB - Last synced at: 6 months ago - Pushed at: 8 months ago - Stars: 101 - Forks: 4

Haochen-Wang409/HPM

[CVPR'23 & TPAMI'25] Hard Patches Mining for Masked Image Modeling

Language: Python - Size: 1.67 MB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 92 - Forks: 7

ruiwang2021/mvd

[CVPR2023] Masked Video Distillation: Rethinking Masked Feature Modeling for Self-supervised Video Representation Learning (https://arxiv.org/abs/2212.04500)

Language: Python - Size: 477 KB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 90 - Forks: 9

nttcslab/msm-mae

Masked Spectrogram Modeling using Masked Autoencoders for Learning General-purpose Audio Representations

Language: Jupyter Notebook - Size: 10 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 84 - Forks: 8

habla-liaa/encodecmae

Codebase for the paper 'EncodecMAE: Leveraging neural codecs for universal audio representation learning'

Language: Python - Size: 97.7 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 78 - Forks: 4

HKUDS/MAERec

[SIGIR'2023] "MAERec: Graph Masked Autoencoder for Sequential Recommendation"

Language: Python - Size: 80.3 MB - Last synced at: 5 months ago - Pushed at: over 2 years ago - Stars: 61 - Forks: 5

rishikksh20/AudioMAE-pytorch

Unofficial PyTorch implementation of Masked Autoencoders that Listen

Language: Python - Size: 226 KB - Last synced at: over 1 year ago - Pushed at: over 3 years ago - Stars: 60 - Forks: 6

recursionpharma/maes_microscopy

Official repo for Recursion's accepted spotlight paper at NeurIPS 2023 Generative AI & Biology workshop.

Language: Jupyter Notebook - Size: 4.86 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 59 - Forks: 13

lucidrains/LVMAE-pytorch

Implementation of the proposed LVMAE, from the paper, Extending Video Masked Autoencoders to 128 frames, in Pytorch

Language: Python - Size: 740 KB - Last synced at: 12 months ago - Pushed at: about 1 year ago - Stars: 45 - Forks: 1

MCG-NJU/VideoMAE-Action-Detection

[NeurIPS 2022 Spotlight] VideoMAE for Action Detection

Language: Python - Size: 580 KB - Last synced at: over 1 year ago - Pushed at: almost 3 years ago - Stars: 38 - Forks: 1

Event-AHU/VFM-Det

VFM-Det: Towards High-Performance Vehicle Detection via Large Foundation Models

Language: Python - Size: 7.15 MB - Last synced at: 2 months ago - Pushed at: 8 months ago - Stars: 36 - Forks: 3

naver-ai/augsub

[CVPR 2025] Official PyTorch implementation of MaskSub "Masking meets Supervision: A Strong Learning Alliance"

Language: Python - Size: 251 KB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 34 - Forks: 1

Westlake-AI/A2MIM

[ICML 2023] Architecture-Agnostic Masked Image Modeling -- From ViT back to CNN

Language: Python - Size: 174 KB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 28 - Forks: 3

shlokk/mae-contrastive

Official implementation of "A simple, efficient and scalable contrastive masked autoencoder for learning visual representations".

Language: Python - Size: 598 KB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 21 - Forks: 1

FengheTan9/Hi-End-MAE

[MedIA 2025] Hi-End-MAE: Hierarchical encoder-driven masked autoencoders are stronger vision learners for medical image segmentation

Language: Python - Size: 2.29 MB - Last synced at: 28 days ago - Pushed at: 28 days ago - Stars: 19 - Forks: 4

sunilhoho/EVEREST

Official Pytorch implementation of EVEREST: Efficient Masked Video Autoencoder by Removing Redundant Spatiotemporal Tokens [ICML2024].

Language: Python - Size: 2.77 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 19 - Forks: 1

liruiw/Dec-SSL

Understanding Self-Supervised Learning in a Decentralized Setting

Language: Python - Size: 662 KB - Last synced at: over 1 year ago - Pushed at: about 3 years ago - Stars: 19 - Forks: 1

jakhac/CSMAE

Cross-Sensor Masked Autoencoder for Content Based Image Retrieval in Remote Sensing

Language: Python - Size: 1.07 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 16 - Forks: 4

lyhkevin/MT-Net

Multi-scale Transformer Network for Cross-Modality MR Image Synthesis (IEEE TMI)

Language: Python - Size: 13.1 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 16 - Forks: 1

bayartsogt-ya/albert-mongolian

ALBERT trained on Mongolian text corpus

Language: Jupyter Notebook - Size: 263 KB - Last synced at: over 2 years ago - Pushed at: almost 5 years ago - Stars: 15 - Forks: 2

mkang315/PK-YOLO

[WACV'25] Official implementation of "PK-YOLO: Pretrained Knowledge Guided YOLO for Brain Tumor Detection in Multiplane MRI Slices".

Language: Python - Size: 754 KB - Last synced at: 15 days ago - Pushed at: 15 days ago - Stars: 14 - Forks: 5

stoneMo/DeepAVFusion

Official codebase for "Unveiling the Power of Audio-Visual Early Fusion Transformers with Dense Interactions through Masked Modeling".

Language: Python - Size: 26.4 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 12 - Forks: 0

naver-ai/lut

[ECCV 2024] Official PyTorch implementation of LUT "Learning with Unmasked Tokens Drives Stronger Vision Learners"

Language: Python - Size: 4.67 MB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 11 - Forks: 0

yifanzhang-pro/M-MAE

Official implementation of Matrix Variational Masked Autoencoder (M-MAE) for ICML paper "Information Flow in Self-Supervised Learning" (https://arxiv.org/abs/2309.17281)

Language: Python - Size: 74.2 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 11 - Forks: 3

JJLi0427/CNN_Masked_Autoencoder

Design a patches masked autoencoder by CNN

Language: Python - Size: 1.07 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 11 - Forks: 0

Ryan21wy/HSIMAE

HSIMAE: A Unified Masked Autoencoder with large-scale pretraining for Hyperspectral Image Classification

Language: Python - Size: 75.2 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 10 - Forks: 0

Yusen-Peng/CascadeFormer

[arXiv preprint] 🌊CascadeFormer: A Family of Two-stage Cascading Transformers for Skeleton-based Human Action Recognition

Language: Python - Size: 232 MB - Last synced at: 6 days ago - Pushed at: 6 days ago - Stars: 7 - Forks: 1

yc-cui/PEMAE

[TGRS 2024] PEMAE: Pixel-Wise Ensembled Masked Autoencoder for Multispectral Pan-Sharpening

Language: Python - Size: 326 KB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 7 - Forks: 1

MSA-LMC/MAE-SFER

MAE pre-training models (ViT and ConvNeXt) using AffectNet images for static facial expression recognition (SFER).

Language: Python - Size: 168 KB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 7 - Forks: 1

waldo-vision/models

Repository for model development and training

Language: Python - Size: 665 KB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 7 - Forks: 4

danelpeng/RDMAE_Nav

A robust embodied navigation agent to various visual corruptions.

Language: Python - Size: 11.1 MB - Last synced at: 6 months ago - Pushed at: 8 months ago - Stars: 6 - Forks: 0

jonahanton/SSL_audio

Codebase for Imperial MSc AI Individual Project - Self-Supervised Learning for Audio Inference

Language: Python - Size: 59.3 MB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 6 - Forks: 0

EdisonLeeeee/lrGAE

A comprehensive (masked) graph autoencoders benchmark.

Language: Python - Size: 1.53 MB - Last synced at: 4 months ago - Pushed at: 12 months ago - Stars: 5 - Forks: 0

samsad35/VQ-MAE-S-code

A Vector Quantized Masked AutoEncoder for speech emotion recognition

Language: Python - Size: 4.71 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 5 - Forks: 0

chris-santiago/met

Reproducing the MET framework with PyTorch

Language: Python - Size: 10.9 MB - Last synced at: 28 days ago - Pushed at: almost 2 years ago - Stars: 5 - Forks: 1

mvrl/BirdSAT

A PyTorch implementation of "BirdSAT: Cross-View Contrastive Masked Autoencoders for Bird Species Classification and Mapping"

Language: Python - Size: 6.34 MB - Last synced at: almost 2 years ago - Pushed at: about 2 years ago - Stars: 4 - Forks: 2

YunghuiHsu/ebird_project

Extraction of deep features/representation of birds by deep learning algorithms.

Language: Jupyter Notebook - Size: 8.42 MB - Last synced at: almost 2 years ago - Pushed at: over 2 years ago - Stars: 4 - Forks: 1

isno0907/ldmae

Latent Diffusion Models with Masked AutoEncoders (LDMAE) official code

Language: Python - Size: 15.6 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 3 - Forks: 0

asbjrnmunk/amaes

Masked Autoencoder Pretraining on 3D Brain MRI

Language: Python - Size: 187 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 3 - Forks: 0

maple-research-lab/AdPE

code for "AdPE: Adversarial Positional Embeddings for Pretraining Vision Transformers via MAE+"

Language: Python - Size: 95.7 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 3 - Forks: 0

vahidzee/torchde

PyTorch wrapper for Deep Density Estimation Models

Language: Python - Size: 181 KB - Last synced at: 3 months ago - Pushed at: over 3 years ago - Stars: 3 - Forks: 1

Talented-Q/MSMAE

Medical supervised masked autoencoders: Crafting a better masking strategy and efficient fine-tuning schedule for medical image classification

Language: Python - Size: 20.3 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 2 - Forks: 0

dan-crdll/nn_project_dreamdiffusion

Re-implementation of the method proposed in ''DreamDiffusion: Generating High-Quality Images from Brain EEG Signals'' by Y. Bai, X. Wang et al. for Neural Network Course exam Topics

Language: Jupyter Notebook - Size: 5.56 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 2 - Forks: 0

sinatayebati/R-MAE

R-MAE: Pre-training LiDAR Perception with Masked Autoencoders for Ultra-Efficient 3D Sensing

Language: Python - Size: 6.53 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 2 - Forks: 0

e-hulten/made

PyTorch implementation of MADE

Language: Python - Size: 1.16 MB - Last synced at: over 2 years ago - Pushed at: about 5 years ago - Stars: 2 - Forks: 0

AlexMaks02/CGD-MAE

CLIP Distillation-Driven pre-training framework for vehicle re-identification

Language: Python - Size: 1.5 MB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 1 - Forks: 0

samsad35/VQ-MAE-AudioVisual-code

A vector quantized masked autoencoder for audiovisual speech emotion recognition

Language: Python - Size: 21.7 MB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 1 - Forks: 0

samsad35/code-ancogen

AnCoGen: Analysis, Control and Generation of Speech with a Masked Autoencoder

Language: Python - Size: 2.49 MB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 1 - Forks: 0

s35lay/PMCF

"Adaptive Filter Attention for Multi-level Structure-Aware Graph Self-supervised Learning"

Language: Python - Size: 10.2 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 0

eminorhan/optimized-mae

An optimized implementation of masked autoencoders (MAEs)

Language: Python - Size: 69.3 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 1 - Forks: 1

thinh-re/mae

Train MAE on Kaggle 2 GPUs (T4x2), Log to Wandb

Language: Python - Size: 899 KB - Last synced at: about 2 years ago - Pushed at: almost 3 years ago - Stars: 1 - Forks: 0

cell-observatory/cell_observatory_platform

Training backend and some self-supervised pretraining methods for Cell Observatory models

Language: Python - Size: 1.31 MB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 0 - Forks: 1

AlbughdadiM/geo-moe-mae

Mixture-of-Experts Masked AutoEncoder for Earth Observation.

Language: Jupyter Notebook - Size: 33.2 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

sathishkumar67/Masked-Autoencoders-Are-Scalable-Vision-Learners

Implementation of Masked AutoEncoder

Language: Python - Size: 9.51 MB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 0 - Forks: 0

max-kuk/bimae_seed_classification

BiMAE - A Bimodal Masked Autoencoder Architecture for Single-Label Hyperspectral Image Classification, CVPRW 2024

Language: Python - Size: 298 KB - Last synced at: 3 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

jwmao1/MSMAE

Medical supervised masked autoencoders: Crafting a better masking strategy and efficient fine-tuning schedule for medical image classification

Language: Python - Size: 20.3 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

teddyld/mae-torch

A PyTorch implementation of Masked Autoencoders are Scalable Vision Learners in Jupyter Notebook

Language: Jupyter Notebook - Size: 261 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

turhancan97/Learning-by-Reconstruction-with-MAE

Enhancing Representation Learning in Masked Autoencoders by Focusing on Low-Variance Components

Language: Python - Size: 48.9 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

ZY-LIi/Diffusion_Enhanced_Masked_Autoencoder

Pre-training a Masked Autoencoder with the idea of Diffusion Models for Hyperspectral Image Classification.

Language: Python - Size: 14.2 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

ImRTon/TiMAE-Lightning

Unofficial implementation of Ti-MAE in PyTorch Lightning.

Language: Python - Size: 73.2 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

lambda-xyz-01/AGMAE

AG-MAE: Anatomically Guided Spatio-Temporal Masked Auto-Encoder for Online Hand Gesture Recognition

Language: Python - Size: 1.37 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

eminorhan/optimized-stmae

An optimized implementation of spatiotemporal masked autoencoders

Language: Python - Size: 152 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

RichardScottOZ/grid-mae

Investigate possibilities for Vision Transformers with multiscale grids

Language: Python - Size: 114 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

RichardScottOZ/torchgeo-cimae Fork of Modexus/torchgeo

TorchGeo: datasets, transforms, and models for geospatial data

Language: Python - Size: 72 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

Video-MAC/VideoMAC

Official code for CVPR2024 “VideoMAC: Video Masked Autoencoders Meet ConvNets”

Language: Python - Size: 5 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 1

LorenzoBonanni/CVProject

Project for Computer Vision course @ MSc in Artificial Intelligence, UniVR

Language: TeX - Size: 5.45 MB - Last synced at: about 1 year ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0

sebastienmeyer2/masked-change-detection

Change detection on satellite images with masked autoencoders.

Language: Python - Size: 5.57 MB - Last synced at: almost 2 years ago - Pushed at: over 2 years ago - Stars: 0 - Forks: 0

Related Topics
self-supervised-learning 36 pytorch 19 deep-learning 15 masked-image-modeling 14 mae 12 vision-transformer 12 representation-learning 7 computer-vision 6 video-understanding 6 video-representation-learning 5 machine-learning 5 transformer 5 pretraining 5 autoencoder 4 medical-image-analysis 4 foundation-models 4 action-recognition 4 audio 3 transformers 3 imagenet 3 graph-neural-networks 3 artificial-intelligence 3 diffusion-models 3 object-detection 3 remote-sensing 3 pretrained-models 3 hyperspectral-image-classification 3 convolutional-neural-networks 3 bert 3 contrastive-learning 3 vision-transformers 2 ade20k 2 coco 2 imagenet-classification 2 semantic-segmentation 2 vit 2 3d 2 optimized 2 cvpr2023 2 unsupervised-learning 2 speech 2 speech-synthesis 2 microscopy 2 density-estimation 2 generative-model 2 segmentation 2 emotion-recognition 2 pytorch-lightning 2 audio-processing 2 attention-mechanism 2 transformer-architecture 2 benchmark 2 cnn 2 pre-training 2 neurips-2022 2 ssl 2 video-transformer 2 pretrain 2 sparse-convolution 2 video-models 1 vector-quantization 1 grid 1 biodiversity 1 data-science 1 ebird 1 sparse-coding 1 vsc 1 fine-grained-classification 1 geoscience 1 species-distribution-mapping 1 brain-mri 1 5d 1 image-classification 1 geophysics 1 flow-matching 1 iccv 1 iccv2025 1 latent-diffusion 1 large-scale-pretraining 1 skeleton-based-action-recognition 1 4d 1 multispectral-images 1 pansharpening 1 super-resolution 1 facial-expression 1 reconstruction 1 image-recognition 1 mineral-exploration 1 pca 1 audioset 1 barlow-twins 1 byol 1 embodied-ai 1 robustness 1 visual-navigation 1 adversarial-learning 1 hydra 1 few-shot-classifcation 1 taskfile 1 graph-autoencoder 1