GitHub topics: masked-image-modeling
microsoft/SimMIM
This is an official implementation for "SimMIM: A Simple Framework for Masked Image Modeling".
Language: Python - Size: 275 KB - Last synced at: about 18 hours ago - Pushed at: about 3 years ago - Stars: 1,008 - Forks: 105
open-mmlab/mmpretrain
OpenMMLab Pre-training Toolbox and Benchmark
Language: Python - Size: 13.5 MB - Last synced at: 5 days ago - Pushed at: about 1 year ago - Stars: 3,780 - Forks: 1,106
Lupin1998/Awesome-MIM
[Survey] Masked Modeling for Self-supervised Representation Learning on Vision and Beyond (https://arxiv.org/abs/2401.00897)
Language: Python - Size: 6.7 MB - Last synced at: 5 days ago - Pushed at: 7 months ago - Stars: 350 - Forks: 17
mkang315/PK-YOLO
[WACV'25] Official implementation of "PK-YOLO: Pretrained Knowledge Guided YOLO for Brain Tumor Detection in Multiplane MRI Slices".
Language: Python - Size: 754 KB - Last synced at: 19 days ago - Pushed at: 19 days ago - Stars: 14 - Forks: 5
aicip/Cross-Scale-MAE
[NIPS'23] Official Code of the paper "Cross-Scale MAE: A Tale of Multi-Scale Exploitation in Remote Sensing"
Language: Python - Size: 645 KB - Last synced at: 11 days ago - Pushed at: almost 2 years ago - Stars: 50 - Forks: 2
FengheTan9/Hi-End-MAE
[MedIA 2025] Hi-End-MAE: Hierarchical encoder-driven masked autoencoders are stronger vision learners for medical image segmentation
Language: Python - Size: 2.29 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 19 - Forks: 4
Westlake-AI/openmixup
CAIRI Supervised, Semi- and Self-Supervised Visual Representation Learning Toolbox and Benchmark
Language: Python - Size: 3.79 MB - Last synced at: 26 days ago - Pushed at: about 2 months ago - Stars: 658 - Forks: 61
open-mmlab/mmselfsup
OpenMMLab Self-Supervised Learning Toolbox and Benchmark
Language: Python - Size: 3.39 MB - Last synced at: 2 months ago - Pushed at: over 2 years ago - Stars: 3,282 - Forks: 442
Westlake-AI/A2MIM
[ICML 2023] Architecture-Agnostic Masked Image Modeling -- From ViT back to CNN
Language: Python - Size: 174 KB - Last synced at: about 1 month ago - Pushed at: over 1 year ago - Stars: 28 - Forks: 3
eezkni/SMHDR
[IJCV-2025] Official Pytorch implementation of "Semantic Masking with Curriculum Learning for Robust HDR Image Reconstruction"
Language: Python - Size: 0 Bytes - Last synced at: 5 months ago - Pushed at: 5 months ago - Stars: 2 - Forks: 0
dvlab-research/MOOD
Official PyTorch implementation of MOOD series: (1) MOODv1: Rethinking Out-of-distribution Detection: Masked Image Modeling Is All You Need. (2) MOODv2: Masked Image Modeling for Out-of-Distribution Detection.
Language: Python - Size: 3.09 MB - Last synced at: 5 months ago - Pushed at: over 1 year ago - Stars: 133 - Forks: 7
lxtGH/CAE
This is a PyTorch implementation of "Context AutoEncoder for Self-Supervised Representation Learning"
Language: Python - Size: 366 KB - Last synced at: 3 months ago - Pushed at: almost 3 years ago - Stars: 198 - Forks: 21
hustvl/MIMDet
[ICCV 2023] You Only Look at One Partial Sequence
Language: Python - Size: 551 KB - Last synced at: 6 months ago - Pushed at: about 2 years ago - Stars: 341 - Forks: 30
keyu-tian/SparK
[ICLR'23 Spotlight🔥] The first successful BERT/MAE-style pretraining on any convolutional network; Pytorch impl. of "Designing BERT for Convolutional Networks: Sparse and Hierarchical Masked Modeling"
Language: Python - Size: 699 KB - Last synced at: 6 months ago - Pushed at: almost 2 years ago - Stars: 1,343 - Forks: 82
FengheTan9/MambaMIM
[MedIA'25] MambaMIM: Pre-training Mamba with State Space Token Interpolation and its Application to Medical Image Segmentation
Language: Python - Size: 2.47 MB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 26 - Forks: 1
salesforce/MUST 📦
PyTorch code for MUST
Language: Python - Size: 1.33 MB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 106 - Forks: 12
liuxingbin/dbot
[ICLR2024] Exploring Target Representations for Masked Autoencoders
Language: Python - Size: 2.13 MB - Last synced at: 6 months ago - Pushed at: almost 2 years ago - Stars: 55 - Forks: 8
bwconrad/can
PyTorch reimplementation of "A simple, efficient and scalable contrastive masked autoencoder for learning visual representations".
Language: Python - Size: 352 KB - Last synced at: 4 months ago - Pushed at: almost 3 years ago - Stars: 39 - Forks: 7
Alpha-VL/ConvMAE
ConvMAE: Masked Convolution Meets Masked Autoencoders
Language: Python - Size: 8.53 MB - Last synced at: 7 months ago - Pushed at: over 2 years ago - Stars: 504 - Forks: 40
Haochen-Wang409/HPM
[CVPR'23 & TPAMI'25] Hard Patches Mining for Masked Image Modeling
Language: Python - Size: 1.67 MB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 92 - Forks: 7
Sense-X/MixMIM
MixMIM: Mixed and Masked Image Modeling for Efficient Visual Representation Learning
Language: Python - Size: 649 KB - Last synced at: 7 months ago - Pushed at: over 2 years ago - Stars: 142 - Forks: 6
sbartlett97/torch-electra
A custom implementation of the ELECTRA training method using PyTorch and HuggingFace Transformers
Language: Python - Size: 45.9 KB - Last synced at: 5 months ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0
PGSmall/clip-pgs
Official code for CVPR2025 "Seeing What Matters: Empowering CLIP with Patch Generation-to-Selection"
Language: Python - Size: 8.97 MB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 0 - Forks: 1
ariG23498/mae-scalable-vision-learners
A TensorFlow 2.x implementation of Masked Autoencoders Are Scalable Vision Learners
Language: Jupyter Notebook - Size: 38.7 MB - Last synced at: 7 months ago - Pushed at: over 3 years ago - Stars: 78 - Forks: 15
haofanwang/awesome-vision-language-modeling
Recent Advances in Vision-Language Pre-training!
Size: 18.6 KB - Last synced at: 23 days ago - Pushed at: almost 4 years ago - Stars: 29 - Forks: 2
AndreaCossu/continual-pretraining-nlp-vision
Code to reproduce experiments from the paper "Continual Pre-Training Mitigates Forgetting in Language and Vision" https://arxiv.org/abs/2205.09357
Language: Jupyter Notebook - Size: 872 KB - Last synced at: 5 months ago - Pushed at: about 2 years ago - Stars: 22 - Forks: 1
CRazorback/MDM
[TMI'24] "Masked Deformation Modeling for Volumetric Brain MRI Self-supervised Pre-training".
Language: Python - Size: 3.45 MB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 4 - Forks: 0
naver-ai/lut
[ECCV 2024] Official PyTorch implementation of LUT "Learning with Unmasked Tokens Drives Stronger Vision Learners"
Language: Python - Size: 4.67 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 11 - Forks: 0
Talented-Q/MSMAE
Medical supervised masked autoencoders: Crafting a better masking strategy and efficient fine-tuning schedule for medical image classification
Language: Python - Size: 20.3 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 2 - Forks: 0
jwmao1/MSMAE
Medical supervised masked autoencoders: Crafting a better masking strategy and efficient fine-tuning schedule for medical image classification
Language: Python - Size: 20.3 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0
FengheTan9/HySparK
[MICCAI 2024] HySparK: Hybrid Sparse Masking for Large Scale Medical Image Pre-Training
Language: Python - Size: 3.85 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 8 - Forks: 0
bwconrad/masked-distillation
Pytorch reimplementation of "A Unified View of Masked Image Modeling".
Language: Python - Size: 155 KB - Last synced at: 8 months ago - Pushed at: almost 3 years ago - Stars: 5 - Forks: 0
JunlinHan/CropMix
Code of CropMix: Sampling a Rich Input Distribution via Multi-Scale Cropping
Language: Jupyter Notebook - Size: 1.97 MB - Last synced at: 7 months ago - Pushed at: about 3 years ago - Stars: 17 - Forks: 2
yifanzhang-pro/M-MAE
Official implementation of Matrix Variational Masked Autoencoder (M-MAE) for ICML paper "Information Flow in Self-Supervised Learning" (https://arxiv.org/abs/2309.17281)
Language: Python - Size: 74.2 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 11 - Forks: 3
asbjrnmunk/amaes
Masked Autoencoder Pretraining on 3D Brain MRI
Language: Python - Size: 187 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 3 - Forks: 0
stoneMo/DeepAVFusion
Official codebase for "Unveiling the Power of Audio-Visual Early Fusion Transformers with Dense Interactions through Masked Modeling".
Language: Python - Size: 26.4 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 12 - Forks: 0
madhava20217/KAMIM
Code for "Keypoint Aware Masked Image Modelling"
Language: Jupyter Notebook - Size: 8.57 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0
olibridge01/MaskedImageModelling
Pre-training a VisionTransformer with Masked Image Modelling for semantic segmentation
Language: Python - Size: 17.1 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0
chadHGY/CAM
Learning Cortical Anomaly through Masked Encoding for Unsupervised Heterogeneity Mapping.
Language: Python - Size: 96.7 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0
russellllaputa/MIRL
[NeurIPS 2023] Masked Image Residual Learning for Scaling Deeper Vision Transformers
Language: Python - Size: 1.28 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 17 - Forks: 3
LayneH/GreenMIM
[NeurIPS2022] Official implementation of the paper 'Green Hierarchical Vision Transformer for Masked Image Modeling'.
Language: Python - Size: 1.39 MB - Last synced at: over 1 year ago - Pushed at: almost 3 years ago - Stars: 163 - Forks: 6
implus/UM-MAE
Official Codes for "Uniform Masking: Enabling MAE Pre-training for Pyramid-based Vision Transformers with Locality"
Language: Jupyter Notebook - Size: 5.72 MB - Last synced at: over 1 year ago - Pushed at: almost 3 years ago - Stars: 231 - Forks: 20
MohamedOmar2020/QuPath_scripts
Custom Groovy scripts for QuPath
Language: Groovy - Size: 8.79 KB - Last synced at: about 1 month ago - Pushed at: almost 2 years ago - Stars: 0 - Forks: 0
lixiaotong97/mc-BEiT
[ECCV 2022] Official pytorch implementation of "mc-BEiT: Multi-choice Discretization for Image BERT Pre-training" in European Conference on Computer Vision (ECCV) 2022.
Language: Python - Size: 9.4 MB - Last synced at: over 1 year ago - Pushed at: about 3 years ago - Stars: 22 - Forks: 2
Atten4Vis/CAE
This is a PyTorch implementation of "Context AutoEncoder for Self-Supervised Representation Learning"
Language: Python - Size: 568 KB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 34 - Forks: 2
maple-research-lab/AdPE
code for "AdPE: Adversarial Positional Embeddings for Pretraining Vision Transformers via MAE+"
Language: Python - Size: 95.7 KB - Last synced at: about 2 years ago - Pushed at: over 2 years ago - Stars: 3 - Forks: 0
faris-k/self-supervised-wafermaps
Self-Supervised Representation Learning of Semiconductor Wafer Maps using PyTorch
Language: Jupyter Notebook - Size: 444 MB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 3 - Forks: 0
AndyShih12/mac
PyTorch implementation for "Training and Inference on Any-Order Autoregressive Models the Right Way", NeurIPS 2022
Language: Python - Size: 18.6 KB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 7 - Forks: 2
LumenPallidium/energy_transformer
PyTorch implementation of an energy transformer - an energy-based recurrent variant of the transformer.
Language: Python - Size: 69.3 KB - Last synced at: over 2 years ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0