GitHub topics: masked-image-modeling

Repositories

mkang315/PK-YOLO

[WACV'25] Official implementation of "PK-YOLO: Pretrained Knowledge Guided YOLO for Brain Tumor Detection in Multiplane MRI Slices".

Language: Python - Size: 792 KB - Last synced at: 7 days ago - Pushed at: 8 days ago - Stars: 8 - Forks: 3

microsoft/SimMIM

This is an official implementation for "SimMIM: A Simple Framework for Masked Image Modeling".

Language: Python - Size: 275 KB - Last synced at: 3 days ago - Pushed at: over 2 years ago - Stars: 976 - Forks: 96

aicip/Cross-Scale-MAE

[NIPS'23] Official Code of the paper "Cross-Scale MAE: A Tale of Multi-Scale Exploitation in Remote Sensing"

Language: Python - Size: 645 KB - Last synced at: 4 days ago - Pushed at: over 1 year ago - Stars: 46 - Forks: 2

open-mmlab/mmpretrain

OpenMMLab Pre-training Toolbox and Benchmark

Language: Python - Size: 13.5 MB - Last synced at: 11 days ago - Pushed at: 7 months ago - Stars: 3,671 - Forks: 1,091

Lupin1998/Awesome-MIM

[Survey] Masked Modeling for Self-supervised Representation Learning on Vision and Beyond (https://arxiv.org/abs/2401.00897)

Language: Python - Size: 6.7 MB - Last synced at: 14 days ago - Pushed at: about 2 months ago - Stars: 331 - Forks: 16

hustvl/MIMDet

[ICCV 2023] You Only Look at One Partial Sequence

Language: Python - Size: 551 KB - Last synced at: 14 days ago - Pushed at: over 1 year ago - Stars: 341 - Forks: 30

Westlake-AI/openmixup

CAIRI Supervised, Semi- and Self-Supervised Visual Representation Learning Toolbox and Benchmark

Language: Python - Size: 3.68 MB - Last synced at: 11 days ago - Pushed at: about 2 months ago - Stars: 650 - Forks: 59

[ICLR'23 Spotlight🔥] The first successful BERT/MAE-style pretraining on any convolutional network; Pytorch impl. of "Designing BERT for Convolutional Networks: Sparse and Hierarchical Masked Modeling"

Language: Python - Size: 699 KB - Last synced at: 16 days ago - Pushed at: over 1 year ago - Stars: 1,343 - Forks: 82

open-mmlab/mmselfsup

OpenMMLab Self-Supervised Learning Toolbox and Benchmark

Language: Python - Size: 3.39 MB - Last synced at: 18 days ago - Pushed at: almost 2 years ago - Stars: 3,258 - Forks: 438

FengheTan9/MambaMIM

[MedIA'25] MambaMIM: Pre-training Mamba with State Space Token Interpolation and its Application to Medical Image Segmentation

Language: Python - Size: 2.47 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 26 - Forks: 1

salesforce/MUST 📦

PyTorch code for MUST

Language: Python - Size: 1.33 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 106 - Forks: 12

liuxingbin/dbot

[ICLR2024] Exploring Target Representations for Masked Autoencoders

Language: Python - Size: 2.13 MB - Last synced at: 14 days ago - Pushed at: over 1 year ago - Stars: 55 - Forks: 8

Westlake-AI/A2MIM

[ICML 2023] Architecture-Agnostic Masked Image Modeling -- From ViT back to CNN

Language: Python - Size: 174 KB - Last synced at: 26 days ago - Pushed at: 10 months ago - Stars: 27 - Forks: 4

Alpha-VL/ConvMAE

ConvMAE: Masked Convolution Meets Masked Autoencoders

Language: Python - Size: 8.53 MB - Last synced at: about 1 month ago - Pushed at: about 2 years ago - Stars: 504 - Forks: 40

Haochen-Wang409/HPM

[CVPR'23 & TPAMI'25] Hard Patches Mining for Masked Image Modeling

Language: Python - Size: 1.67 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 92 - Forks: 7

Sense-X/MixMIM

MixMIM: Mixed and Masked Image Modeling for Efficient Visual Representation Learning

Language: Python - Size: 649 KB - Last synced at: about 1 month ago - Pushed at: almost 2 years ago - Stars: 142 - Forks: 6

lxtGH/CAE

This is a PyTorch implementation of "Context AutoEncoder for Self-Supervised Representation Learning"

Language: Python - Size: 366 KB - Last synced at: about 1 month ago - Pushed at: over 2 years ago - Stars: 196 - Forks: 21

sbartlett97/torch-electra

A custom implementation of the ELECTRA training method using PyTorch and HuggingFace Transformers

Language: Python - Size: 45.9 KB - Last synced at: 2 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

PGSmall/clip-pgs

Official code for CVPR2025 "Seeing What Matters: Empowering CLIP with Patch Generation-to-Selection"

Language: Python - Size: 8.97 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 1

ariG23498/mae-scalable-vision-learners

A TensorFlow 2.x implementation of Masked Autoencoders Are Scalable Vision Learners

Language: Jupyter Notebook - Size: 38.7 MB - Last synced at: about 1 month ago - Pushed at: about 3 years ago - Stars: 78 - Forks: 15

haofanwang/awesome-vision-language-modeling

Recent Advances in Vision-Language Pre-training!

Size: 18.6 KB - Last synced at: 26 days ago - Pushed at: over 3 years ago - Stars: 29 - Forks: 2

AndreaCossu/continual-pretraining-nlp-vision

Code to reproduce experiments from the paper "Continual Pre-Training Mitigates Forgetting in Language and Vision" https://arxiv.org/abs/2205.09357

Language: Jupyter Notebook - Size: 872 KB - Last synced at: about 2 months ago - Pushed at: over 1 year ago - Stars: 22 - Forks: 1

bwconrad/can

PyTorch reimplementation of "A simple, efficient and scalable contrastive masked autoencoder for learning visual representations".

Language: Python - Size: 352 KB - Last synced at: about 2 months ago - Pushed at: over 2 years ago - Stars: 38 - Forks: 7

CRazorback/MDM

[TMI'24] "Masked Deformation Modeling for Volumetric Brain MRI Self-supervised Pre-training".

Language: Python - Size: 3.45 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 4 - Forks: 0

naver-ai/lut

[ECCV 2024] Official PyTorch implementation of LUT "Learning with Unmasked Tokens Drives Stronger Vision Learners"

Language: Python - Size: 4.67 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 11 - Forks: 0

Talented-Q/MSMAE

Medical supervised masked autoencoders: Crafting a better masking strategy and efficient fine-tuning schedule for medical image classification

Language: Python - Size: 20.3 MB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 2 - Forks: 0

jwmao1/MSMAE

Medical supervised masked autoencoders: Crafting a better masking strategy and efficient fine-tuning schedule for medical image classification

Language: Python - Size: 20.3 MB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 0 - Forks: 0

FengheTan9/HySparK

[MICCAI 2024] HySparK: Hybrid Sparse Masking for Large Scale Medical Image Pre-Training

Language: Python - Size: 3.85 MB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 8 - Forks: 0

bwconrad/masked-distillation

Pytorch reimplementation of "A Unified View of Masked Image Modeling".

Language: Python - Size: 155 KB - Last synced at: about 2 months ago - Pushed at: over 2 years ago - Stars: 5 - Forks: 0

JunlinHan/CropMix

Code of CropMix: Sampling a Rich Input Distribution via Multi-Scale Cropping

Language: Jupyter Notebook - Size: 1.97 MB - Last synced at: about 1 month ago - Pushed at: over 2 years ago - Stars: 17 - Forks: 2

yifanzhang-pro/M-MAE

Official implementation of Matrix Variational Masked Autoencoder (M-MAE) for ICML paper "Information Flow in Self-Supervised Learning" (https://arxiv.org/abs/2309.17281)

Language: Python - Size: 74.2 KB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 11 - Forks: 3

asbjrnmunk/amaes

Masked Autoencoder Pretraining on 3D Brain MRI

Language: Python - Size: 187 KB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 3 - Forks: 0

stoneMo/DeepAVFusion

Official codebase for "Unveiling the Power of Audio-Visual Early Fusion Transformers with Dense Interactions through Masked Modeling".

Language: Python - Size: 26.4 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 12 - Forks: 0

madhava20217/KAMIM

Code for "Keypoint Aware Masked Image Modelling"

Language: Jupyter Notebook - Size: 8.57 MB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

olibridge01/MaskedImageModelling

Pre-training a VisionTransformer with Masked Image Modelling for semantic segmentation

Language: Python - Size: 17.1 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

chadHGY/CAM

Learning Cortical Anomaly through Masked Encoding for Unsupervised Heterogeneity Mapping.

Language: Python - Size: 96.7 MB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 0 - Forks: 0

russellllaputa/MIRL

[NeurIPS 2023] Masked Image Residual Learning for Scaling Deeper Vision Transformers

Language: Python - Size: 1.28 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 17 - Forks: 3

LayneH/GreenMIM

[NeurIPS2022] Official implementation of the paper 'Green Hierarchical Vision Transformer for Masked Image Modeling'.

Language: Python - Size: 1.39 MB - Last synced at: about 1 year ago - Pushed at: over 2 years ago - Stars: 163 - Forks: 6

implus/UM-MAE

Official Codes for "Uniform Masking: Enabling MAE Pre-training for Pyramid-based Vision Transformers with Locality"

Language: Jupyter Notebook - Size: 5.72 MB - Last synced at: about 1 year ago - Pushed at: over 2 years ago - Stars: 231 - Forks: 20

MohamedOmar2020/QuPath_scripts

Custom Groovy scripts for QuPath

Language: Groovy - Size: 8.79 KB - Last synced at: 4 months ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

dvlab-research/MOOD

Official PyTorch implementation of the MOOD series: (1) MOODv1: Rethinking Out-of-distribution Detection: Masked Image Modeling Is All You Need. (2) MOODv2: Masked Image Modeling for Out-of-Distribution Detection.

Language: Python - Size: 3.08 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 120 - Forks: 4

lixiaotong97/mc-BEiT

[ECCV 2022] Official pytorch implementation of "mc-BEiT: Multi-choice Discretization for Image BERT Pre-training" in European Conference on Computer Vision (ECCV) 2022.

Language: Python - Size: 9.4 MB - Last synced at: about 1 year ago - Pushed at: over 2 years ago - Stars: 22 - Forks: 2

Atten4Vis/CAE

This is a PyTorch implementation of "Context AutoEncoder for Self-Supervised Representation Learning"

Language: Python - Size: 568 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 34 - Forks: 2

maple-research-lab/AdPE

Code for "AdPE: Adversarial Positional Embeddings for Pretraining Vision Transformers via MAE+"

Language: Python - Size: 95.7 KB - Last synced at: over 1 year ago - Pushed at: about 2 years ago - Stars: 3 - Forks: 0

faris-k/self-supervised-wafermaps

Self-Supervised Representation Learning of Semiconductor Wafer Maps using PyTorch

Language: Jupyter Notebook - Size: 444 MB - Last synced at: almost 2 years ago - Pushed at: almost 2 years ago - Stars: 3 - Forks: 0

AndyShih12/mac

PyTorch implementation for "Training and Inference on Any-Order Autoregressive Models the Right Way", NeurIPS 2022

Language: Python - Size: 18.6 KB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 7 - Forks: 2

LumenPallidium/energy_transformer

PyTorch implementation of an energy transformer, an energy-based recurrent variant of the transformer.

Language: Python - Size: 69.3 KB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 1 - Forks: 0

Related Keywords

masked-image-modeling (47), self-supervised-learning (29), masked-autoencoder (13), pytorch (12), vision-transformer (10), mae (9), deep-learning (7), computer-vision (6), representation-learning (6), pretraining (5), object-detection (5), image-classification (5), unsupervised-learning (5), machine-learning (4), imagenet (4), transformer (4), contrastive-learning (4), pretrained-models (4), swin-transformer (3), pre-training (3), clip (3), medical-image-analysis (3), data-augmentation (2), masked-language-models (2), instance-segmentation (2), medical-imaging (2), ssl (2), generative-models (2), bert (2), awesome-mim (2), awesome-list (2), semantic-segmentation (2), convolutional-neural-networks (2), moco (2), pytorch-lightning (2), ade20k (2), coco (2), beit (2), cnn (2), cvpr2023 (2), context-autoencoder (2), medical-image-segmentation (2), sparse-convolution (2), audio-visual-correspondence (1), audio-visual-learning (1), attention-mechanism (1), segmentation (1), brain-mri (1), multimodal-learning (1), non-contrastive-learning (1), foundation-models (1), hybrid-model (1), eccv2024 (1), neuroimaging (1), diffeomorphism (1), transformers (1), sentiment-analysis (1), senteval (1), self-supervised (1), qnli (1), lifelong-learning (1), forgetting (1), continual-learning (1), vision-language (1), convnet (1), groovy-script (1), qupath (1), qupath-script (1), ood-detection (1), outlier-detection (1), eccv2022 (1), embeddings (1), image-retrieval (1), semiconductor (1), any-order-autoregressive-models (1), autoregressive-models (1), masked-language-modeling (1), tractable-inference (1), tractable-models (1), energy-transformer (1), hopfield-network (1), sound-source-localization (1), sound-source-separation (1), transformer-architecture (1), visual-pretraining (1), 3d-mesh (1), anomaly-detection (1), neuroscience (1), surface (1), deeper-network (1), masked-image-residual-learning (1), efficient-deep-learning (1), hierarchical-vision-transformer (1), imagenet-classification (1), pyramid-vision-transformer (1), celltype-annotation (1), digital-pathology (1), semi-supervised-learning (1), mixup (1), image-classifcation (1)