An open API service providing repository metadata for many open source software ecosystems.

Topic: "masked-autoencoder"

OpenGVLab/InternVideo

[ECCV2024] Video Foundation Models & Data for Multimodal Understanding

Language: Python - Size: 53.2 MB - Last synced at: 1 day ago - Pushed at: 14 days ago - Stars: 1,899 - Forks: 112

MCG-NJU/VideoMAE

[NeurIPS 2022 Spotlight] VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training

Language: Python - Size: 547 KB - Last synced at: 3 months ago - Pushed at: over 1 year ago - Stars: 1,451 - Forks: 142

keyu-tian/SparK

[ICLR'23 Spotlight🔥] The first successful BERT/MAE-style pretraining on any convolutional network; Pytorch impl. of "Designing BERT for Convolutional Networks: Sparse and Hierarchical Masked Modeling"

Language: Python - Size: 699 KB - Last synced at: 16 days ago - Pushed at: over 1 year ago - Stars: 1,343 - Forks: 82

EdisonLeeeee/Awesome-Masked-Autoencoders

A collection of literature after or concurrent with Masked Autoencoder (MAE) (Kaiming He el al.).

Size: 488 KB - Last synced at: about 1 month ago - Pushed at: 11 months ago - Stars: 823 - Forks: 52

Lupin1998/Awesome-MIM

[Survey] Masked Modeling for Self-supervised Representation Learning on Vision and Beyond (https://arxiv.org/abs/2401.00897)

Language: Python - Size: 6.7 MB - Last synced at: 13 days ago - Pushed at: about 2 months ago - Stars: 331 - Forks: 16

implus/UM-MAE

Official Codes for "Uniform Masking: Enabling MAE Pre-training for Pyramid-based Vision Transformers with Locality"

Language: Jupyter Notebook - Size: 5.72 MB - Last synced at: about 1 year ago - Pushed at: over 2 years ago - Stars: 231 - Forks: 20

uncbiag/SimpleClick

SimpleClick: Interactive Image Segmentation with Simple Vision Transformers (ICCV 2023)

Language: Python - Size: 40.2 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 225 - Forks: 38

xyzforever/BEVT

PyTorch implementation of BEVT (CVPR 2022) https://arxiv.org/abs/2112.01529

Language: Python - Size: 19.2 MB - Last synced at: 6 months ago - Pushed at: almost 3 years ago - Stars: 158 - Forks: 19

implus/mae_segmentation

reproduction of semantic segmentation using masked autoencoder (mae)

Language: Python - Size: 198 KB - Last synced at: over 1 year ago - Pushed at: over 3 years ago - Stars: 134 - Forks: 13

TonyLianLong/CrossMAE

Official Implementation of the CrossMAE paper: Rethinking Patch Dependence for Masked Autoencoders

Language: Python - Size: 1.5 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 107 - Forks: 6

zubair-irshad/NeRF-MAE

[ECCV 2024] Pytorch code for our ECCV'24 paper NeRF-MAE: Masked AutoEncoders for Self-Supervised 3D Representation Learning for Neural Radiance Fields

Language: Python - Size: 4.47 MB - Last synced at: 19 days ago - Pushed at: 3 months ago - Stars: 101 - Forks: 4

Haochen-Wang409/HPM

[CVPR'23 & TPAMI'25] Hard Patches Mining for Masked Image Modeling

Language: Python - Size: 1.67 MB - Last synced at: about 2 months ago - Pushed at: about 2 months ago - Stars: 92 - Forks: 7

ruiwang2021/mvd

[CVPR2023] Masked Video Distillation: Rethinking Masked Feature Modeling for Self-supervised Video Representation Learning (https://arxiv.org/abs/2212.04500)

Language: Python - Size: 477 KB - Last synced at: about 1 year ago - Pushed at: about 2 years ago - Stars: 90 - Forks: 9

nttcslab/msm-mae

Masked Spectrogram Modeling using Masked Autoencoders for Learning General-purpose Audio Representations

Language: Jupyter Notebook - Size: 10 MB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 84 - Forks: 8

habla-liaa/encodecmae

Codebase for the paper 'EncodecMAE: Leveraging neural codecs for universal audio representation learning'

Language: Python - Size: 97.7 KB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 78 - Forks: 4

rishikksh20/AudioMAE-pytorch

Unofficial PyTorch implementation of Masked Autoencoders that Listen

Language: Python - Size: 226 KB - Last synced at: about 1 year ago - Pushed at: almost 3 years ago - Stars: 60 - Forks: 6

recursionpharma/maes_microscopy

Official repo for Recursion's accepted spotlight paper at NeurIPS 2023 Generative AI & Biology workshop.

Language: Jupyter Notebook - Size: 4.86 MB - Last synced at: 2 days ago - Pushed at: 2 days ago - Stars: 59 - Forks: 13

nttcslab/m2d

Masked Modeling Duo: Towards a Universal Audio Pre-training Framework

Language: Jupyter Notebook - Size: 17.1 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 57 - Forks: 1

HKUDS/MAERec

[SIGIR'2023] "MAERec: Graph Masked Autoencoder for Sequential Recommendation"

Language: Python - Size: 80.3 MB - Last synced at: about 1 year ago - Pushed at: about 2 years ago - Stars: 50 - Forks: 5

lucidrains/LVMAE-pytorch

Implementation of the proposed LVMAE, from the paper, Extending Video Masked Autoencoders to 128 frames, in Pytorch

Language: Python - Size: 740 KB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 45 - Forks: 1

MCG-NJU/VideoMAE-Action-Detection

[NeurIPS 2022 Spotlight] VideoMAE for Action Detection

Language: Python - Size: 580 KB - Last synced at: about 1 year ago - Pushed at: over 2 years ago - Stars: 38 - Forks: 1

naver-ai/augsub

[CVPR 2025] Official PyTorch implementation of MaskSub "Masking meets Supervision: A Strong Learning Alliance"

Language: Python - Size: 251 KB - Last synced at: 2 months ago - Pushed at: 2 months ago - Stars: 34 - Forks: 1

Event-AHU/VFM-Det

VFM-Det: Towards High-Performance Vehicle Detection via Large Foundation Models

Language: Python - Size: 7.14 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 31 - Forks: 3

Westlake-AI/A2MIM

[ICML 2023] Architecture-Agnostic Masked Image Modeling -- From ViT back to CNN

Language: Python - Size: 174 KB - Last synced at: 25 days ago - Pushed at: 10 months ago - Stars: 27 - Forks: 4

shlokk/mae-contrastive

Official implementation of "A simple, efficient and scalable contrastive masked autoencoder for learning visual representations".

Language: Python - Size: 598 KB - Last synced at: about 1 year ago - Pushed at: about 2 years ago - Stars: 21 - Forks: 1

sunilhoho/EVEREST

Official Pytorch implementation of EVEREST: Efficient Masked Video Autoencoder by Removing Redundant Spatiotemporal Tokens [ICML2024].

Language: Python - Size: 2.77 MB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 19 - Forks: 1

liruiw/Dec-SSL

Understanding Self-Supervised Learning in a Decentralized Setting

Language: Python - Size: 662 KB - Last synced at: about 1 year ago - Pushed at: over 2 years ago - Stars: 19 - Forks: 1

jakhac/CSMAE

Cross-Sensor Masked Autoencoder for Content Based Image Retrieval in Remote Sensing

Language: Python - Size: 1.07 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 16 - Forks: 4

lyhkevin/MT-Net

Multi-scale Transformer Network for Cross-Modality MR Image Synthesis (IEEE TMI)

Language: Python - Size: 13.1 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 16 - Forks: 1

bayartsogt-ya/albert-mongolian

ALBERT trained on Mongolian text corpus

Language: Jupyter Notebook - Size: 263 KB - Last synced at: over 2 years ago - Pushed at: over 4 years ago - Stars: 15 - Forks: 2

stoneMo/DeepAVFusion

Official codebase for "Unveiling the Power of Audio-Visual Early Fusion Transformers with Dense Interactions through Masked Modeling".

Language: Python - Size: 26.4 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 12 - Forks: 0

naver-ai/lut

[ECCV 2024] Official PyTorch implementation of LUT "Learning with Unmasked Tokens Drives Stronger Vision Learners"

Language: Python - Size: 4.67 MB - Last synced at: 6 months ago - Pushed at: 6 months ago - Stars: 11 - Forks: 0

yifanzhang-pro/M-MAE

Official implementation of Matrix Variational Masked Autoencoder (M-MAE) for ICML paper "Information Flow in Self-Supervised Learning" (https://arxiv.org/abs/2309.17281)

Language: Python - Size: 74.2 KB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 11 - Forks: 3

JJLi0427/CNN_Masked_Autoencoder

Design a patches masked autoencoder by CNN

Language: Python - Size: 1.07 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 11 - Forks: 0

Ryan21wy/HSIMAE

HSIMAE: A Unified Masked Autoencoder with large-scale pretraining for Hyperspectral Image Classification

Language: Python - Size: 75.2 KB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 10 - Forks: 0

mkang315/PK-YOLO

[WACV'25] Official implementation of "PK-YOLO: Pretrained Knowledge Guided YOLO for Brain Tumor Detection in Multiplane MRI Slices".

Language: Python - Size: 792 KB - Last synced at: 7 days ago - Pushed at: 7 days ago - Stars: 8 - Forks: 3

MSA-LMC/MAE-SFER

MAE pre-training models (ViT and ConvNeXt) using AffectNet images for static facial expression recognition (SFER).

Language: Python - Size: 168 KB - Last synced at: 4 days ago - Pushed at: 5 days ago - Stars: 7 - Forks: 1

waldo-vision/models

Repository for model development and training

Language: Python - Size: 665 KB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 7 - Forks: 4

jonahanton/SSL_audio

Codebase for Imperial MSc AI Individual Project - Self-Supervised Learning for Audio Inference

Language: Python - Size: 59.3 MB - Last synced at: about 2 years ago - Pushed at: about 2 years ago - Stars: 6 - Forks: 0

danelpeng/RDMAE_Nav

A robust embodied navigation agent to various visual corruptions.

Language: Python - Size: 11.1 MB - Last synced at: about 2 months ago - Pushed at: 3 months ago - Stars: 5 - Forks: 0

EdisonLeeeee/lrGAE

A comprehensive (masked) graph autoencoders benchmark.

Language: Python - Size: 1.53 MB - Last synced at: about 1 month ago - Pushed at: 6 months ago - Stars: 5 - Forks: 0

samsad35/VQ-MAE-S-code

A Vector Quantized Masked AutoEncoder for speech emotion recognition

Language: Python - Size: 4.71 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 5 - Forks: 0

chris-santiago/met

Reproducing the MET framework with PyTorch

Language: Python - Size: 10.9 MB - Last synced at: 6 days ago - Pushed at: over 1 year ago - Stars: 4 - Forks: 1

mvrl/BirdSAT

A PyTorch implementation of "BirdSAT: Cross-View Contrastive Masked Autoencoders for Bird Species Classification and Mapping"

Language: Python - Size: 6.34 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 4 - Forks: 2

YunghuiHsu/ebird_project

Extraction of deep features/representation of birds by deep learning algorithms.

Language: Jupyter Notebook - Size: 8.42 MB - Last synced at: over 1 year ago - Pushed at: about 2 years ago - Stars: 4 - Forks: 1

asbjrnmunk/amaes

Masked Autoencoder Pretraining on 3D Brain MRI

Language: Python - Size: 187 KB - Last synced at: 9 months ago - Pushed at: 9 months ago - Stars: 3 - Forks: 0

tomouellette/autoencodersplz

Generative modeling and representation learning through reconstruction

Language: Python - Size: 27.8 MB - Last synced at: 28 days ago - Pushed at: over 1 year ago - Stars: 3 - Forks: 0

maple-research-lab/AdPE

code for "AdPE: Adversarial Positional Embeddings for Pretraining Vision Transformers via MAE+"

Language: Python - Size: 95.7 KB - Last synced at: over 1 year ago - Pushed at: about 2 years ago - Stars: 3 - Forks: 0

vahidzee/torchde

PyTorch wrapper for Deep Density Estimation Models

Language: Python - Size: 181 KB - Last synced at: about 2 months ago - Pushed at: almost 3 years ago - Stars: 3 - Forks: 1

Talented-Q/MSMAE

Medical supervised masked autoencoders: Crafting a better masking strategy and efficient fine-tuning schedule for medical image classification

Language: Python - Size: 20.3 MB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 2 - Forks: 0

dan-crdll/nn_project_dreamdiffusion

Re-implementation of the method proposed in ''DreamDiffusion: Generating High-Quality Images from Brain EEG Signals'' by Y. Bai, X. Wang et al. for Neural Network Course exam Topics

Language: Jupyter Notebook - Size: 5.56 MB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 2 - Forks: 0

sinatayebati/R-MAE

R-MAE: Pre-training LiDAR Perception with Masked Autoencoders for Ultra-Efficient 3D Sensing

Language: Python - Size: 6.53 MB - Last synced at: 12 months ago - Pushed at: 12 months ago - Stars: 2 - Forks: 0

e-hulten/made

PyTorch implementation of MADE

Language: Python - Size: 1.16 MB - Last synced at: over 2 years ago - Pushed at: over 4 years ago - Stars: 2 - Forks: 0

samsad35/VQ-MAE-AudioVisual-code

A vector quantized masked autoencoder for audiovisual speech emotion recognition

Language: Python - Size: 21.7 MB - Last synced at: about 1 month ago - Pushed at: about 1 month ago - Stars: 1 - Forks: 0

samsad35/code-ancogen

AnCoGen: Analysis, Control and Generation of Speech with a Masked Autoencoder

Language: Python - Size: 2.49 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 1 - Forks: 0

s35lay/PMCF

"Adaptive Filter Attention for Multi-level Structure-Aware Graph Self-supervised Learning"

Language: Python - Size: 10.2 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 1 - Forks: 0

eminorhan/optimized-mae

An optimized implementation of masked autoencoders (MAEs)

Language: Python - Size: 69.3 KB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 1 - Forks: 1

thinh-re/mae

Train MAE on Kaggle 2 GPUs (T4x2), Log to Wandb

Language: Python - Size: 899 KB - Last synced at: over 1 year ago - Pushed at: over 2 years ago - Stars: 1 - Forks: 0

sathishkumar67/Masked-Autoencoders-Are-Scalable-Vision-Learners

Implementation of Masked AutoEncoder

Language: Python - Size: 9.51 MB - Last synced at: 3 months ago - Pushed at: 3 months ago - Stars: 0 - Forks: 0

max-kuk/bimae_seed_classification

BiMAE - A Bimodal Masked Autoencoder Architecture for Single-Label Hyperspectral Image Classification, CVPRW 2024

Language: Python - Size: 298 KB - Last synced at: about 2 months ago - Pushed at: 5 months ago - Stars: 0 - Forks: 0

jwmao1/MSMAE

Medical supervised masked autoencoders: Crafting a better masking strategy and efficient fine-tuning schedule for medical image classification

Language: Python - Size: 20.3 MB - Last synced at: 7 months ago - Pushed at: 7 months ago - Stars: 0 - Forks: 0

teddyld/mae-torch

A PyTorch implementation of Masked Autoencoders are Scalable Vision Learners in Jupyter Notebook

Language: Jupyter Notebook - Size: 261 KB - Last synced at: 8 months ago - Pushed at: 8 months ago - Stars: 0 - Forks: 0

turhancan97/Learning-by-Reconstruction-with-MAE

Enhancing Representation Learning in Masked Autoencoders by Focusing on Low-Variance Components

Language: Python - Size: 48.9 MB - Last synced at: 9 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

ZY-LIi/Diffusion_Enhanced_Masked_Autoencoder

Pre-training a Masked Autoencoder with the idea of Diffusion Models for Hyperspectral Image Classification.

Language: Python - Size: 14.2 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

ImRTon/TiMAE-Lightning

Unofficial implementation of Ti-MAE in PyTorch Lightning.

Language: Python - Size: 73.2 KB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

lambda-xyz-01/AGMAE

AG-MAE: Anatomically Guided Spatio-Temporal Masked Auto-Encoder for Online Hand Gesture Recognition

Language: Python - Size: 1.37 MB - Last synced at: 10 months ago - Pushed at: 10 months ago - Stars: 0 - Forks: 0

eminorhan/optimized-stmae

An optimized implementation of spatiotemporal masked autoencoders

Language: Python - Size: 152 KB - Last synced at: 11 months ago - Pushed at: 11 months ago - Stars: 0 - Forks: 0

RichardScottOZ/grid-mae

Investigate possibilities for Vision Transformers with multiscale grids

Language: Python - Size: 114 KB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

RichardScottOZ/torchgeo-cimae Fork of Modexus/torchgeo

TorchGeo: datasets, transforms, and models for geospatial data

Language: Python - Size: 72 MB - Last synced at: about 1 year ago - Pushed at: about 1 year ago - Stars: 0 - Forks: 0

Video-MAC/VideoMAC

Official code for CVPR2024 “VideoMAC: Video Masked Autoencoders Meet ConvNets”

Language: Python - Size: 5 MB - Last synced at: over 1 year ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 1

LorenzoBonanni/CVProject

Project for Computer Vision course @ MSc in Artificial Intelligence, UniVR

Language: TeX - Size: 5.45 MB - Last synced at: 7 months ago - Pushed at: over 1 year ago - Stars: 0 - Forks: 0

sebastienmeyer2/masked-change-detection

Change detection on satellite images with masked autoencoders.

Language: Python - Size: 5.57 MB - Last synced at: over 1 year ago - Pushed at: about 2 years ago - Stars: 0 - Forks: 0

Related Topics
self-supervised-learning 34 pytorch 18 deep-learning 15 masked-image-modeling 13 mae 12 vision-transformer 12 representation-learning 8 video-understanding 6 computer-vision 6 video-representation-learning 5 transformer 4 autoencoder 4 action-recognition 4 machine-learning 4 transformers 4 artificial-intelligence 3 medical-image-analysis 3 imagenet 3 graph-neural-networks 3 cnn 3 audio 3 hyperspectral-image-classification 3 pretrained-models 3 pretraining 3 foundation-models 3 bert 3 contrastive-learning 3 object-detection 3 unsupervised-learning 2 video-transformer 2 remote-sensing 2 optimized 2 sparse-convolution 2 speech 2 transformer-architecture 2 coco 2 ade20k 2 speech-synthesis 2 imagenet-classification 2 semantic-segmentation 2 vit 2 cvpr2023 2 audio-processing 2 convolutional-neural-networks 2 benchmark 2 attention-mechanism 2 pretrain 2 pytorch-lightning 2 segmentation 2 vision-transformers 2 density-estimation 2 neurips-2022 2 reconstruction 2 emotion-recognition 2 diffusion-models 2 gpt 1 generative-models 1 robustness 1 masked-modeling 1 pre-training 1 visual-navigation 1 video-analysis 1 awesome-mim 1 awesome-list 1 embodied-ai 1 seed-classification 1 3d 1 3d-deep-learning 1 tts 1 mlp-mixer 1 neural-networks 1 representations 1 resnet 1 variational-autoencoder 1 vq-vae 1 cifar10 1 jupyter-notebook 1 encoder-decoder 1 attribute-learning 1 big-model 1 pre-trained-big-model 1 vehicle-detection 1 vehiclemae 1 adversarial-learning 1 hydra 1 taskfile 1 generative-model 1 pitch-estimation 1 pitch-shift 1 speech-analysis 1 voice-conversion 1 eccv2024 1 spatio-temporal-action-localization 1 temporal-action-localization 1 video-clip 1 video-data 1 video-dataset 1 video-question-answering 1 video-retrieval 1 zero-shot-classification 1