awesome-MIM

Reading list for research topics in Masked Image Modeling (MIM).

We list the most popular methods for MIM. If we missed something, please submit a request. (Note: the dates shown are those of the first arXiv version of each paper, but the paper links may not point to that earliest version.)

Self-supervised Vision Transformers as backbone models.

Date	Method	Conference	Title	Code
2021-06-14	BEiT	ICLR 2022 (Oral)	BEiT: BERT Pre-Training of Image Transformers	BEiT
2021-11-11	MAE	arXiv 2021	Masked Autoencoders Are Scalable Vision Learners	MAE
2021-11-15	iBOT	arXiv 2021	iBOT: Image BERT Pre-Training with Online Tokenizer	iBOT
2021-11-18	SimMIM	arXiv 2021	SimMIM: A Simple Framework for Masked Image Modeling	SimMIM
2021-12-16	MaskFeat	arXiv 2021	Masked Feature Prediction for Self-Supervised Visual Pre-Training	None
2021-12-20	SplitMask	arXiv 2021	Are Large-scale Datasets Necessary for Self-Supervised Pre-training?	None
2022-01-31	ADIOS	arXiv 2022	Adversarial Masking for Self-Supervised Learning	None
2022-02-07	CAE	arXiv 2022	Context Autoencoder for Self-Supervised Representation Learning	None
2022-02-07	CIM	arXiv 2022	Corrupted Image Modeling for Self-Supervised Visual Pre-Training	None

Owner

ligang