UniMoCo: Unsupervised, Semi-Supervised and Full-Supervised Visual Representation Learning

Last update: Jan 02, 2023

Related tags

Overview

UniMoCo: Unsupervised, Semi-Supervised and Full-Supervised Visual Representation Learning

This is the official PyTorch implementation for UniMoCo paper:

@article{dai2021unimoco,
  author  = {Zhigang Dai and Bolun Cai and Yugeng Lin and Junying Chen},
  title   = {UniMoCo: Unsupervised, Semi-Supervised and Full-Supervised Visual Representation Learning},
  journal = {arXiv preprint arXiv:2103.10773},
  year    = {2021},
}

In UniMoCo, we generalize MoCo to a unified contrastive learning framework, which supports unsupervised, semi-supervised and full-supervised visual representation learning. Based on MoCo, we maintain a label queue to store supervised labels. With the label queue, we can construct the multi-hot target on-the-fly, which represents postives and negatives of the given query. Besides, we propose a unified contrastive loss to deal with arbitrary number of positives and negatives. There is a comparison between MoCo and UniMoCo.

ImageNet Pre-training

Data Preparation

Install PyTorch and ImageNet dataset following the official PyTorch ImageNet training code.

Pre-training

To perform supervised contrastive learning of ResNet-50 model on ImageNet with 8 gpus for 800 epochs, run:

python main_unimoco.py \
  -a resnet50 \
  --lr 0.03 \
  --batch-size 256 \
  --epochs 800 \
  --dist-url 'tcp://localhost:10001' \
  --multiprocessing-distributed --world-size 1 --rank 0 \
  --mlp \
  --moco-t 0.2 \
  --aug-plus \
  --cos \
  [your imagenet-folder with train and val folders]

By default, the script performs full-supervised contrasitve learning.

Set --supervised-list to perform semi-supervised contrastive learning with different label ratios. For exmaple, 60% labels: --supervised-list ./label_info/60percent.txt.

This script uses all the default hyper-parameters as described in the MoCo v2.

Results

ImageNet Linear classification and COCO detection 1x schedule (R50-C4) results:

model	ratios	top-1 acc.	top-5 acc.	COCO AP
UniMoCo	0%	71.1	90.1	39.0
UniMoCo	10%	72.0	90.3	39.3
UniMoCo	30%	75.1	92.5	39.6
UniMoCo	60%	76.2	93.0	39.8
UniMoCo	100%	76.4	93.1	39.6

Check more details about linear classification and detection fine-tuning on MoCo.

Models are coming soon.

License

This project is under the CC-BY-NC 4.0 license. See LICENSE for details.

UniMoCo: Unsupervised, Semi-Supervised and Full-Supervised Visual Representation Learning

Related tags

Overview

UniMoCo: Unsupervised, Semi-Supervised and Full-Supervised Visual Representation Learning

ImageNet Pre-training

Data Preparation

Pre-training

Results

License

Owner

dddzg

Serve TensorFlow ML models with TF-Serving and then create a Streamlit UI to use them

Compact Bidirectional Transformer for Image Captioning

Cortex-compatible model server for Python and TensorFlow

ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator

Use unsupervised and supervised learning to predict stocks

PiRapGenerator - Make anyone rap the digits of pi

DrWhy is the collection of tools for eXplainable AI (XAI). It's based on shared principles and simple grammar for exploration, explanation and visualisation of predictive models.

[NeurIPS 2021] A weak-shot object detection approach by transferring semantic similarity and mask prior.

This repository contains the code and models necessary to replicate the results of paper: How to Robustify Black-Box ML Models? A Zeroth-Order Optimization Perspective

A colab notebook for training Stylegan2-ada on colab, transfer learning onto your own dataset.

VGG16 model-based classification project about brain tumor detection.

Summary of related papers on visual attention

Official Pytorch Implementation of Unsupervised Image Denoising with Frequency Domain Knowledge

Implementation of Wasserstein adversarial attacks.

A Multi-modal Model Chinese Spell Checker Released on ACL2021.

This repository contains python code necessary to replicated the experiments performed in our paper "Invariant Ancestry Search"

A GUI for Face Recognition, based upon Docker, Tkinter, GPU and a camera device.

Single-step adversarial training (AT) has received wide attention as it proved to be both efficient and robust.

The code is the training example of AAAI2022 Security AI Challenger Program Phase 8: Data Centric Robot Learning on ML models.

Predictive AI layer for existing databases.