FreeSOLO for unsupervised instance segmentation, CVPR 2022

Last update: Jan 02, 2023

Overview

FreeSOLO: Learning to Segment Objects without Annotations

This project hosts the code for implementing the FreeSOLO algorithm for unsupervised instance segmentation.

FreeSOLO: Learning to Segment Objects without Annotations,
Xinlong Wang, Zhiding Yu, Shalini De Mello, Jan Kautz, Anima Anandkumar, Chunhua Shen, Jose M. Alvarez
In: Proc. IEEE Conf. Computer Vision and Pattern Recognition (CVPR), 2022
arXiv preprint (arXiv 2202.12181)

Visual Results

Installation

Prerequisites

Linux or macOS with Python >= 3.6
PyTorch >= 1.5 and torchvision that matches the PyTorch installation.
scikit-image

Install PyTorch in Conda env

# create conda env
conda create -n detectron2 python=3.6
# activate the enviorment
conda activate detectron2
# install PyTorch >=1.5 with GPU
conda install pytorch torchvision -c pytorch

Build Detectron2 from Source

Follow the INSTALL.md to install Detectron2 (commit id 11528ce has been tested).

Datasets

Follow the datasets/README.md to set up the MS COCO dataset.

Pre-trained model

Download the DenseCL pre-trained model from here. Convert it to detectron2's format and put the converted model under "training_dir/pre-trained/DenseCL" directory.

python tools/convert-pretrain-to-detectron2.py {WEIGHT_FILE}.pth {WEIGHT_FILE}.pkl

Usage

Free Mask

Download the prepared free masks in json format from here. Put it under "datasets/coco/annotations" directory. Or, generate it by yourself:

bash inference_freemask.sh

Training

# train with free masks
bash train.sh

# generate pseudo labels
bash gen_pseudo_labels.sh

# self-train
bash train_pl.sh

Testing

Download the trained model from here.

bash test.sh {MODEL_PATH}

Citations

Please consider citing our paper in your publications if the project helps your research. BibTeX reference is as follow.

@article{wang2022freesolo,
  title={{FreeSOLO}: Learning to Segment Objects without Annotations},
  author={Wang, Xinlong and Yu, Zhiding and De Mello, Shalini and Kautz, Jan and Anandkumar, Anima and Shen, Chunhua and Alvarez, Jose M},
  journal={arXiv preprint arXiv:2202.12181},
  year={2022}
}

FreeSOLO for unsupervised instance segmentation, CVPR 2022

Related tags

Overview

FreeSOLO: Learning to Segment Objects without Annotations

Visual Results

Installation

Prerequisites

Install PyTorch in Conda env

Build Detectron2 from Source

Datasets

Pre-trained model

Usage

Free Mask

Training

Testing

Citations

Owner

NVIDIA Research Projects

Team Enigma at ArgMining 2021 Shared Task: Leveraging Pretrained Language Models for Key Point Matching

Explainable Zero-Shot Topic Extraction

This repository contains code used to audit the stability of personality predictions made by two algorithmic hiring systems

A Pose Estimator for Dense Reconstruction with the Structured Light Illumination Sensor

A Differentiable Recipe for Learning Visual Non-Prehensile Planar Manipulation

MAUS: A Dataset for Mental Workload Assessment Using Wearable Sensor - Baseline system

Using pytorch to implement unet network for liver image segmentation.

maximal update parametrization (µP)

Unofficial implementation of "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows" (https://arxiv.org/abs/2103.14030)

Learning from Guided Play: A Scheduled Hierarchical Approach for Improving Exploration in Adversarial Imitation Learning Source Code

StarGANv2-VC: A Diverse, Unsupervised, Non-parallel Framework for Natural-Sounding Voice Conversion

Pytorch Implementation of Interaction Networks for Learning about Objects, Relations and Physics

Memory efficient transducer loss computation

Display, filter and search log messages in your terminal

AI Flow is an open source framework that bridges big data and artificial intelligence.

Code I use to automatically update my videos' metadata on YouTube

A novel method to tune language models. Codes and datasets for paper ``GPT understands, too''.

This repository is for our paper Exploiting Scene Graphs for Human-Object Interaction Detection accepted by ICCV 2021.

Pytorch implementation of NeurIPS 2021 paper: Geometry Processing with Neural Fields.

Code for our ACL 2021 paper "One2Set: Generating Diverse Keyphrases as a Set"