SOTR: Segmenting Objects with Transformers [ICCV 2021]

Last update: Dec 20, 2022

Related tags

Deep Learning SOTR

Overview

SOTR: Segmenting Objects with Transformers [ICCV 2021]

By Ruohao Guo, Dantong Niu, Liao Qu, Zhenbo Li

Introduction

This is the official implementation of SOTR.

Models

COCO Instance Segmentation Baselines with SOTR

Name	mask AP	AP_S	AP_M	AP_L	download
SOTR_R101	40.2	10.2	59.0	73.1	model
SOTR_R101_DCN	42.0	11.4	60.7	74.5	model

Installation & Quick start

First install Detectron2 following the official guide: INSTALL.md.
Then build SOTR with:

https://github.com/easton-cau/SOTR
cd SOTR
python setup.py build develop

Then follow datasets/README.md to set up the datasets (e.g., MS-COCO).

Evaluating

Download the trained models for COCO.

Run the following command

python tools/train_net.py \
    --config-file configs/SOTR/R101.yaml \
    --eval-only \
    --num-gpus 4 \
    MODEL.WEIGHTS work_dir/SOTR_R101/SOTR_R101.pth

Training

Run the following command

python tools/train_net.py \
    --config-file configs/SOTR/R101.yaml \
    --num-gpus 4 \

Acknowledgement

Thanks Detectron2 and AdelaiDet contribution to the community!

The work is supported by National Key R&D Program of China (2020YFD0900204) and Key-Area Research and Development Program of Guangdong Province China (2020B0202010009).

FAQ

If you want to improve the usability or any piece of advice, please feel free to contant directly ([email protected]).

Citation

Please consider citing our paper in your publications if the project helps your research. BibTeX reference is as follow.

@misc{guo2021sotr,
      title={SOTR: Segmenting Objects with Transformers}, 
      author={Ruohao Guo and Dantong Niu and Liao Qu and Zhenbo Li},
      year={2021},
      eprint={2108.06747},
      archivePrefix={arXiv},
      primaryClass={cs.CV}
}

SOTR: Segmenting Objects with Transformers [ICCV 2021]

Related tags

Overview

SOTR: Segmenting Objects with Transformers [ICCV 2021]

Introduction

Models

COCO Instance Segmentation Baselines with SOTR

Installation & Quick start

Acknowledgement

FAQ

Citation

Owner

Code to reproduce results from the paper "AmbientGAN: Generative models from lossy measurements"

Official PyTorch implementation of PS-KD

Implementation of SSMF: Shifting Seasonal Matrix Factorization

A Tensorflow implementation of the Text Conditioned Auxiliary Classifier Generative Adversarial Network for Generating Images from text descriptions

A-SDF: Learning Disentangled Signed Distance Functions for Articulated Shape Representation (ICCV 2021)

Pytorch implementation of NEGEV method. Paper: "Negative Evidence Matters in Interpretable Histology Image Classification".

Implementation of paper "Decision-based Black-box Attack Against Vision Transformers via Patch-wise Adversarial Removal"

Code in PyTorch for the convex combination linear IAF and the Householder Flow, J.M. Tomczak & M. Welling

[CVPR2021] The source code for our paper 《Removing the Background by Adding the Background: Towards Background Robust Self-supervised Video Representation Learning》.

Implementation of neural class expression synthesizers

Deep Illuminator is a data augmentation tool designed for image relighting. It can be used to easily and efficiently generate a wide range of illumination variants of a single image.

UniLM AI - Large-scale Self-supervised Pre-training across Tasks, Languages, and Modalities

Repository for the paper "PoseAug: A Differentiable Pose Augmentation Framework for 3D Human Pose Estimation", CVPR 2021.

なりすまし検出(anti-spoof-mn3)のWebカメラ向けデモ

paper: Hyperspectral Remote Sensing Image Classification Using Deep Convolutional Capsule Network

Code for "Training Neural Networks with Fixed Sparse Masks" (NeurIPS 2021).

DziriBERT: a Pre-trained Language Model for the Algerian Dialect

Model-based 3D Hand Reconstruction via Self-Supervised Learning, CVPR2021

A program that can analyze videos according to the weights you select

Repository aimed at compiling code, papers, demos etc.. related to my PhD on 3D vision and machine learning for fruit detection and shape estimation at the university of Lincoln