DetCo: Unsupervised Contrastive Learning for Object Detection

Last update: Dec 18, 2022

Related tags

Deep Learning DetCo

Overview

DetCo: Unsupervised Contrastive Learning for Object Detection

arxiv link

News

Sparse RCNN+DetCo improves from 45.0 AP to 46.5 AP(+1.5) with 3x+ms train. See details in SparseRCNN.
Pretrained weights has been released.

Highlights

State-of-the-art transfer performance on dense prediction tasks.
Improving 1.6/1.2/1.0 AP than supervised ImageNet pretrain on Mask RCNN-C4/FPN/RetinaNet with COCO 1x schedule.
Comprehensively improving most instance-level detection and semantic segmentation tasks.

Pipeline

Performances

Install

Same as OpenSelfSup.

Codes

Pretext Task Pretrain

Coming Soon.

Transfer to Downstream tasks

We provide training scripts on COCO, because the performance of COCO is more stable than VOC and Cityscapes. See results in Table 3-5 and Table 13.

We provide Mask RCNN-C4, Mask RCNN-FPN and RetinaNet with 12k, 90k and 180k iterations.

First, you need to download model(.pkl) to benchmarks/detection/pths, and convert pretrain model to detectron2_version. See this script.

Second, start training and testing.

sh tools_local/dist_test_coco.sh $PTH $WORK_DIR

For example:

sh tools_local/dist_test_coco.sh benchmarks/detection/pths/detco_200ep_AA.pkl benchmarks/detection/work_dirs/detco_AA

Download Models

DetCo-200ep: [Google Drive], [Baidu Drive] Fetch Code: okfp

DetCo-200ep-AA: [Google Drive], [Baidu Drive] Fetch Code: fg7h

Citations

Please consider citing our paper in your publications if the project helps your research. BibTeX reference is as follows.

@misc{xie2021detco,
      title={DetCo: Unsupervised Contrastive Learning for Object Detection}, 
      author={Enze Xie and Jian Ding and Wenhai Wang and Xiaohang Zhan and Hang Xu and Zhenguo Li and Ping Luo},
      year={2021},
      eprint={2102.04803},
      archivePrefix={arXiv},
      primaryClass={cs.CV}
}

Acknowledges

We would like to thank Huawei AI Theory Group to support 200+ V100 GPUs for this research project without which this work would not be possible.

License

For academic use, this project is licensed under the 2-clause BSD License - see the LICENSE file for details. For commercial use, please contact the authors.

DetCo: Unsupervised Contrastive Learning for Object Detection

Related tags

Overview

DetCo: Unsupervised Contrastive Learning for Object Detection

News

Highlights

Pipeline

Performances

Install

Codes

Pretext Task Pretrain

Transfer to Downstream tasks

Download Models

Citations

Acknowledges

License

Owner

Enze Xie

Virtual Dance Reality Stage: a feature that offers you to share a stage with another user virtually

This repo in the implementation of EMNLP'21 paper "SPARQLing Database Queries from Intermediate Question Decompositions" by Irina Saparina, Anton Osokin

MERLOT: Multimodal Neural Script Knowledge Models

Code and models for "Rethinking Deep Image Prior for Denoising" (ICCV 2021)

Deep Implicit Moving Least-Squares Functions for 3D Reconstruction

(JMLR' 19) A Python Toolbox for Scalable Outlier Detection (Anomaly Detection)

PyTorch implementation of Wide Residual Networks with 1-bit weights by McDonnell (ICLR 2018)

This program was designed to detect whether someone is wearing a facemask through a live video stream.

WebUAV-3M: A Benchmark Unveiling the Power of Million-Scale Deep UAV Tracking

Source code for "FastBERT: a Self-distilling BERT with Adaptive Inference Time".

Efficient electromagnetic solver based on rigorous coupled-wave analysis for 3D and 2D multi-layered structures with in-plane periodicity

Full Resolution Residual Networks for Semantic Image Segmentation

Code needed to reproduce the examples found in "The Temporal Robustness of Stochastic Signals"

Generate indoor scenes with Transformers

Random-Afg - Afghanistan Random Old Idz Cloner Tools

[ACMMM 2021, Oral] Code release for "Elastic Tactile Simulation Towards Tactile-Visual Perception"

PyTorch implementation of MSBG hearing loss model and MBSTOI intelligibility metric

Manifold-Mixup implementation for fastai V2

L-Verse: Bidirectional Generation Between Image and Text

a curated list of docker-compose files prepared for testing data engineering tools, databases and open source libraries.