Group R-CNN for Point-based Weakly Semi-supervised Object Detection (CVPR2022)

Overview


By Shilong Zhang*, Zhuoran Yu*, Liyang Liu*, Xinjiang Wang, Aojun Zhou, Kai Chen

Abstract:

We study the problem of weakly semi-supervised object detection with points (WSSOD-P), where the training data is a combination of a small set of fully annotated images with bounding boxes and a large set of weakly-labeled images with only a single point annotated for each instance. The core of this task is to train a point-to-box regressor on the well-labeled images that can predict credible bounding boxes for each point annotation. Group R-CNN significantly outperforms the prior method Point DETR by 3.9 mAP with 5% well-labeled images, which is the most challenging scenario.

Install

The project has been fully tested under MMDetection V2.22.0 and MMCV V1.4.6; other versions may not be compatible, so you have to install MMCV and MMDetection first. You can refer to Installation of MMCV & Installation of MMDetection.
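
As a quick sanity check after installation, you can confirm the installed versions from Python (a minimal sketch; 1.4.6 and 2.22.0 are the versions this project was tested with):

# Optional sanity check of the installed versions.
import mmcv
import mmdet

print(mmcv.__version__)   # tested with 1.4.6
print(mmdet.__version__)  # tested with 2.22.0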

Prepare the dataset

mmdetection
├── data
│   ├── coco
│   │   ├── annotations
│   │   │      ├──instances_train2017.json
│   │   │      ├──instances_val2017.json
│   │   ├── train2017
│   │   ├── val2017

You can generate point annotations with the commands below. It may take several minutes for instances_train2017.json.

python tools/generate_anns.py data/coco/annotations/instances_train2017.json
python tools/generate_anns.py data/coco/annotations/instances_val2017.json

You will then find a point_ann directory; all annotations in this directory contain point annotations. Replace the original annotations in data/coco/annotations with the generated ones.
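
For reference, the core idea of this generation step is to sample one point inside the mask of each instance. The snippet below is only an illustrative sketch of that idea, not the actual tools/generate_anns.py implementation (whose output format may differ); it assumes pycocotools and numpy are installed.

# Illustrative only: sample one point inside an instance mask.
import numpy as np
from pycocotools.coco import COCO

coco = COCO('data/coco/annotations/instances_train2017.json')
rng = np.random.default_rng(0)

def sample_point(ann):
    """Sample an (x, y) point that lies inside the instance mask."""
    mask = coco.annToMask(ann)           # H x W binary mask
    ys, xs = np.nonzero(mask)
    if len(xs) == 0:                     # degenerate mask: fall back to the bbox center
        x, y, w, h = ann['bbox']
        return [x + w / 2.0, y + h / 2.0]
    i = rng.integers(len(xs))
    return [float(xs[i]), float(ys[i])]

example_ann = coco.loadAnns(coco.getAnnIds()[:1])[0]
print(sample_point(example_ann))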

NOTES

Here, we sample a point from the mask for all instances, but we split the images into two divisions in :class:PointCocoDataset:

  • Images with only bbox annotations (well-labeled images): used only in the training phase. In each iteration, we sample a point from each instance's bbox as its point annotation (see the sketch after this list).
  • Images with only point annotations (weakly-labeled images): used only to generate bbox annotations from point annotations with the trained point-to-box regressor.
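
Below is a minimal, hypothetical sketch of this split (the field names are illustrative and this is not the actual :class:PointCocoDataset code): a well-labeled image gets a fresh point drawn inside each ground-truth box every iteration, while a weakly-labeled image keeps its fixed point annotations.

# Hypothetical sketch of the per-iteration point sampling described above.
import numpy as np

rng = np.random.default_rng()

def point_from_bbox(bbox):
    """bbox assumed in (x1, y1, x2, y2) format; returns a random (x, y) inside it."""
    x1, y1, x2, y2 = bbox
    return [rng.uniform(x1, x2), rng.uniform(y1, y2)]

def get_point_annotations(sample):
    if sample['well_labeled']:                  # bbox-annotated image
        return [point_from_bbox(b) for b in sample['gt_bboxes']]
    return sample['gt_points']                  # weakly-labeled image: keep the given points

# Example with a hypothetical sample dict:
print(get_point_annotations({'well_labeled': True, 'gt_bboxes': [[10, 20, 110, 220]]}))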

Train and Test

In the commands below, 8 is the number of GPUs.

For slurm

Train

GPUS=8 sh tools/slurm_train.sh partition_name  job_name projects/configs/10_coco/group_rcnn_24e_10_percent_coco_detr_augmentation.py  ./exp/group_rcnn

Evaluate the quality of the generated bbox annotations on the val dataset with pre-defined point annotations.

GPUS=8 sh tools/slurm_test.sh partition_name  job_name projects/configs/10_coco/group_rcnn_24e_10_percent_coco_detr_augmentation.py ./exp/group_rcnn/latest.pth --eval bbox

Run the inference process on weakly-labeled images with point annotations to get bbox annotations.

GPUS=8 sh tools/slurm_test.sh partition_name  job_name  projects/configs/10_coco/group_rcnn_50e_10_percent_coco_detr_augmentation.py   path_to_checkpoint  --format-only --options  "jsonfile_prefix=./generated"

For PyTorch distributed

Train

sh tools/dist_train.sh projects/configs/10_coco/group_rcnn_24e_10_percent_coco_detr_augmentation.py 8 --work-dir ./exp/group_rcnn

Evaluate the quality of the generated bbox annotations on the val dataset with pre-defined point annotations.

sh tools/dist_test.sh  projects/configs/10_coco/group_rcnn_24e_10_percent_coco_detr_augmentation.py  path_to_checkpoint 8 --eval bbox

Run the inference process on weakly-labeled images with point annotations to get bbox annotations.

sh tools/dist_test.sh  projects/configs/10_coco/group_rcnn_50e_10_percent_coco_detr_augmentation.py   path_to_checkpoint 8 --format-only --options  "jsonfile_prefix=./data/coco/annotations/generated"

Then you can train the student model FCOS.

sh tools/dist_train.sh projects/configs/10_coco/01_student_fcos.py 8 --work-dir ./exp/01_student_fcos

Results & Checkpoints

We find that the performance of the teacher is unstable under the 24e setting and may fluctuate by about 0.2 mAP. We report the average.

| Model | Backbone | Lr schd | Augmentation | box AP | Config | Model | Log | Generated Annotations |
| --- | --- | --- | --- | --- | --- | --- | --- | --- |
| Teacher (Group R-CNN) | R-50-FPN | 24e | DETR Aug | 39.2 | config | ckpt | log | - |
| Teacher (Group R-CNN) | R-50-FPN | 50e | DETR Aug | 39.9 | config | ckpt | log | generated.bbox.json |
| Student (FCOS) | R-50-FPN | 12e | Normal 1x Aug | 33.1 | config | ckpt | log | - |

Owner

Shilong Zhang