code for `Look Closer to Segment Better: Boundary Patch Refinement for Instance Segmentation`

Last update: Jan 05, 2023

Related tags

Overview

Look Closer to Segment Better: Boundary Patch Refinement for Instance Segmentation (CVPR 2021)

Introduction

PBR is a conceptually simple yet effective post-processing refinement framework to improve the boundary quality of instance segmentation. Following the idea of looking closer to segment boundaries better, BPR extracts and refines a series of small boundary patches along the predicted instance boundaries. The proposed BPR framework (as shown below) yields significant improvements over the Mask R-CNN baseline on the Cityscapes benchmark, especially on the boundary-aware metrics.

For more details, please refer to our paper.

Installation

Please refer to INSTALL.md.

Training

Prepare patches dataset [optional]

First, you need to generate the instance segmentation results on the Cityscapes training and validation set, as the following format:

maskrcnn_train
- aachen_000000_000019_leftImg8bit_pred.txt
- aachen_000001_000019_leftImg8bit_0_person.png
- aachen_000001_000019_leftImg8bit_10_car.png
- ...

maskrcnn_val
- frankfurt_000001_064130_leftImg8bit_pred.txt
- frankfurt_000001_064305_leftImg8bit_0_person.png
- frankfurt_000001_064305_leftImg8bit_10_motorcycle.png
- ...

The content of the txt file is the same as the standard format required by cityscape script, e.g.:

frankfurt_000000_000294_leftImg8bit_0_person.png 24 0.9990299940109253
frankfurt_000000_000294_leftImg8bit_1_person.png 24 0.9810258746147156
...

Then use the provided script to generate the training set:

sh tools/prepare_dataset.sh \
  maskrcnn_train \
  maskrcnn_val \
  maskrcnn_r50

Note that this step can take about 2 hours. Feel free to skip it by downloading the processed training set.

Train the network

Point DATA_ROOT to the patches dataset and run the training script

DATA_ROOT=maskrcnn_r50/patches \
bash tools/dist_train.sh \
  configs/bpr/hrnet18s_128.py \
  4

Inference

Suppose you have some instance segmentation results of Cityscapes dataset, as the following format:

maskrcnn_val
- frankfurt_000001_064130_leftImg8bit_pred.txt
- frankfurt_000001_064305_leftImg8bit_0_person.png
- frankfurt_000001_064305_leftImg8bit_10_motorcycle.png
- ...

We provide a script (tools/inference.sh) to perform refinement operation, usage:

IOU_THRESH=0.55 \
IMG_DIR=data/cityscapes/leftImg8bit/val \
GT_JSON=data/cityscapes/annotations/instancesonly_filtered_gtFine_val.json \
BPR_ROOT=. \
GPUS=4 \
sh tools/inference.sh configs/bpr/hrnet48_256.py ckpts/hrnet48_256.pth maskrcnn_val maskrcnn_val_refined

The refinement results will be saved in maskrcnn_val_refined/refined.

For COCO model, use tools/inference_coco.sh instead.

Models

Backbone	Dataset	Checkpoint
HRNet-18s	Cityscapes	Tsinghua Cloud
HRNet-48	Cityscapes	Tsinghua Cloud
HRNet-18s	COCO	Tsinghua Cloud

Acknowledgement

This project is based on mmsegmentation code base.

Citation

If you find this project useful in your research, please consider citing:

@article{tang2021look,
  title={Look Closer to Segment Better: Boundary Patch Refinement for Instance Segmentation},
  author={Chufeng Tang and Hang Chen and Xiao Li and Jianmin Li and Zhaoxiang Zhang and Xiaolin Hu},
  journal={arXiv preprint arXiv:2104.05239},
  year={2021}
}

code for `Look Closer to Segment Better: Boundary Patch Refinement for Instance Segmentation`

Related tags

Overview

Look Closer to Segment Better: Boundary Patch Refinement for Instance Segmentation (CVPR 2021)

Introduction

Installation

Training

Prepare patches dataset [optional]

Train the network

Inference

Models

Acknowledgement

Citation

Owner

H.Chen

Ludwig Benchmarking Toolkit

A small library for creating and manipulating custom JAX Pytree classes

YOLOv5 Series Multi-backbone, Pruning and quantization Compression Tool Box.

Improving adversarial robustness by a coupling rejection strategy

AdaSpeech 2: Adaptive Text to Speech with Untranscribed Data

Repository for tackling Kaggle Ultrasound Nerve Segmentation challenge using Torchnet.

Multi-Glimpse Network With Python

Third party Pytorch implement of Image Processing Transformer (Pre-Trained Image Processing Transformer arXiv:2012.00364v2)

Codes for "Template-free Prompt Tuning for Few-shot NER".

Source code and Dataset creation for the paper "Neural Symbolic Regression That Scales"

PINN(s): Physics-Informed Neural Network(s) for von Karman vortex street

Consumer Fairness in Recommender Systems: Contextualizing Definitions and Mitigations

The official code repo of "HTS-AT: A Hierarchical Token-Semantic Audio Transformer for Sound Classification and Detection"

PyTorch implementation of DARDet: A Dense Anchor-free Rotated Object Detector in Aerial Images

This repo is to present various code demos on how to use our Graph4NLP library.

mPose3D, a mmWave-based 3D human pose estimation model.

Autonomous Perception: 3D Object Detection with Complex-YOLO

The PyTorch implementation of paper REST: Debiased Social Recommendation via Reconstructing Exposure Strategies

PSANet: Point-wise Spatial Attention Network for Scene Parsing, ECCV2018.

Image Captioning on google cloud platform based on iot