Semi-Supervised Semantic Segmentation via Adaptive Equalization Learning, NeurIPS 2021 (Spotlight)

Last update: Dec 12, 2022

Abstract

Due to limited and even imbalanced data, semi-supervised semantic segmentation tends to perform poorly on certain categories, e.g., the tail categories in the Cityscapes dataset, which exhibits a long-tailed label distribution. Almost all existing approaches neglect this problem and treat categories equally. Some popular approaches, such as consistency regularization or pseudo-labeling, may even harm the learning of under-performing categories, because the predictions or pseudo labels for these categories can be too inaccurate to guide learning on the unlabeled data. In this paper, we look into this problem and propose a novel framework for semi-supervised semantic segmentation, named adaptive equalization learning (AEL). AEL adaptively balances the training of well-performing and poorly performing categories, using a confidence bank to dynamically track category-wise performance during training. The confidence bank serves as an indicator to tilt training towards under-performing categories, instantiated in three strategies: 1) adaptive Copy-Paste and CutMix data augmentation approaches, which give under-performing categories more chances to be copied or cut; 2) an adaptive data sampling approach that encourages pixels from under-performing categories to be sampled; 3) a simple yet effective re-weighting method to alleviate the training noise introduced by pseudo-labeling. Experimentally, AEL outperforms the state-of-the-art methods by a large margin on the Cityscapes and Pascal VOC benchmarks under various data partition protocols. For more details, please refer to our NeurIPS paper (arxiv).
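To make the confidence bank idea concrete, here is a minimal sketch in Python with NumPy. It is not the code from this repository: the class name ConfidenceBank, the exponential-moving-average update, the momentum value, and the way confidences are turned into sampling probabilities are all assumptions chosen for illustration. It only shows how a per-category score could be tracked and then used to favor under-performing categories.

```python
import numpy as np


class ConfidenceBank:
    """Hypothetical per-category confidence tracker (illustrative only)."""

    def __init__(self, num_classes, momentum=0.999):
        # Start every category at full confidence; the start value is an assumption.
        self.conf = np.ones(num_classes, dtype=np.float64)
        self.momentum = momentum

    def update(self, probs, labels):
        """probs: (N, C) softmax outputs; labels: (N,) ground-truth class ids."""
        for c in range(self.conf.shape[0]):
            mask = labels == c
            if mask.any():
                # Mean predicted probability of the true class for pixels of class c.
                mean_conf = probs[mask, c].mean()
                # Exponential moving average (assumed update rule).
                self.conf[c] = (
                    self.momentum * self.conf[c] + (1.0 - self.momentum) * mean_conf
                )

    def sampling_probs(self):
        """Give lower-confidence categories a higher chance of being picked."""
        weights = 1.0 - self.conf
        total = weights.sum()
        if total <= 0.0:
            # All categories fully confident: fall back to a uniform distribution.
            return np.full_like(self.conf, 1.0 / self.conf.shape[0])
        return weights / total


# Toy usage: four labeled pixels, three categories.
bank = ConfidenceBank(num_classes=3, momentum=0.5)
probs = np.array(
    [
        [0.9, 0.05, 0.05],
        [0.7, 0.2, 0.1],
        [0.3, 0.6, 0.1],
        [0.4, 0.4, 0.2],
    ]
)
labels = np.array([0, 0, 1, 2])
bank.update(probs, labels)
p = bank.sampling_probs()
```

In this toy run, category 2 ends up with the lowest tracked confidence, so it receives the largest sampling probability. The same probabilities could, in principle, drive which category is chosen for Copy-Paste or CutMix.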

Installation

Check INSTALL.md for installation instructions.

Training and Evaluation

For example, to train and evaluate with the 1/2 data partition on the Cityscapes dataset:

cd experiments/cityscapes_2
bash train.sh

For other partition protocols, change n_sup in config.yaml.
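As a rough aid for picking a value, the sketch below computes how many labeled images a 1/k partition would contain. It assumes that n_sup is the number of labeled training images, which this README does not state, and that counts are rounded up. The helper name labeled_count is hypothetical. Check the values against the config.yaml shipped with each experiment before relying on them.

```python
import math

# Size of the Cityscapes fine-annotation training split.
CITYSCAPES_TRAIN_IMAGES = 2975


def labeled_count(total_images: int, denominator: int) -> int:
    """Labeled images in a 1/denominator partition (rounding up is an assumption)."""
    if denominator <= 0:
        raise ValueError("denominator must be positive")
    return math.ceil(total_images / denominator)


# Candidate n_sup values for common partition protocols.
counts = {d: labeled_count(CITYSCAPES_TRAIN_IMAGES, d) for d in (2, 4, 8, 16)}
```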

TODO

Other SOTA semi-supervised segmentation methods

Owner

Hanzhe Hu
