Region-aware Contrastive Learning for Semantic Segmentation, ICCV 2021

Abstract

Recent works have made great success in semantic segmentation by exploiting contextual information in a local or global manner within individual image and supervising the model with pixel-wise cross entropy loss. However, from the holistic view of the whole dataset, semantic relations not only exist inside one single image, but also prevail in the whole training data, which makes solely considering intra-image correlations insufficient. Inspired by recent progress in unsupervised contrastive learning, we propose the region-aware contrastive learning (RegionContrast) for semantic segmentation in the supervised manner. In order to enhance the similarity of semantically similar pixels while keeping the discrimination from others, we employ contrastive learning to realize this objective. With the help of memory bank, we explore to store all the representative features into the memory. Without loss of generality, to efficiently incorporate all training data into the memory bank while avoiding taking too much computation resource, we propose to construct region centers to represent features from different categories for every image. Hence, the proposed region-aware contrastive learning is performed in a region level for all the training data, which saves much more memory than methods exploring the pixel-level relations. The proposed RegionContrast brings little computation cost during training and requires no extra overhead for testing. Extensive experiments demonstrate that our method achieves state-of-the-art performance on three benchmark datasets including Cityscapes, ADE20K and COCO Stuff. For more details, please refer to our ICCV paper (paper).

Installation

Check INSTALL.md for installation instructions.

Training and Evaluation

cd experiments/v3_contrast
bash train.sh

Citation

@InProceedings{Hu_2021_ICCV,
    author    = {Hu, Hanzhe and Cui, Jinshi and Wang, Liwei},
    title     = {Region-Aware Contrastive Learning for Semantic Segmentation},
    booktitle = {Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV)},
    month     = {October},
    year      = {2021},
    pages     = {16291-16301}
}

TODO

Dynamic Sampling

Region-aware Contrastive Learning for Semantic Segmentation, ICCV 2021

Related tags

Overview

Region-aware Contrastive Learning for Semantic Segmentation, ICCV 2021

Abstract

Installation

Training and Evaluation

Citation

TODO

Owner

Hanzhe Hu

Class activation maps for your PyTorch models (CAM, Grad-CAM, Grad-CAM++, Smooth Grad-CAM++, Score-CAM, SS-CAM, IS-CAM, XGrad-CAM, Layer-CAM)

This is an implementation of PIFuhd based on Pytorch

Using this codebase as a tool for my own research. Making some modifications to the original repo for my own purposes.

🐦 Quickly annotate data from the comfort of your Jupyter notebook

This is a simple face recognition mini project that was completed by a team of 3 members in 1 week's time

Traditional deepdream with VQGAN+CLIP and optical flow. Ready to use in Google Colab

This repository consists of Blender python scripts and corresponding assets to generate variants of the CANDLE dataset

Pytorch implementation of Value Iteration Networks (NIPS 2016 best paper)

Scalable implementation of Lee / Mykland (2012) and Ait-Sahalia / Jacod (2012) Jump tests for noisy high frequency data

Implémentation en pyhton de l'article Depixelizing pixel art de Johannes Kopf et Dani Lischinski

Migration of Edge-based Distributed Federated Learning

Full-featured Decision Trees and Random Forests learner.

Conformer: Local Features Coupling Global Representations for Visual Recognition

Semi-supervised Representation Learning for Remote Sensing Image Classification Based on Generative Adversarial Networks

Video-Captioning - A machine Learning project to generate captions for video frames indicating the relationship between the objects in the video

Code and data for ACL2021 paper Cross-Lingual Abstractive Summarization with Limited Parallel Resources.

Based on Stockfish neural network(similar to LcZero)

Code for paper "ASAP-Net: Attention and Structure Aware Point Cloud Sequence Segmentation"

"Moshpit SGD: Communication-Efficient Decentralized Training on Heterogeneous Unreliable Devices", official implementation

Where2Act: From Pixels to Actions for Articulated 3D Objects