Gated-Shape CNN for Semantic Segmentation (ICCV 2019)

Last update: Dec 26, 2022

Overview

GSCNN

This is the official code for:

Gated-SCNN: Gated Shape CNNs for Semantic Segmentation

Towaki Takikawa, David Acuna, Varun Jampani, Sanja Fidler

ICCV 2019 [Paper] [Project Page]

Based on based on https://github.com/NVIDIA/semantic-segmentation.

License

Copyright (C) 2019 NVIDIA Corporation. Towaki Takikawa, David Acuna, Varun Jampani, Sanja Fidler
All rights reserved.
Licensed under the CC BY-NC-SA 4.0 license (https://creativecommons.org/licenses/by-nc-sa/4.0/legalcode).

Permission to use, copy, modify, and distribute this software and its documentation
for any non-commercial purpose is hereby granted without fee, provided that the above
copyright notice appear in all copies and that both that copyright notice and this
permission notice appear in supporting documentation, and that the name of the author
not be used in advertising or publicity pertaining to distribution of the software
without specific, written prior permission.

THE AUTHOR DISCLAIMS ALL WARRANTIES WITH REGARD TO THIS SOFTWARE, INCLUDING ALL
IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS FOR ANY PARTICULAR PURPOSE.
IN NO EVENT SHALL THE AUTHOR BE LIABLE FOR ANY SPECIAL, INDIRECT OR CONSEQUENTIAL
DAMAGES OR ANY DAMAGES WHATSOEVER RESULTING FROM LOSS OF USE, DATA OR PROFITS,
WHETHER IN AN ACTION OF CONTRACT, NEGLIGENCE OR OTHER TORTIOUS ACTION, ARISING
OUT OF OR IN CONNECTION WITH THE USE OR PERFORMANCE OF THIS SOFTWARE.
~

Usage

Clone this repo

git clone https://github.com/nv-tlabs/GSCNN
cd GSCNN

Python requirements

Currently, the code supports Python 3

numpy
PyTorch (>=1.1.0)
torchvision
scipy
scikit-image
tensorboardX
tqdm
torch-encoding
opencv
PyYAML

Download pretrained models

Download the pretrained model from the Google Drive Folder, and save it in 'checkpoints/'

Download inferred images

Download (if needed) the inferred images from the Google Drive Folder

Evaluation (Cityscapes)

python train.py --evaluate --snapshot checkpoints/best_cityscapes_checkpoint.pth

Training

A note on training- we train on 8 NVIDIA GPUs, and as such, training will be an issue with WiderResNet38 if you try to train on a single GPU.

If you use this code, please cite:

@article{takikawa2019gated,
  title={Gated-SCNN: Gated Shape CNNs for Semantic Segmentation},
  author={Takikawa, Towaki and Acuna, David and Jampani, Varun and Fidler, Sanja},
  journal={ICCV},
  year={2019}
}

Gated-Shape CNN for Semantic Segmentation (ICCV 2019)

Related tags

Overview

GSCNN

Gated-SCNN: Gated Shape CNNs for Semantic Segmentation

License

Usage

Clone this repo

Python requirements

Download pretrained models

Download inferred images

Evaluation (Cityscapes)

Training

Owner

Code and results accompanying our paper titled Mixture Proportion Estimation and PU Learning: A Modern Approach at Neurips 2021 (Spotlight)

LiDAR R-CNN: An Efficient and Universal 3D Object Detector

Train an RL agent to execute natural language instructions in a 3D Environment (PyTorch)

Instance-based label smoothing for improving deep neural networks generalization and calibration

A Transformer-Based Siamese Network for Change Detection

Official implementation of the PICASO: Permutation-Invariant Cascaded Attentional Set Operator

Web-interface + rest API for classification and regression (https://jeff1evesque.github.io/machine-learning.docs)

Cross Quality LFW: A database for Analyzing Cross-Resolution Image Face Recognition in Unconstrained Environments

Implementation of GGB color space

SCAN: Learning to Classify Images without Labels, incl. SimCLR. [ECCV 2020]

An implementation of the Contrast Predictive Coding (CPC) method to train audio features in an unsupervised fashion.

Weakly Supervised Learning of Instance Segmentation with Inter-pixel Relations, CVPR 2019 (Oral)

Semi-SDP Semi-supervised parser for semantic dependency parsing.

UNet model with VGG11 encoder pre-trained on Kaggle Carvana dataset

unet-family: Ultimate version

BraTs-VNet - BraTS(Brain Tumour Segmentation) using V-Net

Pmapper is a super-resolution and deconvolution toolkit for python 3.6+

[CVPR2021] Look before you leap: learning landmark features for one-stage visual grounding.

"Neural Turing Machine" in Tensorflow

SEOVER: Sentence-level Emotion Orientation Vector based Conversation Emotion Recognition Model