List of awesome things around semantic segmentation 🎉

Last update: Nov 26, 2022

Overview

Awesome Semantic Segmentation

List of awesome things around semantic segmentation 🎉

Semantic segmentation is a computer vision task in which we label specific regions of an image according to what's being shown. Semantic segmentation awswers for the question: "What's in this image, and where in the image is it located?".

Semantic segmentation is a critical module in robotics related applications, especially autonomous driving, remote sensing. Most of the research on semantic segmentation is focused on improving the accuracy with less attention paid to computationally efficient solutions.

The recent appoarch in semantic segmentation is using deep neural network, specifically Fully Convolutional Network (a.k.a FCN). We can follow the trend of semantic segmenation approach at: paper-with-code.

Evaluate metrics: mIOU, accuracy, speed,...

State-Of-The-Art (SOTA) methods of Semantic Segmentation

	Paper	Benchmark on PASALVOC12	Release	Implement
EfficientNet-L2+NAS-FPN	Rethinking Pre-training and Self-training	90.5%	NeurIPS 2020	TF
DeepLab V3+	Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation	89%	ECCV 2018	TF, Keras, Pytorch, Demo
DeepLab V3	Rethinking Atrous Convolution for Semantic Image Segmentation	86.9%	17 Jun 2017	TF, TF
Smooth Network with Channel Attention Block	Learning a Discriminative Feature Network for Semantic Segmentation	86.2%	CVPR 2018	Pytorch
PSPNet	Pyramid Scene Parsing Network	85.4%	CVPR 2017	Keras, Pytorch, Pytorch
ResNet-38 MS COCO	Wider or Deeper: Revisiting the ResNet Model for Visual Recognition	84.9%	30 Nov 2016	MXNet
RefineNet	RefineNet: Multi-Path Refinement Networks for High-Resolution Semantic Segmentation	84.2%	CVPR 2017	Matlab, Keras
GCN	Large Kernel Matters -- Improve Semantic Segmentation by Global Convolutional Network	83.6%	CVPR 2017	TF
CRF-RNN	Conditional Random Fields as Recurrent Neural Networks	74.7%	ICCV 2015	Matlab, TF
ParseNet	ParseNet: Looking Wider to See Better	69.8%	15 Jun 2015	Caffe
Dilated Convolutions	Multi-Scale Context Aggregation by Dilated Convolutions	67.6%	23 Nov 2015	Caffe
FCN	Fully Convolutional Networks for Semantic Segmentation	67.2%	CVPR 2015	Caffe

Variants

FCN with VGG(Resnet, Densenet) backbone: pytorch
The easiest implementation of fully convolutional networks (FCN8s VGG): pytorch
TernausNet (UNet model with VGG11 encoder pre-trained on Kaggle Carvana dataset paper: pytorch
TernausNetV2: Fully Convolutional Network for Instance Segmentation: pytorch

Review list of Semantic Segmentation

Evolution of Image Segmentation using Deep Convolutional Neural Network: A Survey 2020 (University of Gour Banga,India) ⭐ ⭐ ⭐ ⭐ ⭐
A peek of Semantic Segmentation 2018 (mc.ai) ⭐ ⭐ ⭐ ⭐
Semantic Segmentation guide 2018 (towardds) ⭐ ⭐ ⭐ ⭐
An overview of semantic image segmentation (jeremyjordan.me) ⭐ ⭐ ⭐ ⭐ ⭐
Recent progress in semantic image segmentation 2018 (arxiv, towardsdatascience) ⭐ ⭐ ⭐ ⭐
A 2017 Guide to Semantic Segmentation Deep Learning Review (blog.qure.ai) ⭐ ⭐ ⭐ ⭐ ⭐
Review popular network architecture (medium-towardds) ⭐ ⭐ ⭐ ⭐ ⭐
Lecture 11 - Detection and Segmentation - CS231n (slide, vid): ⭐ ⭐ ⭐ ⭐ ⭐
A Survey of Semantic Segmentation 2016 (arxiv) ⭐ ⭐ ⭐ ⭐ ⭐

Case studies

Dstl Satellite Imagery Competition, 3rd Place Winners' Interview: Vladimir & Sergey: Blog, Code
Carvana Image Masking Challenge–1st Place Winner's Interview: Blog, Code
Data Science Bowl 2017, Predicting Lung Cancer: Solution Write-up, Team Deep Breath: Blog
MICCAI 2017 Robotic Instrument Segmentation: Code and explain
2018 Data Science Bowl Find the nuclei in divergent images to advance medical discovery: 1st place, 2nd, 3rd, 4th, 5th, 10th
Airbus Ship Detection Challenge: 4th place, 6th

Most used loss functions

Pixel-wise cross entropy loss:
Dice loss: which is pretty nice for balancing dataset
Focal loss:
Lovasz-Softmax loss:

Datasets

Visual Object Classes Challenge 2012 (VOC2012): 400+ classes of real-world data
COCO Dataset: 164k images, 72 classes: 80 thing classes, 91 stuff classes and 1 class 'unlabeled'
Cityscapes: This dataset consists of segmentation ground truths for roads, lanes, vehicles and objects on road. The dataset contains 30 classes and of 50 cities collected over different environmental and weather conditions
PASCAL-Context
ADE20K: 20k+ images
Semantic3d
CamVid
lartpang/awesome-segmentation-saliency-dataset
Kaggle

Frameworks for segmentation

Semantic Segmentation in PyTorch (by yassouali): Semantic segmentation models, datasets and losses implemented in PyTorch.
Semantic Segmentation Suite (by George Seif): Semantic Segmentation Suite in TensorFlow. Implement, train, and test new Semantic Segmentation models easily!
Segmentation Training Pipeline: Research Pipeline for image masking/segmentation in Keras
Tramac/awesome-semantic-segmentation-pytorch Semantic Segmentation on PyTorch (include FCN, PSPNet, Deeplabv3, Deeplabv3+, DANet, DenseASPP, BiSeNet, EncNet, DUNet, ICNet, ENet, OCNet, CCNet, PSANet, CGNet, ESPNet, LEDNet, DFANet)
CSAILVision/semantic-segmentation-pytorch Pytorch implementation for Semantic Segmentation/Scene Parsing on MIT ADE20K dataset
divamgupta/image-segmentation-keras Implementation of Segnet, FCN, UNet , PSPNet and other models in Keras.

Related techniques

Atrous/ Dilated Convolution
Transpose Convolution (Deconvolution, Upconvolution)
Unpooling
A technical report on convolution arithmetic in the context of deep learning
CRF

Feel free to show your ❤️ by giving a star ⭐

🎁 Check Out the List of Contributors - Feel free to add your details here!

List of awesome things around semantic segmentation 🎉

Related tags

Overview

Awesome Semantic Segmentation

List of awesome things around semantic segmentation 🎉

State-Of-The-Art (SOTA) methods of Semantic Segmentation

Variants

Review list of Semantic Segmentation

Case studies

Most used loss functions

Datasets

Frameworks for segmentation

Related techniques

Feel free to show your ❤️ by giving a star ⭐

🎁 Check Out the List of Contributors - Feel free to add your details here!

Owner

Dam Minh Tien

Elucidating Robust Learning with Uncertainty-Aware Corruption Pattern Estimation

Synthesizing and manipulating 2048x1024 images with conditional GANs

Running AlphaFold2 (from ColabFold) in Azure Machine Learning

Train SN-GAN with AdaBelief

Streaming over lightweight data transformations

DeepHawkeye is a library to detect unusual patterns in images using features from pretrained neural networks

QueryDet: Cascaded Sparse Query for Accelerating High-Resolution SmallObject Detection

Official codebase for Decision Transformer: Reinforcement Learning via Sequence Modeling.

Unsupervised clustering of high content screen samples

Paper: De-rendering Stylized Texts

Unofficial PyTorch Implementation for HifiFace (https://arxiv.org/abs/2106.09965)

Official PyTorch implementation of "Edge Rewiring Goes Neural: Boosting Network Resilience via Policy Gradient".

Paddle-Adversarial-Toolbox (PAT) is a Python library for Deep Learning Security based on PaddlePaddle.

The repo of the preprinting paper "Labels Are Not Perfect: Inferring Spatial Uncertainty in Object Detection"

This Deep Learning Model Predicts that from which disease you are suffering.

Full Stack Deep Learning Labs

Repository relating to the CVPR21 paper TimeLens: Event-based Video Frame Interpolation

The official codes for the ICCV2021 Oral presentation "Rethinking Counting and Localization in Crowds: A Purely Point-Based Framework"

TakeInfoatNistforICS - Take Information in NIST NVD for ICS

Code for ICCV 2021 paper "HuMoR: 3D Human Motion Model for Robust Pose Estimation"