Knowledge Distillation Toolbox for Semantic Segmentation

Last update: Dec 12, 2022

Related tags

Overview

SegDistill: Toolbox for Knowledge Distillation on Semantic Segmentation Networks

This repo contains the supported code and configuration files for SegDistill .It is based on mmsegmentaion.

Installation

conda create -n mmcv python=3.8 -y
conda activate mmcv

pip install torch==1.7.1+cu110 torchvision==0.8.2+cu110 torchaudio==0.7.2 -f https://download.pytorch.org/whl/torch_stable.html

pip install mmcv-full==1.2.2 -f https://download.openmmlab.com/mmcv/dist/cu110/torch1.7.0/index.html

pip install future tensorboard
pip install IPython
pip install attr
pip install timm

git clone https://github.com/wzpscott/SegDistill.git -b main
cd SegDistill
pip install -e .

Prepare Data

We conducted experiments on ADE20k dataset. The training and validation set of ADE20K could be download from this link. Test set can be download from here. After downloading the dataset, you need to arrange the structure of your dataset like:

mmsegmentation
├── mmseg
├── tools
├── configs
├── data
│   ├── ade
│   │   ├── ADEChallengeData2016
│   │   │   ├── annotations
│   │   │   │   ├── training
│   │   │   │   ├── validation
│   │   │   ├── images
│   │   │   │   ├── training
│   │   │   │   ├── validation
│   ├── ...

See here for more instructions on data preparation.

Prepare Models

We provide links to pretrained weights of models used in the paper.

Model	Pretrained on ImageNet-1K	Trained on ADE20k
Segformer	link	link
Swin-Transformer	link	link
PSPNet	link	link

Write configs for semantic segmentaion KD

We use mmcv-fashion configs to control the KD process.

Run an example config with the following command:

 bash tools/dist_train.sh distillation_configs/example_config.py {num_gpu}

See here for detailed instructions for custom KD process on various network architectures.

Channel Group Distillation

Our Channel Group Distillation (CGD) considers a more extensive range of correlations inthe activation map and works well fortransformer structures than previous KD methods.

Comparison to Other KD methods

Results on ADE20k

Qualitative segmentation results on ADE20k produced from Segformer B0: (a) raw images, (b) ground truth (GT), (c) outputof the original student model (d) Channel-wise Distillation (CD) and (e) Channel Group Distillation(CGD)

Knowledge Distillation Toolbox for Semantic Segmentation

Related tags

Overview

SegDistill: Toolbox for Knowledge Distillation on Semantic Segmentation Networks

Installation

Prepare Data

Prepare Models

Write configs for semantic segmentaion KD

Channel Group Distillation

Owner

TensorFlow implementation of "TokenLearner: What Can 8 Learned Tokens Do for Images and Videos?"

Deep Learning for humans

Official implementation of "DSP: Dual Soft-Paste for Unsupervised Domain Adaptive Semantic Segmentation"

Transformers based fully on MLPs

[NAACL & ACL 2021] SapBERT: Self-alignment pretraining for BERT.

An Intelligent Self-driving Truck System For Highway Transportation

SelfAugment extends MoCo to include automatic unsupervised augmentation selection.

CROSS-LINGUAL ABILITY OF MULTILINGUAL BERT: AN EMPIRICAL STUDY

StyleGAN2 Webtoon / Anime Style Toonify

Instant Real-Time Example-Based Style Transfer to Facial Videos

Aiming at the common training datsets split, spectrum preprocessing, wavelength select and calibration models algorithm involved in the spectral analysis process

PyTorch implementation of Value Iteration Networks (VIN): Clean, Simple and Modular. Visualization in Visdom.

Official code repository for ICCV 2021 paper: Gravity-Aware Monocular 3D Human Object Reconstruction

Rule Extraction Methods for Interactive eXplainability

Pre-Trained Image Processing Transformer (IPT)

OpenLT: An open-source project for long-tail classification

An implementation of the BADGE batch active learning algorithm.

PSANet: Point-wise Spatial Attention Network for Scene Parsing, ECCV2018.

Official PyTorch implementation and pretrained models of the paper Self-Supervised Classification Network

Face and Pose detector that emits MQTT events when a face or human body is detected and not detected.