This repo uses a combination of logits and feature distillation method to teach the PSPNet model of ResNet18 backbone with the PSPNet model of ResNet50 backbone. All the models are trained and tested on the PASCAL-VOC2012 dataset.

Last update: Dec 01, 2022

Overview

PSPNet-logits and feature-distillation

Introduction

This repository is based on PSPNet and modified from semseg and Pixelwise_Knowledge_Distillation_PSPNet18 which uses a logits knowledge distillation method to teach the PSPNet model of ResNet18 backbone with the PSPNet model of ResNet50 backbone. All the models are trained and tested on the PASCAL-VOC2012 dataset(Enhanced Version).

Innovation and Limitations

This repo adds a feature distillation in the aux layer of PSPNet without a linear feature mapping since the teacher and student model's output dimension after the aux layer is the same. On the other hand, if you want to adapt this repo to other structures, a mapping should be needed. Also, the output of the aux layer is very close to which of the final layer, so you should pay attention to the overfitting problem. Or you can distillate the features in earlier layers and add a mapping, of course, just like Fitnet.

For reimplementation

Please download related datasets and symlink the relevant paths. The temperature parameter(T) and corresponding weights can be changed flexibly. All the numbers showed in the name of python code indicate the number of layers; for instance, train_50_18.py represents the distillation of 50 layers to 18 layers.

Please note that you should train a teacher model( PSPNet model of ResNet50 backbone) at first, and save the checkpoints or just use a well trained PSPNet50 model, which you can refer to the original public code at semseg, and you should download the initial models and corresponding lists in semseg and put them in right paths, also all the environmental requirements in this repo are the same as semseg.

Usage

Requirement: PyTorch>=1.1.0, Python3, tensorboardX, GPU
Clone the repository:

git clone https://github.com/asaander719/PSPNet-knowledge-distillation.git

Download initialization models and lists, also trained models and predictions can be optional, by the link shows in semseg, and put them in files followed by instructions.
Download official dataset PASCAL-VOC2012, please note that it is Enhanced Version,and put them in corresponding paths follwed by data lists.
Train and test a teacher model: adjust parameters in config (voc2012_pspnet50.yaml), like layers. etc.., and the checkpoints will be saved automaticly, or you can just download a trained model, and put it in a right path.

python train_50.py

python test_50.py

Train and test a student model(optional, only for comparison): adjust parameters in config (voc2012_pspnet18.yaml), like layers. etc.., and the checkpoints will be saved automaticly, or you can just download a trained model, and put it in a right path.

python train_18.py

python test_18.py

Distillation and Test: the results should between the teacher and the student model.

Please note that you should adjust some parameters when you use fuctions in the file named model.

python train_50_18_my.py

python test_50_18.py

Reference

@misc{semseg2019, author={Zhao, Hengshuang}, title={semseg}, howpublished={\url{https://github.com/hszhao/semseg}}, year={2019} }

@inproceedings{zhao2017pspnet, title={Pyramid Scene Parsing Network}, author={Zhao, Hengshuang and Shi, Jianping and Qi, Xiaojuan and Wang, Xiaogang and Jia, Jiaya}, booktitle={CVPR}, year={2017} }

@inproceedings{zhao2018psanet, title={{PSANet}: Point-wise Spatial Attention Network for Scene Parsing}, author={Zhao, Hengshuang and Zhang, Yi and Liu, Shu and Shi, Jianping and Loy, Chen Change and Lin, Dahua and Jia, Jiaya}, booktitle={ECCV}, year={2018} }

This repo uses a combination of logits and feature distillation method to teach the PSPNet model of ResNet18 backbone with the PSPNet model of ResNet50 backbone. All the models are trained and tested on the PASCAL-VOC2012 dataset.

Related tags

Overview

PSPNet-logits and feature-distillation

Introduction

Innovation and Limitations

For reimplementation

Usage

Reference

Owner

LIAO Shuiying

Img-process-manual - Utilize Python Numpy and Matplotlib to realize OpenCV baisc image processing function

PyTorch image models, scripts, pretrained weights -- ResNet, ResNeXT, EfficientNet, EfficientNetV2, NFNet, Vision Transformer, MixNet, MobileNet-V3/V2, RegNet, DPN, CSPNet, and more

A Bayesian cognition approach for belief updating of correlation judgement through uncertainty visualizations

Vector Quantized Diffusion Model for Text-to-Image Synthesis

disentanglement_lib is an open-source library for research on learning disentangled representations.

City-seeds - A random generator of cultural characteristics intended to spark ideas and help draw threads

Code repository for our paper "Learning to Generate Scene Graph from Natural Language Supervision" in ICCV 2021

The official PyTorch code implementation of "Personalized Trajectory Prediction via Distribution Discrimination" in ICCV 2021.

FCA: Learning a 3D Full-coverage Vehicle Camouflage for Multi-view Physical Adversarial Attack

Awesome AI Learning with +100 AI Cheat-Sheets, Free online Books, Top Courses, Best Videos and Lectures, Papers, Tutorials, +99 Researchers, Premium Websites, +121 Datasets, Conferences, Frameworks, Tools

Learning RGB-D Feature Embeddings for Unseen Object Instance Segmentation

Stacs-ci - A set of modules to enable integration of STACS with commonly used CI / CD systems

Implementation of Multistream Transformers in Pytorch

Post-training Quantization for Neural Networks with Provable Guarantees

Generative code template for PixelBeasts 10k NFT project.

Symmetry and Uncertainty-Aware Object SLAM for 6DoF Object Pose Estimation

On the Analysis of French Phonetic Idiosyncrasies for Accent Recognition

A new framework, collaborative cascade prediction based on graph neural networks (CCasGNN) to jointly utilize the structural characteristics, sequence features, and user profiles.

Code for paper "Which Training Methods for GANs do actually Converge? (ICML 2018)"

THIS IS THE OLD PYMC PROJECT. PLEASE USE PYMC3 INSTEAD:

This repo uses a combination of logits and feature distillation method to teach the PSPNet model of ResNet18 backbone with the PSPNet model of ResNet50 backbone. All the models are trained and tested on the PASCAL-VOC2012 dataset.

Related tags

Overview

PSPNet-logits and feature-distillation

Introduction

Innovation and Limitations

For reimplementation

Usage

Reference

Owner

LIAO Shuiying

Img-process-manual - Utilize Python Numpy and Matplotlib to realize OpenCV baisc image processing function

PyTorch image models, scripts, pretrained weights -- ResNet, ResNeXT, EfficientNet, EfficientNetV2, NFNet, Vision Transformer, MixNet, MobileNet-V3/V2, RegNet, DPN, CSPNet, and more

A Bayesian cognition approach for belief updating of correlation judgement through uncertainty visualizations

Vector Quantized Diffusion Model for Text-to-Image Synthesis

disentanglement_lib is an open-source library for research on learning disentangled representations.

City-seeds - A random generator of cultural characteristics intended to spark ideas and help draw threads

Code repository for our paper "Learning to Generate Scene Graph from Natural Language Supervision" in ICCV 2021

The official PyTorch code implementation of "Personalized Trajectory Prediction via Distribution Discrimination" in ICCV 2021.

FCA: Learning a 3D Full-coverage Vehicle Camouflage for Multi-view Physical Adversarial Attack

Awesome AI Learning with +100 AI Cheat-Sheets, Free online Books, Top Courses, Best Videos and Lectures, Papers, Tutorials, +99 Researchers, Premium Websites, +121 Datasets, Conferences, Frameworks, Tools

Learning RGB-D Feature Embeddings for Unseen Object Instance Segmentation

Stacs-ci - A set of modules to enable integration of STACS with commonly used CI / CD systems

Implementation of Multistream Transformers in Pytorch

Post-training Quantization for Neural Networks with Provable Guarantees

Generative code template for PixelBeasts 10k NFT project.

Symmetry and Uncertainty-Aware Object SLAM for 6DoF Object Pose Estimation

On the Analysis of French Phonetic Idiosyncrasies for Accent Recognition

A new framework, collaborative cascade prediction based on graph neural networks (CCasGNN) to jointly utilize the structural characteristics, sequence features, and user profiles.

Code for paper "Which Training Methods for GANs do actually Converge? (ICML 2018)"

THIS IS THE **OLD** PYMC PROJECT. PLEASE USE PYMC3 INSTEAD:

THIS IS THE OLD PYMC PROJECT. PLEASE USE PYMC3 INSTEAD: