The implementation for "Comprehensive Knowledge Distillation with Causal Intervention".

Last update: Nov 03, 2022

Comprehensive Knowledge Distillation with Causal Intervention

This repository is a PyTorch implementation of "Comprehensive Knowledge Distillation with Causal Intervention" (CID). The code is adapted from CRD, and the pretrained teachers (except WRN-40-4) are also taken from CRD.

Requirements

The code was tested with:

Python 3.6
torch 1.2.0
torchvision 0.4.0
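
As a convenience, the tested versions above can be compared against the local environment before running anything. The following is a minimal sketch, not part of the repository: the helper names `installed_versions` and `mismatches` are hypothetical, and a mismatch only means the setup is untested, not that it will fail.

```python
import sys

# Versions listed under "Requirements"; the code was tested with these.
TESTED = {"python": "3.6", "torch": "1.2.0", "torchvision": "0.4.0"}


def installed_versions():
    """Collect installed versions; a missing package is reported as None."""
    found = {"python": "{}.{}".format(*sys.version_info[:2])}
    for name in ("torch", "torchvision"):
        try:
            module = __import__(name)
            # Drop local build suffixes such as "+cu92" before comparing.
            found[name] = module.__version__.split("+")[0]
        except ImportError:
            found[name] = None
    return found


def mismatches(found, tested=TESTED):
    """Return {name: (found, tested)} for every entry that differs."""
    return {k: (found.get(k), v) for k, v in tested.items() if found.get(k) != v}


if __name__ == "__main__":
    for name, (have, want) in mismatches(installed_versions()).items():
        print("{}: found {}, tested with {}".format(name, have, want))
```

Running the file prints one line per differing entry and nothing when the environment matches the tested versions.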

Evaluation

To evaluate our pretrained lightweight student networks, first download the folder "pretrained_student_model" from CID models into the "save" folder, then run the command below:

run evaluate_scripts.sh

Training

To train students from scratch by distilling knowledge from the teacher networks with CID, first download the pretrained teacher folder "models" from CID models into the "save" folder, then run the command below to compress the large models into smaller ones:

run train_scripts.sh
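
Both scripts depend on a downloaded folder being in place, so a small pre-flight check can catch a missing download early. The sketch below is an illustration only and makes several assumptions: that each downloaded folder sits directly inside "save" (that is, `save/pretrained_student_model` and `save/models`), that the scripts are run from the repository root, and that they are invoked with `sh`. The helper names `missing_inputs` and `run_script` are hypothetical.

```python
import os
import subprocess

# Assumed layout: each downloaded folder sits directly inside "save".
REQUIRED = {
    "evaluate_scripts.sh": os.path.join("save", "pretrained_student_model"),
    "train_scripts.sh": os.path.join("save", "models"),
}


def missing_inputs(script, root="."):
    """Return the paths that `script` needs but that are absent under `root`."""
    needed = [script, REQUIRED[script]]
    return [p for p in needed if not os.path.exists(os.path.join(root, p))]


def run_script(script, root="."):
    """Run `script` from `root` once its inputs are in place."""
    missing = missing_inputs(script, root)
    if missing:
        raise FileNotFoundError("missing: " + ", ".join(missing))
    # Invoking the script through `sh` is an assumption, not documented usage.
    return subprocess.run(["sh", script], cwd=root, check=True)
```

For example, `run_script("train_scripts.sh")` raises `FileNotFoundError` naming `save/models` if the teacher folder has not been downloaded yet.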

Citation

If you find this code helpful, please consider citing the paper:

@inproceedings{deng2021comprehensive,
  title={Comprehensive Knowledge Distillation with Causal Intervention},
  author={Deng, Xiang and Zhang, Zhongfei},
  booktitle={Proceedings of the 35th Conference on Neural Information Processing Systems},
  year={2021}
}

Owner

Xiang Deng
