AMTML-KD: Adaptive Multi-teacher Multi-level Knowledge Distillation

Last update: Oct 13, 2022

Related tags

Deep Learning AMTML-KD-code

Overview

Adaptive Multi-Teacher Multi-level Knowledge Distillation(AMTML-KD)

Paper has been accepted by Neurocomputing 415(2020): 106–113.

Authors: Yuang Liu, Wei Zhang and Jun Wang.

Links: [ pdf ] [ code ]

Requirements

PyTorch >= 1.0.0
Jupyter
visdom

Introduction

Knowledge distillation (KD) is an effective learning paradigm for improving the performance of light-weight student networks by utilizing additional supervision knowledge distilled from teacher networks. Most pioneering studies either learn from only a single teacher in their distillation learning methods, neglecting the potential that a student can learn from multiple teachers simultaneously, or simply treat each teacher to be equally important, unable to reveal the different importance of teachers for specific examples. To bridge this gap, we propose a novel adaptive multi-teacher multi-level knowledge distillation learning framework (AMTML-KD), which consists two novel insights: (i) associating each teacher with a latent representation to adaptively learn instance-level teacher importance weights which are leveraged for acquiring integrated soft-targets (high-level knowledge) and (ii) enabling the intermediate-level hints (intermediate-level knowledge) to be gathered from multiple teachers by the proposed multi-group hint strategy. As such, a student model can learn multi-level knowledge from multiple teachers through AMTML-KD. Extensive results on publicly available datasets demonstrate the proposed learning framework ensures student to achieve improved performance than strong competitors.

Citation

@article{LIU2020106,
    title = {Adaptive multi-teacher multi-level knowledge distillation},
    author = {Yuang Liu and Wei Zhang and Jun Wang},
    journal = {Neurocomputing},
    volume = {415},
    pages = {106 -- 113},
    year = {2020},
    issn = {0925 -- 2312},
}

AMTML-KD: Adaptive Multi-teacher Multi-level Knowledge Distillation

Related tags

Overview

Adaptive Multi-Teacher Multi-level Knowledge Distillation(AMTML-KD)

Requirements

Introduction

Citation

Owner

Frank Liu

This repository stores the code to reproduce the results published in "TiWS-iForest: Isolation Forest in Weakly Supervised and Tiny ML scenarios"

A flexible ML framework built to simplify medical image reconstruction and analysis experimentation.

A Pytorch Implementation of Domain adaptation of object detector using scissor-like networks

A PyTorch implementation of "CoAtNet: Marrying Convolution and Attention for All Data Sizes".

“英特尔创新大师杯”深度学习挑战赛赛道3：CCKS2021中文NLP地址相关性任务

SOTR: Segmenting Objects with Transformers [ICCV 2021]

Prompt Tuning with Rules

Official implementation of TMANet.

DTCN SMP Challenge - Sequential prediction learning framework and algorithm

Self-Supervised Learning with Kernel Dependence Maximization

[CVPR 2021] Modular Interactive Video Object Segmentation: Interaction-to-Mask, Propagation and Difference-Aware Fusion

Implementation of Basic Machine Learning Algorithms on small datasets using Scikit Learn.

This project is a loose implementation of paper "Algorithmic Financial Trading with Deep Convolutional Neural Networks: Time Series to Image Conversion Approach"

Code for classifying international patents based on the text of their titles/abstracts

Pytorch cuda extension of grid_sample1d

Frequency Domain Image Translation: More Photo-realistic, Better Identity-preserving

Exploit ILP to learn symmetry breaking constraints of ASP programs.

Provide partial dates and retain the date precision through processing

Leveraging Instance-, Image- and Dataset-Level Information for Weakly Supervised Instance Segmentation

This repository is the official implementation of Using Time-Series Privileged Information for Provably Efficient Learning of Prediction Models

AMTML-KD: Adaptive Multi-teacher Multi-level Knowledge Distillation

Related tags

Overview

Adaptive Multi-Teacher Multi-level Knowledge Distillation(AMTML-KD)

Requirements

Introduction

Citation

Owner

Frank Liu

This repository stores the code to reproduce the results published in "TiWS-iForest: Isolation Forest in Weakly Supervised and Tiny ML scenarios"

A flexible ML framework built to simplify medical image reconstruction and analysis experimentation.

A Pytorch Implementation of Domain adaptation of object detector using scissor-like networks

A PyTorch implementation of "CoAtNet: Marrying Convolution and Attention for All Data Sizes".

“英特尔创新大师杯”深度学习挑战赛 赛道3：CCKS2021中文NLP地址相关性任务

SOTR: Segmenting Objects with Transformers [ICCV 2021]

Prompt Tuning with Rules

Official implementation of TMANet.

DTCN SMP Challenge - Sequential prediction learning framework and algorithm

Self-Supervised Learning with Kernel Dependence Maximization

[CVPR 2021] Modular Interactive Video Object Segmentation: Interaction-to-Mask, Propagation and Difference-Aware Fusion

Implementation of Basic Machine Learning Algorithms on small datasets using Scikit Learn.

This project is a loose implementation of paper "Algorithmic Financial Trading with Deep Convolutional Neural Networks: Time Series to Image Conversion Approach"

Code for classifying international patents based on the text of their titles/abstracts

Pytorch cuda extension of grid_sample1d

Frequency Domain Image Translation: More Photo-realistic, Better Identity-preserving

Exploit ILP to learn symmetry breaking constraints of ASP programs.

Provide partial dates and retain the date precision through processing

Leveraging Instance-, Image- and Dataset-Level Information for Weakly Supervised Instance Segmentation

This repository is the official implementation of Using Time-Series Privileged Information for Provably Efficient Learning of Prediction Models

“英特尔创新大师杯”深度学习挑战赛赛道3：CCKS2021中文NLP地址相关性任务