Official implementation for (Show, Attend and Distill: Knowledge Distillation via Attention-based Feature Matching, AAAI-2021)

Last update: Dec 16, 2022

Overview

Show, Attend and Distill: Knowledge Distillation via Attention-based Feature Matching

Official pytorch implementation of "Show, Attend and Distill: Knowledge Distillation via Attention-based Feature Matching" (AAAI-2021)

Requirements

Python3
PyTorch (> 1.2.0)
torchvision
numpy
Pillow

Training

We include a trained WRN-40-2 parameters at /trained/wrn40x2/model.pth.
Run main.py with student network as WRN-16-2 and teacher as WRN-40-2 to reproduce experiment result on CIFAR100.

python main.py --data_dir PATH_TO_DATA --data CIFAR100 --trained_dir /trained/wrn40x2/model.pth\
 --model wrn16x2 --model_t wrn40x2 --beta 200

License

Copyright 2021-present NAVER Corp.

Licensed under the Apache License, Version 2.0 (the "License");
you may not use this file except in compliance with the License.
You may obtain a copy of the License at

    http://www.apache.org/licenses/LICENSE-2.0

Unless required by applicable law or agreed to in writing, software
distributed under the License is distributed on an "AS IS" BASIS,
WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
See the License for the specific language governing permissions and
limitations under the License.

Official implementation for (Show, Attend and Distill: Knowledge Distillation via Attention-based Feature Matching, AAAI-2021)

Related tags

Overview

Show, Attend and Distill: Knowledge Distillation via Attention-based Feature Matching

Requirements

Training

License

Owner

Clova AI Research

Real-world Anomaly Detection in Surveillance Videos- pytorch Re-implementation

Official implementation for TTT++: When Does Self-supervised Test-time Training Fail or Thrive

Predict stock movement with Machine Learning and Deep Learning algorithms

Pose estimation with MoveNet Lightning

Toward Multimodal Image-to-Image Translation

This Artificial Intelligence program can take a black and white/grayscale image and generate a realistic or plausible colorized version of the same picture.

Jetson Nano-based smart camera system that measures crowd face mask usage in real-time.

Fully Convolutional Networks for Semantic Segmentation by Jonathan Long, Evan Shelhamer, and Trevor Darrell. CVPR 2015 and PAMI 2016.

PlaidML is a framework for making deep learning work everywhere.

This is the source code for: Context-aware Entity Typing in Knowledge Graphs.

EMNLP 2021: Single-dataset Experts for Multi-dataset Question-Answering

Datasets and source code for our paper Webly Supervised Fine-Grained Recognition: Benchmark Datasets and An Approach

Texture mapping with variational auto-encoders

Python interface for the DIGIT tactile sensor

This repo provides a demo for the CVPR 2021 paper "A Fourier-based Framework for Domain Generalization" on the PACS dataset.

A Light in the Dark: Deep Learning Practices for Industrial Computer Vision

Machine Translation Implement By Bi-GRU And Transformer

ColossalAI-Benchmark - Performance benchmarking with ColossalAI

The official implementation of the CVPR2021 paper: Decoupled Dynamic Filter Networks

Riemannian Convex Potential Maps

Official implementation for (Show, Attend and Distill: Knowledge Distillation via Attention-based Feature Matching, AAAI-2021)

Related tags

Overview

Show, Attend and Distill: Knowledge Distillation via Attention-based Feature Matching

Requirements

Training

License

Owner

Clova AI Research

Real-world Anomaly Detection in Surveillance Videos- pytorch Re-implementation

Official implementation for TTT++: When Does Self-supervised Test-time Training Fail or Thrive

Predict stock movement with Machine Learning and Deep Learning algorithms

Pose estimation with MoveNet Lightning

Toward Multimodal Image-to-Image Translation

This Artificial Intelligence program can take a black and white/grayscale image and generate a realistic or plausible colorized version of the same picture.

Jetson Nano-based smart camera system that measures crowd face mask usage in real-time.

Fully Convolutional Networks for Semantic Segmentation by Jonathan Long*, Evan Shelhamer*, and Trevor Darrell. CVPR 2015 and PAMI 2016.

PlaidML is a framework for making deep learning work everywhere.

This is the source code for: Context-aware Entity Typing in Knowledge Graphs.

EMNLP 2021: Single-dataset Experts for Multi-dataset Question-Answering

Datasets and source code for our paper Webly Supervised Fine-Grained Recognition: Benchmark Datasets and An Approach

Texture mapping with variational auto-encoders

Python interface for the DIGIT tactile sensor

This repo provides a demo for the CVPR 2021 paper "A Fourier-based Framework for Domain Generalization" on the PACS dataset.

A Light in the Dark: Deep Learning Practices for Industrial Computer Vision

Machine Translation Implement By Bi-GRU And Transformer

ColossalAI-Benchmark - Performance benchmarking with ColossalAI

The official implementation of the CVPR2021 paper: Decoupled Dynamic Filter Networks

Riemannian Convex Potential Maps

Fully Convolutional Networks for Semantic Segmentation by Jonathan Long, Evan Shelhamer, and Trevor Darrell. CVPR 2015 and PAMI 2016.