Network Pruning that Matters: A Case Study on Retraining Variants

This repository contains the implementation of the paper Network Pruning that Matters: A Case Study on Retraining Variants.

Duong H. Le, Binh-Son Hua (ICLR 2021)

In this work, we study the behavior of pruned networks under different retraining settings. By leveraging the right learning rate schedule in retraining, we demonstrate a counter-intuitive phenomenon in that randomly pruned networks could even achieve better performance than methodically pruned networks (fine-tuned with the conventional approach) in many scenariors. Our results emphasize the cruciality of the learning rate schedule in pruned network retraining – a detail often overlooked by practioners during the implementation of network pruning.

If you find the paper/code helpful, please cite our paper:

@inproceedings{
le2021network,
title={Network Pruning That Matters:  A Case Study on Retraining Variants},
author={Duong Hoang Le and Binh-Son Hua},
booktitle={International Conference on Learning Representations},
year={2021},
url={https://openreview.net/forum?id=Cb54AMqHQFP}
}

How to Run

To run the code:

Copy the Imagenet/CIFAR-10 dataset to ./data folder
Run init.sh
Download checkpoints here then uncompress it here
Run the desired script in each subfolder.

Acknowledgement

Our implementation is based on the official code of HRank, Taylor Pruning, Soft Filter Pruning, Rethinking.

Network Pruning That Matters: A Case Study on Retraining Variants (ICLR 2021)

Related tags

Overview

Network Pruning that Matters: A Case Study on Retraining Variants

How to Run

Acknowledgement

Owner

Duong H. Le

Morphable Detector for Object Detection on Demand

Towards Representation Learning for Atmospheric Dynamics (AtmoDist)

DeepRec is a recommendation engine based on TensorFlow.

1st Solution For ICDAR 2021 Competition on Mathematical Formula Detection

Norm-based Analysis of Transformer

Set of models for classifcation of 3D volumes

[ICLR'21] FedBN: Federated Learning on Non-IID Features via Local Batch Normalization

python debugger and anti-vm that checks if you're in a virtual machine or if someones trying to debug your file

Face Mask Detection is a project to determine whether someone is wearing mask or not, using deep neural network.

Code & Experiments for "LILA: Language-Informed Latent Actions" to be presented at the Conference on Robot Learning (CoRL) 2021.

Neural Style and MSG-Net

MemStream: Memory-Based Anomaly Detection in Multi-Aspect Streams with Concept Drift

PyTorch implementation of Towards Accurate Alignment in Real-time 3D Hand-Mesh Reconstruction (ICCV 2021).

[NeurIPS 2021 Spotlight] Aligning Pretraining for Detection via Object-Level Contrastive Learning

Collaborative forensic timeline analysis

[NeurIPS'21 Spotlight] PyTorch code for our paper "Aligned Structured Sparsity Learning for Efficient Image Super-Resolution"

CARL provides highly configurable contextual extensions to several well-known RL environments.

Computing Shapley values using VAEAC

An Implicit Function Theorem (IFT) optimizer for bi-level optimizations

A spatial genome aligner for analyzing multiplexed DNA-FISH imaging data.