Efficient Lottery Ticket Finding: Less Data is More

Last update: Sep 04, 2022

Overview

Efficient Lottery Ticket Finding: Less Data is More

Codes for this paper Efficient Lottery Ticket Finding: Less Data is More. [ICML 2021]

Zhenyu Zhang*, Xuxi Chen*, Tianlong Chen*, Zhangyang Wang

Overview

The lottery ticket hypothesis (LTH) reveals the existence of winning tickets (sparse but critical subnetworks) for dense networks, that can be trained in isolation from random initialization to match the latter’s accuracies. However, finding winning tickets requires burdensome computations in the train-prune-retrain process, especially on large-scale datasets (e.g., ImageNet), restricting their practical benefits. This paper explores a new perspective on finding lottery tickets more efficiently, by doing so only with a specially selected subset of data, called Pruning- Aware Critical set (PrAC set), rather than using the full training set. The concept of PrAC set was inspired by the recent observation, that deep networks have samples that are either hard to memorize during training, or easy to forget during pruning. A PrAC set is thus hypothesized to capture those most challenging and informative examples for the dense model. We observe that a high-quality winning ticket can be found with training and pruning the dense network on the very compact PrAC set, which can substantially save training iterations for the ticket finding process.

Prerequisites

Pytorch >= 1.4

torchvision

advertorch

Usage

Vanilla Lottery Tickets

python -u main_imp.py \
	--data data/cifar10 \
	--dataset cifar10 \
	--arch res20s \
	--batch_size 128 \
	--lr 0.1 \
	--pruning_times 16 \
	--prune_type rewind_lt \
	--rewind_epoch 2 \
	--save_dir lt_cifar10_res20s

PrAC Lottery Tickets

python -u main_PrAC_imp.py \
	--data data/cifar10 \
	--dataset cifar10 \
	--arch res20s \
	--split_file npy_files/cifar10-train-val.npy \
	--batch_size 128 \
	--lr 0.1 \
	--pruning_times 16 \
	--eb_eps 0.08 \
	--prune_type rewind_lt \
	--rewind_epoch 2 \
	--threshold 0 \
	--save_dir PrAC_lt_cifar10_res20s

Train subnetworks

python -u main_train.py \
	--data data/cifar10 \
	--dataset cifar10 \
	--arch res20s \
	--batch_size 128 \
	--lr 0.1 \
	--init_dir PrAC_lt_cifar10_res20s/1checkpoint.pth.tar \ 
	--mask_dir PrAC_lt_cifar10_res20s/1checkpoint.pth.tar \ # sparsity=20%
	--save_dir retrain_PrAC_lt_cifar10_res20s/1

Efficient Lottery Ticket Finding: Less Data is More

Related tags

Overview

Efficient Lottery Ticket Finding: Less Data is More

Overview

Prerequisites

Usage

Vanilla Lottery Tickets

PrAC Lottery Tickets

Train subnetworks

Citation

Owner

VITA

Diverse Object-Scene Compositions For Zero-Shot Action Recognition

This repository is for Competition for ML_data class

nnFormer: Interleaved Transformer for Volumetric Segmentation

Cooperative multi-agent reinforcement learning for high-dimensional nonequilibrium control

[ICCV'21] Neural Radiance Flow for 4D View Synthesis and Video Processing

Augmenting Physical Models with Deep Networks for Complex Dynamics Forecasting

Self-Adaptable Point Processes with Nonparametric Time Decays

Road Crack Detection Using Deep Learning Methods

NR-GAN: Noise Robust Generative Adversarial Networks

PyTorch implementations of the paper: "Learning Independent Instance Maps for Crowd Localization"

MINOS: Multimodal Indoor Simulator

A foreign language learning aid using a neural network to predict probability of translating foreign words

Job Assignment System by Real-time Emotion Detection

Supporting code for the paper "Dangers of Bayesian Model Averaging under Covariate Shift"

Multistream CNN for Robust Acoustic Modeling

YOLOv5 🚀 is a family of object detection architectures and models pretrained on the COCO dataset

using STGCN to achieve egg classification task

code for paper "Not All Unlabeled Data are Equal: Learning to Weight Data in Semi-supervised Learning" by Zhongzheng Ren, Raymond A. Yeh, Alexander G. Schwing.

《Train in Germany, Test in The USA: Making 3D Object Detectors Generalize》(CVPR 2020)

Gym for multi-agent reinforcement learning

Efficient Lottery Ticket Finding: Less Data is More

Related tags

Overview

Efficient Lottery Ticket Finding: Less Data is More

Overview

Prerequisites

Usage

Vanilla Lottery Tickets

PrAC Lottery Tickets

Train subnetworks

Citation

Owner

VITA

Diverse Object-Scene Compositions For Zero-Shot Action Recognition

This repository is for Competition for ML_data class

nnFormer: Interleaved Transformer for Volumetric Segmentation

Cooperative multi-agent reinforcement learning for high-dimensional nonequilibrium control

[ICCV'21] Neural Radiance Flow for 4D View Synthesis and Video Processing

Augmenting Physical Models with Deep Networks for Complex Dynamics Forecasting

Self-Adaptable Point Processes with Nonparametric Time Decays

Road Crack Detection Using Deep Learning Methods

NR-GAN: Noise Robust Generative Adversarial Networks

PyTorch implementations of the paper: "Learning Independent Instance Maps for Crowd Localization"

MINOS: Multimodal Indoor Simulator

A foreign language learning aid using a neural network to predict probability of translating foreign words

Job Assignment System by Real-time Emotion Detection

Supporting code for the paper "Dangers of Bayesian Model Averaging under Covariate Shift"

Multistream CNN for Robust Acoustic Modeling

YOLOv5 🚀 is a family of object detection architectures and models pretrained on the COCO dataset

using STGCN to achieve egg classification task

code for paper "Not All Unlabeled Data are Equal: Learning to Weight Data in Semi-supervised Learning" by Zhongzheng Ren*, Raymond A. Yeh*, Alexander G. Schwing.

《Train in Germany, Test in The USA: Making 3D Object Detectors Generalize》(CVPR 2020)

Gym for multi-agent reinforcement learning

code for paper "Not All Unlabeled Data are Equal: Learning to Weight Data in Semi-supervised Learning" by Zhongzheng Ren, Raymond A. Yeh, Alexander G. Schwing.