[ICCV 2021] Code release for "Sub-bit Neural Networks: Learning to Compress and Accelerate Binary Neural Networks"

Last update: Nov 20, 2022

Overview

Sub-bit Neural Networks: Learning to Compress and Accelerate Binary Neural Networks

By Yikai Wang, Yi Yang, Fuchun Sun, Anbang Yao.

This is the PyTorch implementation of our paper "Sub-bit Neural Networks: Learning to Compress and Accelerate Binary Neural Networks", published at ICCV 2021.

Citation

If you find our code useful for your research, please consider citing:

@inproceedings{wang2021snn,
    title={Sub-bit Neural Networks: Learning to Compress and Accelerate Binary Neural Networks},
    author={Wang, Yikai and Yang, Yi and Sun, Fuchun and Yao, Anbang},
    booktitle={International Conference on Computer Vision (ICCV)},
    year={2021}
}

Dataset

Following this repository,

Download the ImageNet dataset from http://www.image-net.org.
Then move validation images to labeled subfolders, using the following script.
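
The script referred to above is not reproduced in this text. As a rough stand-in, the sketch below shows one way the validation images could be sorted into labeled subfolders. It assumes you already have a mapping from each image filename to its class label; the function name sort_val_images and the mapping argument are hypothetical and not part of this repository.

```python
import shutil
from pathlib import Path


def sort_val_images(val_dir, mapping):
    """Move validation images into subfolders named after their labels.

    val_dir: folder that holds the flat list of validation images.
    mapping: dict of image filename -> class label (hypothetical input,
             e.g. built from the ImageNet validation ground-truth file).
    Returns the number of images that were moved.
    """
    val_dir = Path(val_dir)
    moved = 0
    for filename, label in mapping.items():
        src = val_dir / filename
        if not src.is_file():
            continue  # image is missing or was already moved
        dst_dir = val_dir / label
        dst_dir.mkdir(exist_ok=True)
        shutil.move(str(src), str(dst_dir / filename))
        moved += 1
    return moved
```

After sorting, the validation folder has one subfolder per class, which is the layout torchvision's ImageFolder dataset expects.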

Requirements:

Python 3, PyTorch 1.4.0, torchvision 0.5.0

Training

(1) Step 1: binarizing activations (you can skip this step by using our Step 1 model, checkpoint_ba.pth.tar).

Change directory to ./step1.
Run the following script:

CUDA_VISIBLE_DEVICES=0,1,2,3 python train.py --data=path/to/ILSVRC2012/  --batch_size=512 --learning_rate=1e-3 --epochs=256 --weight_decay=1e-5

(2) Step 2: binarizing weights + activations.

Change directory to ./step2.
Create a new folder ./models and copy checkpoint_ba.pth.tar (obtained from Step 1) into it.
Run the following script:

CUDA_VISIBLE_DEVICES=0,1,2,3 python train.py --data=path/to/ILSVRC2012/  --batch_size=512 --learning_rate=1e-3 --epochs=256 --weight_decay=0 --bit-num=5
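
The folder setup for Step 2 (creating ./models and copying the Step 1 checkpoint into it) can also be done from Python before launching training. This is a minimal sketch: the function name stage_checkpoint is hypothetical, and the location of the Step 1 checkpoint is passed in by the caller because this README does not say where Step 1 writes it.

```python
import shutil
from pathlib import Path


def stage_checkpoint(checkpoint_path, step2_dir="./step2"):
    """Copy the Step 1 checkpoint into <step2_dir>/models.

    checkpoint_path: path to checkpoint_ba.pth.tar (supplied by the caller;
                     its location is an assumption, not stated in the README).
    Returns the path of the copied file.
    """
    models_dir = Path(step2_dir) / "models"
    models_dir.mkdir(parents=True, exist_ok=True)  # create ./models if needed
    target = models_dir / "checkpoint_ba.pth.tar"
    shutil.copy2(checkpoint_path, target)
    return target
```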

Comment: --bit-num=5 corresponds to 0.56 bit per weight (bit-num is tau in the paper).
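
The relation between --bit-num and the bit-width can be checked with a short calculation. The sketch below assumes each 3x3 binary kernel (9 weights) is stored as a tau-bit index, so the average cost is tau / 9 bits per weight. That reading matches the stated 5 -> 0.56 and the 0.67 and 0.44 rows of the results table, but the settings 6 and 4 are inferred here rather than stated in this README. The helper name bits_per_weight is hypothetical.

```python
def bits_per_weight(bit_num, kernel_size=3):
    """Average bits per weight if a kernel_size x kernel_size binary kernel
    is stored as a bit_num-bit index (assumption described above)."""
    return bit_num / (kernel_size * kernel_size)


for bit_num in (6, 5, 4):
    print(f"--bit-num={bit_num}: {bits_per_weight(bit_num):.2f} bit per weight")
# prints 0.67, 0.56 and 0.44 bit per weight for bit-num 6, 5 and 4
```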

Results

This implementation is based on ResNet-18 of ReActNet.

Bit-Width	Top1-Acc	Top5-Acc	#Params	Bit-OPs	Model & Log
1W / 1A	65.7%	86.3%	10.99Mbit	1.677G	Google Drive
0.67W / 1A	63.4%	84.5%	7.324Mbit	0.883G	Google Drive
0.56W / 1A	62.1%	83.8%	6.103Mbit	0.501G	Google Drive
0.44W / 1A	60.7%	82.7%	4.882Mbit	0.297G	Google Drive
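
To read the table as savings relative to the 1W / 1A baseline, the ratios can be computed directly from the numbers above. This is only a convenience sketch over the reported values; the names ROWS and relative_cost are hypothetical.

```python
# (#Params in Mbit, Bit-OPs in G), copied from the results table.
ROWS = {
    "1W / 1A": (10.99, 1.677),
    "0.67W / 1A": (7.324, 0.883),
    "0.56W / 1A": (6.103, 0.501),
    "0.44W / 1A": (4.882, 0.297),
}


def relative_cost(label, baseline="1W / 1A"):
    """Return (params ratio, Bit-OPs ratio) of a row versus the baseline row."""
    params, ops = ROWS[label]
    base_params, base_ops = ROWS[baseline]
    return params / base_params, ops / base_ops


for label in ROWS:
    p, o = relative_cost(label)
    print(f"{label}: {p:.1%} of baseline params, {o:.1%} of baseline Bit-OPs")
```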

License

SNN is released under the MIT License.
