Implementation of "RaScaNet: Learning Tiny Models by Raster-Scanning Image" from CVPR 2021.

Last update: Dec 26, 2022

Related tags

Overview

RaScaNet: Learning Tiny Models by Raster-Scanning Images

Deploying deep convolutional neural networks on ultra-low power systems is challenging, because the systems put a hard limit on the size of on-chip memory. To overcome this drawback, we propose a novel Raster-Scanning Network, named RaScaNet, inspired by raster-scanning in image sensors.

RaScaNet reads only a few rows of pixels at a time using a convolutional neural network and then sequentially learns the representation of the whole image using a recurrent neural network. The proposed method requires 15.9-24.3x smaller peak memory and 5.3-12.9x smaller weight memory than the state-of-the-art tiny models. The total memory usage of RaScaNet does not exceed 60 KB, in the VWW dataset with competitive accuracy.

Conference: CVPR 2021
Paper | Video | Citation

Requirements

python 3.6
torch 1.7.0
torchvision 0.8.1
pycocotools 2.0.1
numpy 0.19.0
VWW dataset

Usage

For running the model, (only support vww dataset)

python test.py --dataset='vww' --dataset_path={dataset_path} --rsz_w=240 --model_path=checkpoint/rascanet_210x240.pth.tar
python test.py --dataset='vww' --dataset_path={dataset_path} --rsz_w=120 --model_path=checkpoint/rascanet_105x120.pth.tar

With early termination,

python test.py --dataset='vww' --dataset_path={dataset_path} --rsz_w=240 --model_path=checkpoint/rascanet_210x240.pth.tar --early_terminate=1
python test.py --dataset='vww' --dataset_path={dataset_path} --rsz_w=120 --model_path=checkpoint/rascanet_105x120.pth.tar --early_terminate=1

Currently, we do not provide the code for training.

Result

Model	Weight Memory	Peak Memory	OPs Cnt.	Accuracy
rascanet(210x240)	47.03 KB	7.92 KB	56.34 M	91.835%
rascanet(105x120)	31.77 KB	3.60 KB	9.71 M	88.100%

Citation

@InProceedings{Yoo_2021_CVPR,
    author    = {Yoo, Jaehyoung and Lee, Dongwook and Son, Changyong and Jung, Sangil and Yoo, ByungIn and Choi, Changkyu and Han, Jae-Joon and Han, Bohyung},
    title     = {RaScaNet: Learning Tiny Models by Raster-Scanning Images},
    booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)},
    month     = {June},
    year      = {2021},
    pages     = {13673-13682}
}

License

Copyright (C) 2021 Samsung Electronics Co. LTD

This software is a property of Samsung Electronics.
No part of this software, either material or conceptual may be copied or distributed, transmitted,
transcribed, stored in a retrieval system or translated into any human or computer language in any form by any means,
electronic, mechanical, manual or otherwise, or disclosed
to third parties without the express written permission of Samsung Electronics.
(Use of the Software is restricted to non-commercial, personal or academic, research purpose only)

Implementation of "RaScaNet: Learning Tiny Models by Raster-Scanning Image" from CVPR 2021.

Related tags

Overview

RaScaNet: Learning Tiny Models by Raster-Scanning Images

Requirements

Usage

Result

Citation

License

Owner

SAIT (Samsung Advanced Institute of Technology)

From Perceptron model to Deep Neural Network from scratch in Python.

My implementation of Image Inpainting - A deep learning Inpainting model

The code for our paper Semi-Supervised Learning with Multi-Head Co-Training

PyTorch implementation of ShapeConv: Shape-aware Convolutional Layer for RGB-D Indoor Semantic Segmentation.

Uses OpenCV and Python Code to detect a face on the screen

adversarial_multi_armed_bandit_variable_plays

This is the pytorch implementation for the paper: Learning Accurate Performance Predictors for Ultrafast Automated Model Compression, which is in submission to TPAMI

Official repository for the paper "Going Beyond Linear Transformers with Recurrent Fast Weight Programmers"

A framework for the elicitation, specification, formalization and understanding of requirements.

This repo contains the code required to train the multivariate time-series Transformer.

Official implementation of the ICLR 2021 paper

An attempt at the implementation of Glom, Geoffrey Hinton's new idea that integrates neural fields, predictive coding, top-down-bottom-up, and attention (consensus between columns)

ImageNet-CoG is a benchmark for concept generalization. It provides a full evaluation framework for pre-trained visual representations which measure how well they generalize to unseen concepts.

PyTorch CZSL framework containing GQA, the open-world setting, and the CGE and CompCos methods.

Codes for our IJCAI21 paper: Dialogue Discourse-Aware Graph Model and Data Augmentation for Meeting Summarization

EquiBind: Geometric Deep Learning for Drug Binding Structure Prediction

Graph Convolutional Networks for Temporal Action Localization (ICCV2019)

Awesome Long-Tailed Learning

The code for "Deep Level Set for Box-supervised Instance Segmentation in Aerial Images".

[ICLR 2021] HW-NAS-Bench: Hardware-Aware Neural Architecture Search Benchmark

Implementation of "RaScaNet: Learning Tiny Models by Raster-Scanning Image" from CVPR 2021.

Related tags

Overview

RaScaNet: Learning Tiny Models by Raster-Scanning Images

Requirements

Usage

Result

Citation

License

Owner

SAIT (Samsung Advanced Institute of Technology)

From Perceptron model to Deep Neural Network from scratch in Python.

My implementation of Image Inpainting - A deep learning Inpainting model

The code for our paper Semi-Supervised Learning with Multi-Head Co-Training

PyTorch implementation of ShapeConv: Shape-aware Convolutional Layer for RGB-D Indoor Semantic Segmentation.

Uses OpenCV and Python Code to detect a face on the screen

adversarial_multi_armed_bandit_variable_plays

This is the pytorch implementation for the paper: *Learning Accurate Performance Predictors for Ultrafast Automated Model Compression*, which is in submission to TPAMI

Official repository for the paper "Going Beyond Linear Transformers with Recurrent Fast Weight Programmers"

A framework for the elicitation, specification, formalization and understanding of requirements.

This repo contains the code required to train the multivariate time-series Transformer.

Official implementation of the ICLR 2021 paper

An attempt at the implementation of Glom, Geoffrey Hinton's new idea that integrates neural fields, predictive coding, top-down-bottom-up, and attention (consensus between columns)

ImageNet-CoG is a benchmark for concept generalization. It provides a full evaluation framework for pre-trained visual representations which measure how well they generalize to unseen concepts.

PyTorch CZSL framework containing GQA, the open-world setting, and the CGE and CompCos methods.

Codes for our IJCAI21 paper: Dialogue Discourse-Aware Graph Model and Data Augmentation for Meeting Summarization

EquiBind: Geometric Deep Learning for Drug Binding Structure Prediction

Graph Convolutional Networks for Temporal Action Localization (ICCV2019)

Awesome Long-Tailed Learning

The code for "Deep Level Set for Box-supervised Instance Segmentation in Aerial Images".

[ICLR 2021] HW-NAS-Bench: Hardware-Aware Neural Architecture Search Benchmark

This is the pytorch implementation for the paper: Learning Accurate Performance Predictors for Ultrafast Automated Model Compression, which is in submission to TPAMI