Learning recognition/segmentation models without end-to-end training. 40%-60% less GPU memory footprint. Same training time. Better performance.

Last update: Dec 27, 2022

Related tags

Deep Learning InfoPro-Pytorch

Overview

InfoPro-Pytorch

The Information Propagation algorithm for training deep networks with local supervision.

(ICLR 2021) Revisiting Locally Supervised Learning: an Alternative to End-to-end Training

Update on 2021/01/25: Release Pre-trained models on ImageNet and Cityscapes.

Update on 2021/01/24: Release Code for Image Classification on CIFAR/SVHN/STL10/ImageNet and Semantic Segmentation on Cityscapes.

Introduction

We propose Information Propagation (InfoPro), a locally supervised deep learning algorithm, from the information-theoretic perspective. By splitting the whole deep network into multiple local modules and training them with local InfoPro loss, we reduce the GPU memory footprint by 40-60% without introducing notable extra computational cost or training time, but improve the performance moderately.

Citation

If you find this work valuable or use our code in your own research, please consider citing us with the following bibtex:

@inproceedings{wang2021revisiting,
        title = {Revisiting Locally Supervised Learning: an Alternative to End-to-end Training},
       author = {Yulin Wang and Zanlin Ni and Shiji Song and Le Yang and Gao Huang},
    booktitle = {International Conference on Learning Representations (ICLR)},
         year = {2021},
          url = {https://openreview.net/forum?id=fAbkE6ant2}
}

Get Started

Please go to the folder Experiments on CIFAR-SVHN-STL10, Experiments on ImageNet and Semantic segmentation for specific docs.

Results

CIFAR & STL-10

ImageNet

Semantic Segmentation

GPU Memory Cost

In the paper, we report the minimally required GPU memory to run the InfoPro* algorithm with torch.backends.cudnn.benchmark=True (for practical acceleration). Note that this result is (sometimes largely) different from what is printed by nvidia-smi.

Contact

This repo is a re-implementation of our original code. If you have any question, please feel free to contact the authors. Yulin Wang: [email protected].

Acknowledgments

Our code of Semantic Segmentation is from MMSegmentation. We highly appreciate their awesome work!

Learning recognition/segmentation models without end-to-end training. 40%-60% less GPU memory footprint. Same training time. Better performance.

Related tags

Overview

InfoPro-Pytorch

Introduction

Citation

Get Started

Results

GPU Memory Cost

Contact

Acknowledgments

Owner

Keyword-BERT: Keyword-Attentive Deep Semantic Matching

The Simplest DCGAN Implementation

A large-scale video dataset for the training and evaluation of 3D human pose estimation models

Annotate with anyone, anywhere.

Finding Donors for CharityML

Channel Pruning for Accelerating Very Deep Neural Networks (ICCV'17)

Evaluation suite for large-scale language models.

[ICLR 2021] Rank the Episodes: A Simple Approach for Exploration in Procedurally-Generated Environments.

A Python library for Deep Probabilistic Modeling

Contrastive Feature Loss for Image Prediction

Code for the paper: Sketch Your Own GAN

Code for Reciprocal Adversarial Learning for Brain Tumor Segmentation: A Solution to BraTS Challenge 2021 Segmentation Task

Dynamic Neural Representational Decoders for High-Resolution Semantic Segmentation

A Python package for generating concise, high-quality summaries of a probability distribution

A copy of Ares that costs 30 fucking dollars.

Source code for the BMVC-2021 paper "SimReg: Regression as a Simple Yet Effective Tool for Self-supervised Knowledge Distillation".

[IROS2021] NYU-VPR: Long-Term Visual Place Recognition Benchmark with View Direction and Data Anonymization Influences

Deploy recommendation engines with Edge Computing

Install alphafold on the local machine, get out of docker.

Extremely easy multi instancing software for minecraft speedrunning.