Code repository for "Reducing Underflow in Mixed Precision Training by Gradient Scaling" presented at IJCAI '20

Last update: Apr 14, 2022

Overview

Reducing Underflow in Mixed Precision Training by Gradient Scaling

This project implements the gradient scaling method to improve the performance of mixed precision training.

The old repository: https://github.com/ada-loss/ada-loss

@inproceedings{ijcai2020-404,
  title     = {Reducing Underflow in Mixed Precision Training by Gradient Scaling},
  author    = {Zhao, Ruizhe and Vogel, Brian and Ahmed, Tanvir and Luk, Wayne},
  booktitle = {Proceedings of the Twenty-Ninth International Joint Conference on
               Artificial Intelligence, {IJCAI-20}},
  publisher = {International Joint Conferences on Artificial Intelligence Organization},             
  editor    = {Christian Bessiere}	
  pages     = {2922--2928},
  year      = {2020},
  month     = {7},
  note      = {Main track}
  doi       = {10.24963/ijcai.2020/404},
  url       = {https://doi.org/10.24963/ijcai.2020/404},
}

Introduction

Loss scaling is a technique that scales up loss values to mitigate underflow caused by low precision data representation in backpropagated activation gradients. The original implementation uses a fixed loss scale value predetermined before training starts for all layers, which may not be optimal since the statistics of gradients change across layers and training epochs. Instead, our method calculates the loss scale value for each layer based on their runtime statistics.

Installation

We are using Anaconda to manage package dependencies:

conda create -f environment.yml
conda activate ada_loss

To install this project, please consider using this command:

pip install -e . # in the project root

Project structure

The structure of this project is as follows: the core of the adaptive loss scaling method is implemented in the ada_loss package; chainerlp provides the implementation of some baseline models; and models includes third party implementation of more complicated baseline models.

Usage

Example usage for chainer (other frameworks will be released later):

from ada_loss.chainer import AdaLossScaled
from ada_loss.chainer import transforms

# transform your link to support adaptive loss scaling
link = AdaLossScaled(link, transforms=[
    transforms.AdaLossTransformLinear(),
    transforms.AdaLossTransformConvolution2D(),
    # ...
])

It tries to convert links within the given link to ones that supports adaptive loss scaling based on the provided list of transforms. Adaptive loss scaled links are located under ada_loss.chainer.links. Transforms are extended based on AdaLossTransform in ada_loss.chainer.transforms.base and stored under ada_loss.chainer.transforms. For now, users are required to go through their link and specify explicitly transforms that should be taken.

Examples

Examples are located here.

Testing

Tests can be launched by calling pytest. Some tests are specified to be run on GPUs.

Code repository for "Reducing Underflow in Mixed Precision Training by Gradient Scaling" presented at IJCAI '20

Related tags

Overview

Reducing Underflow in Mixed Precision Training by Gradient Scaling

Introduction

Installation

Project structure

Usage

Examples

Testing

Owner

Ruizhe Zhao

Pytorch implemenation of Stochastic Multi-Label Image-to-image Translation (SMIT)

Airborne Optical Sectioning (AOS) is a wide synthetic-aperture imaging technique

This is an implementation for the CVPR2020 paper "Learning Invariant Representation for Unsupervised Image Restoration"

A Comprehensive Empirical Study of Vision-Language Pre-trained Model for Supervised Cross-Modal Retrieval

Official implementation of the paper WAV2CLIP: LEARNING ROBUST AUDIO REPRESENTATIONS FROM CLIP

Automatic differentiation with weighted finite-state transducers.

EfficientDet (Scalable and Efficient Object Detection) implementation in Keras and Tensorflow

MVP Benchmark for Multi-View Partial Point Cloud Completion and Registration

Twin-deep neural network for semi-supervised learning of materials properties

4D Human Body Capture from Egocentric Video via 3D Scene Grounding

7th place solution of Human Protein Atlas - Single Cell Classification on Kaggle

Neural Nano-Optics for High-quality Thin Lens Imaging

Learning to trade under the reinforcement learning framework

A curated list of long-tailed recognition resources.

Neural implicit reconstruction experiments for the Vector Neuron paper

Code for "Neural 3D Scene Reconstruction with the Manhattan-world Assumption" CVPR 2022 Oral

Toontown: Galaxy, a new Toontown game based on Disney's Toontown Online

yolov5 deepsort 行人车辆跟踪检测计数

Google Landmark Recogntion and Retrieval 2021 Solutions

Boundary IoU API (Beta version)

Code repository for "Reducing Underflow in Mixed Precision Training by Gradient Scaling" presented at IJCAI '20

Related tags

Overview

Reducing Underflow in Mixed Precision Training by Gradient Scaling

Introduction

Installation

Project structure

Usage

Examples

Testing

Owner

Ruizhe Zhao

Pytorch implemenation of Stochastic Multi-Label Image-to-image Translation (SMIT)

Airborne Optical Sectioning (AOS) is a wide synthetic-aperture imaging technique

This is an implementation for the CVPR2020 paper "Learning Invariant Representation for Unsupervised Image Restoration"

A Comprehensive Empirical Study of Vision-Language Pre-trained Model for Supervised Cross-Modal Retrieval

Official implementation of the paper WAV2CLIP: LEARNING ROBUST AUDIO REPRESENTATIONS FROM CLIP

Automatic differentiation with weighted finite-state transducers.

EfficientDet (Scalable and Efficient Object Detection) implementation in Keras and Tensorflow

MVP Benchmark for Multi-View Partial Point Cloud Completion and Registration

Twin-deep neural network for semi-supervised learning of materials properties

4D Human Body Capture from Egocentric Video via 3D Scene Grounding

7th place solution of Human Protein Atlas - Single Cell Classification on Kaggle

Neural Nano-Optics for High-quality Thin Lens Imaging

Learning to trade under the reinforcement learning framework

A curated list of long-tailed recognition resources.

Neural implicit reconstruction experiments for the Vector Neuron paper

Code for "Neural 3D Scene Reconstruction with the Manhattan-world Assumption" CVPR 2022 Oral

Toontown: Galaxy, a new Toontown game based on Disney's Toontown Online

yolov5 deepsort 行人 车辆 跟踪 检测 计数

Google Landmark Recogntion and Retrieval 2021 Solutions

Boundary IoU API (Beta version)

yolov5 deepsort 行人车辆跟踪检测计数