Implementation of Online Label Smoothing in PyTorch

Last update: Dec 14, 2022

Online Label Smoothing

PyTorch implementation of Online Label Smoothing (OLS), presented in Delving Deep into Label Smoothing.

Introduction

As the abstract states, OLS is a strategy that generates soft labels based on the statistics of the model's predictions for the target category. The core idea is that, instead of using fixed soft labels for every epoch, the soft labels are updated each epoch from the predictions on correctly classified samples.
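The update described above can be sketched in a few lines. This is a minimal illustration under stated assumptions, not the code of this package: the helper names accumulate_stats and finish_epoch are hypothetical, row c of the matrix is taken to be the soft label for class c, and keeping the previous row for a class with no correct predictions is a choice made for this sketch only.

```python
import torch
import torch.nn.functional as F


def accumulate_stats(stats, counts, logits, targets):
    """Add the softmax outputs of correctly classified samples to per-class sums.

    stats:  (k, k) running sums; row c collects predictions for target class c
    counts: (k,) number of correctly classified samples seen per class
    """
    probs = F.softmax(logits.detach(), dim=-1)
    correct = probs.argmax(dim=-1) == targets
    idx = targets[correct]
    stats.index_add_(0, idx, probs[correct])
    counts.index_add_(0, idx, torch.ones(idx.shape[0], device=counts.device))
    return stats, counts


def finish_epoch(stats, counts, previous):
    """Turn the sums into soft labels by averaging over the collected samples.

    Classes with no correct prediction keep their previous row
    (an assumption of this sketch).
    """
    soft = previous.clone()
    seen = counts > 0
    soft[seen] = stats[seen] / counts[seen].unsqueeze(1)
    return soft
```

Each row of the result is an average of probability vectors, so it sums to 1 and can be used directly as a soft target in the following epoch.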

More details and experiment results can be found in the paper.

Usage

Using OnlineLabelSmoothing is straightforward: use it as you would PyTorch's CrossEntropyLoss. The only difference is that you should call OnlineLabelSmoothing.next_epoch() at the end of each epoch. This updates the OnlineLabelSmoothing.supervise matrix, which provides the soft labels for the next epoch.
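The role of alpha is not spelled out here. One plausible reading, following the paper, is that it balances the usual hard-label cross-entropy against a cross-entropy computed with the soft labels. The sketch below illustrates that reading only: mixed_loss and soft_labels are hypothetical names, soft_labels is assumed to hold one soft label per row, and the exact weighting used inside OnlineLabelSmoothing may differ.

```python
import torch
import torch.nn.functional as F


def mixed_loss(logits, targets, soft_labels, alpha=0.5):
    """alpha * hard cross-entropy + (1 - alpha) * cross-entropy against soft labels."""
    log_probs = F.log_softmax(logits, dim=-1)
    # Standard cross-entropy with the integer class targets
    hard = F.nll_loss(log_probs, targets)
    # Look up the soft label of each sample's target class: shape (batch, k)
    soft_targets = soft_labels[targets]
    soft = -(soft_targets * log_probs).sum(dim=-1).mean()
    return alpha * hard + (1 - alpha) * soft
```

With an identity matrix as soft_labels, both terms coincide and the result equals plain cross-entropy, which is a quick sanity check for the sketch.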

Standalone

from ols import OnlineLabelSmoothing
import torch

k = 4  # Number of classes
b = 32  # Batch size
criterion = OnlineLabelSmoothing(alpha=0.5, n_classes=k, smoothing=0.1)
logits = torch.randn(b, k)  # Predictions
y = torch.randint(k, (b,))  # Ground truth

loss = criterion(logits, y)

PyTorch

from ols import OnlineLabelSmoothing

# `net` and `optimizer` are assumed to be defined elsewhere
criterion = OnlineLabelSmoothing(alpha=..., n_classes=...)
for epoch in range(...):  # loop over the dataset multiple times
    for i, data in enumerate(...):
        inputs, labels = data
        # zero the parameter gradients
        optimizer.zero_grad()
        # forward + backward + optimize
        outputs = net(inputs)
        loss = criterion(outputs, labels)
        loss.backward()
        optimizer.step()
    print(f'Epoch {epoch} finished!')
    # Update the soft labels for next epoch
    criterion.next_epoch()

PyTorchLightning

With PyTorch Lightning, call next_epoch() from the on_train_epoch_end hook:

import pytorch_lightning as pl
from ols import OnlineLabelSmoothing


class LitClassification(pl.LightningModule):
    def __init__(self):
        super().__init__()
        self.criterion = OnlineLabelSmoothing(alpha=..., n_classes=...)

    def forward(self, x):
        pass

    def configure_optimizers(self):
        pass

    def training_step(self, train_batch, batch_idx):
        pass

    def on_train_epoch_end(self, **kwargs):
        self.criterion.next_epoch()

Installation

pip install -r requirements.txt

Citation

@misc{zhang2020delving,
      title={Delving Deep into Label Smoothing}, 
      author={Chang-Bin Zhang and Peng-Tao Jiang and Qibin Hou and Yunchao Wei and Qi Han and Zhen Li and Ming-Ming Cheng},
      year={2020},
      eprint={2011.12562},
      archivePrefix={arXiv},
      primaryClass={cs.CV}
}
