Efficient Sharpness-aware Minimization for Improved Training of Neural Networks

Code for “Efficient Sharpness-aware Minimization for Improved Training of Neural Networks”

Requisite

This code is implemented in PyTorch, and we have tested the code under the following environment settings:

python = 3.8.8
torch = 1.8.0
torchvision = 0.9.0

What is in this repository

Codes for our ESAM on CIFAR10/CIFAR100 datasets.

How to use it

from utils.layer_dp_sam import ESAM
base_optimizer = torch.optim.SGD(model.parameters(),lr=args.learning_rate,momentum=0.9,weight_decay=args.weight_decay)
optimizer = ESAM(paras, base_optimizer, rho=args.rho, weight_dropout=args.weight_dropout,adaptive=args.isASAM,nograd_cutoff=args.nograd_cutoff,opt_dropout = args.opt_dropout,temperature=args.temperature)

--beta the SWP hyperparameter

--gamma the SDS hyperparameter

During training loss_fct should have reduction="none", to return instance-wise losses. defined_backward is the function used for DDP and mixed precision backward

loss_fct = torch.nn.CrossEntropyLoss(reduction="none")
def defined_backward():
    if args.fp16:
    with amp.scale_loss(loss, optimizer0) as scaled_loss:
        scaled_loss.backward()
    else:
        loss.backward()

paras = [inputs,targets,loss_fct,model,defined_backward]
optimizer.paras = paras
optimizer.step()
predictions_logits,loss = optimizer.returnthings

Example

bash run.sh

Reference Code

[1] SAM

Efficient Sharpness-aware Minimization for Improved Training of Neural Networks

Related tags

Overview

Efficient Sharpness-aware Minimization for Improved Training of Neural Networks

Requisite

What is in this repository

How to use it

Example

Reference Code

Owner

Angusdu

Emotion classification of online comments based on RNN

This is the PyTorch implementation of GANs N’ Roses: Stable, Controllable, Diverse Image to Image Translation

Cereal box identification in store shelves using computer vision and a single train image per model.

StyleMapGAN - Official PyTorch Implementation

Unified learning approach for egocentric hand gesture recognition and fingertip detection

Code for Estimating Multi-cause Treatment Effects via Single-cause Perturbation (NeurIPS 2021)

Code release for the paper “Worldsheet Wrapping the World in a 3D Sheet for View Synthesis from a Single Image”, ICCV 2021.

Arquitetura e Desenho de Software.

Breaking the Curse of Space Explosion: Towards Efficient NAS with Curriculum Search

Airbus Ship Detection Challenge

Constraint-based geometry sketcher for blender

Segmentation vgg16 fcn - cityscapes

Pipeline code for Sequential-GAM(Genome Architecture Mapping).

Code for "Typilus: Neural Type Hints" PLDI 2020

Use MATLAB to simulate the signal and extract features. Use PyTorch to build and train deep network to do spectrum sensing.

Python implementation of the multistate Bennett acceptance ratio (MBAR)

Beyond imagenet attack (accepted by ICLR 2022) towards crafting adversarial examples for black-box domains.

Caffe-like explicit model constructor. C(onfig)Model

🔮 A refreshing functional take on deep learning, compatible with your favorite libraries

High performance distributed framework for training deep learning recommendation models based on PyTorch.