A GridMixup augmentation, inspired by GridMask and CutMix

Last update: Dec 28, 2022

Related tags

Deep Learning GridMixup

Overview

GridMixup

A GridMixup augmentation, inspired by GridMask and CutMix

Easy install

pip install git+https://github.com/IlyaDobrynin/GridMixup.git

Overview

This simple augmentation is inspired by the GridMask and CutMix augmentations. The combination of this two augmentations forms proposed method.

Example

To run simple examples notebooks, you should install requirements:

pip install -r requirements.txt

Simple examples are here: demo and pipeline demo

TlDr:

from gridmix import GridMixupLoss

gridmix_cls = GridMixupLoss(
    alpha=(0.4, 0.7),
    hole_aspect_ratio=1.,
    crop_area_ratio=(0.5, 1),
    crop_aspect_ratio=(0.5, 2),
    n_holes_x=(2, 6)
)

images, targets = batch['images'], batch['targets']
images_mixed, targets_mixed = gridmix_cls.get_sample(images=images, targets=targets)
preds = model(images_mixed)
loss = criterion(preds, targets_mixed)

Before

After

GridMixup loss defined as:

lam * CrossEntropyLoss(preds, trues1) + (1 - lam) * CrossEntropyLoss(preds, trues2)

where:

lam - the area of the main image
(1 - lam) - area of the secondary image

Parameters

GridMixupLoss takes follow arguments:

alpha - parameter define area of the main image in mixed image. Could be float or Tuple[float, float].
- if float: lambda parameter gets from the beta-dictribution np.random.beta(alpha, alpha);
- if Tuple[float, float]: lambda parameter gets from the uniform distribution np.random.uniform(alpha[0], alpha[1]).
n_holes_x - number of holes in crop by X axis.
hole_aspect_ratio - aspect ratio of holes.
crop_area_ratio - parameter define area of the secondary image on a mixed image.
crop_aspect_ratio - aspect ratio of crop.

A GridMixup augmentation, inspired by GridMask and CutMix

Related tags

Overview

GridMixup

Easy install

Overview

Example

Parameters

Owner

IlyaDo

Improving Compound Activity Classification via Deep Transfer and Representation Learning

Vehicle detection using machine learning and computer vision techniques for Udacity's Self-Driving Car Engineer Nanodegree.

Pixel-Perfect Structure-from-Motion with Featuremetric Refinement (ICCV 2021, Oral)

A demonstration of using a live Tensorflow session to create an interactive face-GAN explorer.

This repository contains the code for the paper "PIFu: Pixel-Aligned Implicit Function for High-Resolution Clothed Human Digitization"

Perform zero-order Hankel Transform for an 1D array (float or real valued).

Code for DisCo: Remedy Self-supervised Learning on Lightweight Models with Distilled Contrastive Learning

A multilingual version of MS MARCO passage ranking dataset

iPOKE: Poking a Still Image for Controlled Stochastic Video Synthesis

Self-Learned Video Rain Streak Removal: When Cyclic Consistency Meets Temporal Correspondence

This repository contains the code for the binaural-detection model used in the publication arXiv:2111.04637

Convert ONNX model graph to Keras model format.

CBKH: The Cornell Biomedical Knowledge Hub

Keyhole Imaging: Non-Line-of-Sight Imaging and Tracking of Moving Objects Along a Single Optical Path

Learning to Reconstruct 3D Non-Cuboid Room Layout from a Single RGB Image

A Diagnostic Dataset for Compositional Language and Elementary Visual Reasoning

ilpyt: imitation learning library with modular, baseline implementations in Pytorch

GPU Accelerated Non-rigid ICP for surface registration

Universal Adversarial Examples in Remote Sensing: Methodology and Benchmark

Experiments with the Robust Binary Interval Search (RBIS) algorithm, a Query-Based prediction algorithm for the Online Search problem.