Code associated with the paper "Towards Understanding the Data Dependency of Mixup-style Training".

Last update: Nov 11, 2021

Related tags

Overview

Mixup-Data-Dependency

Code associated with the paper "Towards Understanding the Data Dependency of Mixup-style Training".

Running Alternating Line Experiments

In order to generate the plots found in Section 2.3 ("A Mixup Failure Case"), one can run the following command for different values of alpha.

python3 tasks/train_models.py --task-name NCAL --alpha 128 --num-runs 10

If running using slurm, it is also possible to just run:

./tasks/run_task_with_erm.sh NCAL 128 10 0

The generated output files can be found under runs/ and plots/ with file names based on the provided parameters.

Running Image Classification Experiments

In order to generate the plots found in Section 2.4 ("Sufficient Conditions for Minimizing the Original Risk"), one can run the following commands for different values of alpha.

python3 tasks/train_models.py --task-name MNIST --alpha 1024 --num-runs 5
python3 tasks/train_models.py --task-name CIFAR10 --alpha 1024 --num-runs 5
python3 tasks/train_models.py --task-name CIFAR100 --alpha 1024 --num-runs 5

Once again, if running using slurm it is possible to instead run ./tasks/run_task_with_erm.sh with the same arguments as above and an additional fourth argument set to 0. As before, output files can be found in runs/ and plots/.

Running Angular Distance Analysis

To recreate the approximate epsilon computation found in Section 2.4 (in the discussion of application of sufficient conditions), one can run the following command after manually setting subset_prop and alpha in analysis/mixup_point_analysis.py.

python3 analysis/mixup_point_analysis.py

Running Two Moons Experiments

To recreate the two moons experiments found in Section 3.1 ("The Margin of Mixup Classifiers"), set alpha_1 and alpha_2 in tasks/two_moons/py to the mixing parameters to be compared and then run the following command.

python3 tasks/two_moons.py

Code associated with the paper "Towards Understanding the Data Dependency of Mixup-style Training".

Related tags

Overview

Mixup-Data-Dependency

Running Alternating Line Experiments

Running Image Classification Experiments

Running Angular Distance Analysis

Running Two Moons Experiments

Owner

Muthu Chidambaram

Kohei's 5th place solution for xview3 challenge

SMD-Nets: Stereo Mixture Density Networks

Breast Cancer Classification Model is applied on a different dataset

[CVPR 2021] Scan2Cap: Context-aware Dense Captioning in RGB-D Scans

The source code for CATSETMAT: Cross Attention for Set Matching in Bipartite Hypergraphs

Multi-Task Pre-Training for Plug-and-Play Task-Oriented Dialogue System

Dynamic Visual Reasoning by Learning Differentiable Physics Models from Video and Language (NeurIPS 2021)

Training BERT with Compute/Time (Academic) Budget

The Balloon Learning Environment - flying stratospheric balloons with deep reinforcement learning.

A scanpy extension to analyse single-cell TCR and BCR data.

Kaggle Lyft Motion Prediction for Autonomous Vehicles 4th place solution

FastCover: A Self-Supervised Learning Framework for Multi-Hop Influence Maximization in Social Networks by Anonymous.

[ICCV'21] NEAT: Neural Attention Fields for End-to-End Autonomous Driving

Machine learning algorithms for many-body quantum systems

AdaShare: Learning What To Share For Efficient Deep Multi-Task Learning

Cervix ROI Segmentation Using U-NET

Si Adek Keras is software VR dangerous object detection.

🔎 Super-scale your images and run experiments with Residual Dense and Adversarial Networks.

A new codebase for Group Activity Recognition. It contains codes for ICCV 2021 paper: Spatio-Temporal Dynamic Inference Network for Group Activity Recognition and some other methods.

Deep learning (neural network) based remote photoplethysmography: how to extract pulse signal from video using deep learning tools