Simple data balancing baselines for worst-group-accuracy benchmarks.

Last update: Dec 02, 2022

BalancingGroups

Code to replicate the experimental results from the paper "Simple data balancing baselines achieve competitive worst-group-accuracy".

Replicating the main results

Installing dependencies

The easiest way to get a working environment for this repo is to create a conda environment with the following commands:

conda env create -f environment.yaml
conda activate balancinggroups

If conda is not available, please install the dependencies listed in the requirements.txt file.

Download, extract, and generate metadata for datasets

This script downloads and extracts the datasets, then formats their metadata so that it works with the rest of the code out of the box.

python setup_datasets.py --download --data_path data

Launch jobs

To reproduce the experiments in the paper on a SLURM cluster:

# Launching 1400 combo seeds = 50 hparams x 4 datasets x 7 algorithms
# Each combo seed is run 5 times to compute error bars, totalling 7000 jobs
python train.py --data_path data --output_dir main_sweep --num_hparams_seeds 1400 --num_init_seeds 5 --partition <slurm_partition>

If you want to run the jobs locally, omit the --partition argument.
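
The job counts quoted in the comments above follow from a simple grid product. As a minimal sketch of that arithmetic only (the dataset and algorithm names are placeholders rather than the identifiers train.py uses, and the way the grid is enumerated is an assumption):

```python
from itertools import product

# Placeholder names: the real identifiers used by train.py are not shown here.
datasets = [f"dataset_{i}" for i in range(4)]
algorithms = [f"algorithm_{i}" for i in range(7)]
hparams_per_pair = 50   # hyper-parameter draws per (dataset, algorithm) pair
num_init_seeds = 5      # repeated runs per combination, used for error bars

# One "combo seed" per (dataset, algorithm, hyper-parameter draw).
combos = list(product(datasets, algorithms, range(hparams_per_pair)))

# Every combo is repeated once per init seed.
jobs = [(d, a, h, s) for (d, a, h) in combos for s in range(num_init_seeds)]

print(len(combos))  # 1400
print(len(jobs))    # 7000
```

The two totals correspond to the values passed as --num_hparams_seeds and to the overall number of jobs, respectively.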

Parse results

The parse.py script can generate all of the plots and tables from the paper. By default, it generates the best test worst-group-accuracy table for each dataset/method. This script can be called while the experiments are still running.

python parse.py main_sweep
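
For readers unfamiliar with the metric reported in that table, worst-group accuracy is the lowest accuracy obtained on any single group. The sketch below only illustrates the definition: the function name `worst_group_accuracy` and the toy data are hypothetical, and this is not the code used by parse.py.

```python
from collections import defaultdict

def worst_group_accuracy(predictions, labels, groups):
    """Return (worst accuracy, per-group accuracies) for parallel sequences."""
    correct = defaultdict(int)
    total = defaultdict(int)
    for pred, label, group in zip(predictions, labels, groups):
        total[group] += 1
        correct[group] += int(pred == label)
    per_group = {g: correct[g] / total[g] for g in total}
    return min(per_group.values()), per_group

# Toy example (made-up data): a majority group "a" and a minority group "b".
preds  = [1, 0, 1, 1, 0, 0]
labels = [1, 0, 1, 1, 1, 0]
groups = ["a", "a", "a", "a", "b", "b"]

worst, per_group = worst_group_accuracy(preds, labels, groups)
print(per_group)  # {'a': 1.0, 'b': 0.5}
print(worst)      # 0.5
```

In this toy case the average accuracy is 5/6, while the worst-group accuracy is 0.5, which is the kind of gap the metric is meant to expose.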

License

This source code is released under the CC-BY-NC license, included here.

Owner

Meta Research
