Kaggle G2Net Gravitational Wave Detection : 2nd place solution

Last update: Dec 26, 2022

Related tags

Deep Learning kaggle-g2net-public

Overview

Kaggle G2Net Gravitational Wave Detection : 2nd place solution

Solution writeup: https://www.kaggle.com/c/g2net-gravitational-wave-detection/discussion/275341

Instructions

1. Download data

You have to download the competition dataset from competition website, and place the files in input/ directory.

┣ input/
┃   ┣ training_labels.csv
┃   ┣ sample_submission.csv
┃   ┣ train/
┃   ┣ test/
┃
┣ configs.py
┣ ...

(Optional:) Add your hardware configurations

# configs.py
HW_CFG = {
    'RTX3090': (16, 128, 1, 24), # CPU count, RAM amount(GB), GPU count, GPU RAM(GB)
    'A100': (9, 60, 1, 40), 
    'Your config', (128, 512, 8, 40) # add your hardware config!
}

2. Setup python environment

conda

conda env create -n kumaconda -f=environment.yaml
conda activate kumaconda

docker

WIP

3. Prepare data

Two new files - input/train.csv and input/test/.csv will be created.

python prep_data.py

(Optional:) Prepare waveform cache

Optionally you can speed up training by making waveform cache.
This is not recommend if your machine has RAM size smaller than 32GB.
input/train_cache.pickle and input/test_cache.pickle will be created.

python prep_data.py --cache

Then, add cache path to Baseline class in configs.py.

# configs.py
class Baseline:
    name = 'baseline'
    seed = 2021
    train_path = INPUT_DIR/'train.csv'
    test_path = INPUT_DIR/'test.csv'
    train_cache = INPUT_DIR/'train_cache.pickle' # here
    test_cache = INPUT_DIR/'test_cache.pickle' # here
    cv = 5

4. Train nueral network

Each experiment class has a name (e.g. name for Nspec16 is nspec_16).
Outputs of an experiment are

outoffolds.npy : (train size, 1) np.float32
predictions.npy : (cv fold, test size, 1) np.float32
{name}_{timestamp}.log : training log
foldx.pt : pytorch checkpoint

All outputs will be created in results/{name}/.

python train.py --config {experiment class}
# [Options]
# --progress_bar    : Everyone loves progress bar
# --inference       : Run inference only
# --tta             : Run test time augmentations (FlipWave)
# --limit_fold x    : Train a single fold x. You must run inference again by yourself.

5. Train neural network again (pseudo-label)

For experiments with name starting with Pseudo, you must use train_pseudo.py.
Outputs and options are the same as train.py.
Make sure the dependent experiment (see the table below) was successfully run.

python train_pseudo.py --config {experiment class}

Experiments

#	Experiment	Dependency	Frontend	Backend	Input size	CV	Public LB	Private LB
1	Pseudo06	Nspec12	CWT	efficientnet-b2	256 x 512	0.8779	0.8797	0.8782
2	Pseodo07	Nspec16	CWT	efficientnet-b2	128 x 1024	0.87841	0.8801	0.8787
3	Pseudo12	Nspec12arch0	CWT	densenet201	256 x 512	0.87762	0.8796	0.8782
4	Pseudo13	MultiInstance04	CWT	xcit-tiny-p16	384 x 768	0.87794	0.8800	0.8782
5	Pseudo14	Nspec16arch17	CWT	efficientnet-b7	128 x 1024	0.87957	0.8811	0.8800
6	Pseudo18	Nspec21	CWT	efficientnet-b4	256 x 1024	0.87942	0.8812	0.8797
7	Pseudo10	Nspec16spec13	CWT	efficientnet-b2	128 x 1024	0.87875	0.8802	0.8789
8	Pseudo15	Nspec22aug1	WaveNet	efficientnet-b2	128 x 1024	0.87846	0.8809	0.8794
9	Pseudo16	Nspec22arch2	WaveNet	efficientnet-b6	128 x 1024	0.87982	0.8823	0.8807
10	Pseudo19	Nspec22arch6	WaveNet	densenet201	128 x 1024	0.87831	0.8818	0.8804
11	Pseudo17	Nspec23arch3	CNN	efficientnet-b6	128 x 1024	0.87982	0.8823	0.8808
12	Pseudo21	Nspec22arch7	WaveNet	effnetv2-m	128 x 1024	0.87861	0.8831	0.8815
13	Pseudo22	Nspec23arch5	CNN	effnetv2-m	128 x 1024	0.87847	0.8817	0.8799
14	Pseudo23	Nspec22arch12	WaveNet	effnetv2-l	128 x 1024	0.87901	0.8829	0.8811
15	Pseudo24	Nspec30arch2	WaveNet	efficientnet-b6	128 x 1024	0.8797	0.8817	0.8805
16	Pseudo25	Nspec25arch1	WaveNet	efficientnet-b3	256 x 1024	0.87948	0.8820	0.8803
17	Pseudo26	Nspec22arch10	WaveNet	resnet200d	128 x 1024	0.87791	0.881	0.8797
18	PseudoSeq04	Seq03aug3	ResNet1d-18		-	0.87663	0.8804	0.8785
19	PseudoSeq07	Seq12arch4	WaveNet		-	0.87698	0.8796	0.8784
20	PseudoSeq03	Seq09	DenseNet1d-121		-	0.86826	0.8723	0.8703

Kaggle G2Net Gravitational Wave Detection : 2nd place solution

Related tags

Overview

Kaggle G2Net Gravitational Wave Detection : 2nd place solution

Instructions

1. Download data

(Optional:) Add your hardware configurations

2. Setup python environment

conda

docker

3. Prepare data

(Optional:) Prepare waveform cache

4. Train nueral network

5. Train neural network again (pseudo-label)

Experiments

Owner

Hiroshechka Y

StarGAN - Official PyTorch Implementation (CVPR 2018)

SmartSim Infrastructure Library.

Shape Matching of Real 3D Object Data to Synthetic 3D CADs (3DV project @ ETHZ)

PointPillars inference with TensorRT

Fairness Metrics: All you need to know

DrWhy is the collection of tools for eXplainable AI (XAI). It's based on shared principles and simple grammar for exploration, explanation and visualisation of predictive models.

Pytorch implementation of paper Semi-supervised Knowledge Transfer for Deep Learning from Private Training Data

[ICCV 2021] Focal Frequency Loss for Image Reconstruction and Synthesis

Scribble-Supervised LiDAR Semantic Segmentation, CVPR 2022 (ORAL)

Sentinel-1 vessel detection model used in the xView3 challenge

Axel - 3D printed robotic hands and they controll with Raspberry Pi and Arduino combo

Interactive Image Generation via Generative Adversarial Networks

Evaluating different engineering tricks that make RL work

Data visualization app for H&M competition in kaggle

git《Self-Attention Attribution: Interpreting Information Interactions Inside Transformer》(AAAI 2021) GitHub:

Methods to get the probability of a changepoint in a time series.

A PyTorch implementation of a Factorization Machine module in cython.

Implementation of ICLR 2020 paper "Revisiting Self-Training for Neural Sequence Generation"

Codebase for testing whether hidden states of neural networks encode discrete structures.

Code for Contrastive-Geometry Networks for Generalized 3D Pose Transfer