PyTorch implementation of "Optimization Planning for 3D ConvNets"

Last update: Jan 12, 2022

Overview

Optimization-Planning-for-3D-ConvNets

Code for the ICML 2021 paper: Optimization Planning for 3D ConvNets.

Authors: Zhaofan Qiu, Ting Yao, Chong-Wah Ngo, Tao Mei

1. Requirement

The provided codes have been tested with Python-3.9.5 & Pytorch-1.9.0 on four Tesla-V100s.

2. Project structure

├─ base_config             # Pre-set config file for each dataset
├─ dataset                 # Video lists (NOT provided) and code to load video data
├─ jpgs                    # Images for README
├─ layers                  # Custom network layers
├─ model                   # Network architectures
├─ record                  # Config file for each run
├─ utils                   # Basic functions
├─ extract_score_3d.py     # Main script to extract predicted score
├─ helpers.py              # Helper functions for main scripts
├─ merge_score.py          # Main script to merge scores from different clips
├─ train_3d.py             # Main script to launch a training using given strategy
├─ train_3d_op.py          # Main script to launch a searching of best strategy
└─ run.sh                  # Shell script for training-extracting-merging pipeline

3. Run the code

Pre-process the target dataset and put the lists in to the dataset folder. Codes in dataset/video_dataset.py can load three video formats (raw video, jpeg frames and video LMDB) and can be simply modified to support the custom format.
Make config file in the record folder. The config examples include op-*.yml for pre-searched strategy, kinetics-*.yml for simple strategy on Kinetics-400,
Run run.sh for the training-extracting-merging pipeline or replace train_3d.py with train_3d_op.py for searching the optimal strategy.

4. TO DO

Add more explainations and examples.

5. Contact

Please feel free to email to Zhaofan Qiu if you have any question regarding the paper or any suggestions for further improvements.

6. Citation

If you find this code helpful, thanks for citing our work as

@inproceedings{qiu2021optimization,
title={Optimization Planning for 3D ConvNets},
author={Qiu, Zhaofan and Yao, Ting and Ngo, Chong-Wah and Mei, Tao},
booktitle={Proceedings of the 38th International Conference on Machine Learning (ICML)},
publisher={PMLR},
year={2021}
}

Please also pay attention to the citations of the included networks/algorithms.

PyTorch implementation of "Optimization Planning for 3D ConvNets"

Related tags

Overview

Optimization-Planning-for-3D-ConvNets

Code for the ICML 2021 paper: Optimization Planning for 3D ConvNets.

Authors: Zhaofan Qiu, Ting Yao, Chong-Wah Ngo, Tao Mei

1. Requirement

2. Project structure

3. Run the code

4. TO DO

5. Contact

6. Citation

Owner

Zhaofan Qiu

A Pytorch implementation of "LegoNet: Efficient Convolutional Neural Networks with Lego Filters" (ICML 2019).

Pytorch Implementation of Adversarial Deep Network Embedding for Cross-Network Node Classification

Test-Time Personalization with a Transformer for Human Pose Estimation, NeurIPS 2021

Tool cek opsi checkpoint facebook!

A TensorFlow implementation of the Mnemonic Descent Method.

TensorFlow Similarity is a python package focused on making similarity learning quick and easy.

code for TCL: Vision-Language Pre-Training with Triple Contrastive Learning, CVPR 2022

Deep Illuminator is a data augmentation tool designed for image relighting. It can be used to easily and efficiently generate a wide range of illumination variants of a single image.

GANTheftAuto is a fork of the Nvidia's GameGAN

JASS: Japanese-specific Sequence to Sequence Pre-training for Neural Machine Translation

PyTorch implementation of DeepLab v2 on COCO-Stuff / PASCAL VOC

[ICML 2021] Break-It-Fix-It: Learning to Repair Programs from Unlabeled Data

PyTorch implementation of EigenGAN

Reproduction of Vision Transformer in Tensorflow2. Train from scratch and Finetune.

CellRank's reproducibility repository.

The code for the CVPR 2021 paper Neural Deformation Graphs, a novel approach for globally-consistent deformation tracking and 3D reconstruction of non-rigid objects.

A Home Assistant custom component for Lobe. Lobe is an AI tool that can classify images.

Unsupervised Feature Ranking via Attribute Networks.

Statistical-Rethinking-with-Python-and-PyMC3 - Python/PyMC3 port of the examples in " Statistical Rethinking A Bayesian Course with Examples in R and Stan" by Richard McElreath

Single Image Random Dot Stereogram for Tensorflow