LSTC: Boosting Atomic Action Detection with Long-Short-Term Context

Last update: Oct 11, 2022

Related tags

Overview

LSTC: Boosting Atomic Action Detection with Long-Short-Term Context

This Repository contains the code on AVA of our ACM MM 2021 paper: LSTC: Boosting Atomic Action Detection with Long-Short-Term Context

Installation

See INSTALL.md for details on installing the codebase, including requirement and environment settings

Data

For data preparation and setup, our LSTC strictly follows the processing of PySlowFast, See DATASET.md for details on preparing the data.

Run the code

We take SlowFast-ResNet50 as an example

train the model

python3 tools/run_net.py --cfg config/AVA/SLOWFAST_32x12_R50_LFB.yaml \
    AVA.FEATURE_BANK_PATH 'path/to/feature/bank/folder' \
    TRAIN.CHECKPOINT_FILE_PATH 'path/to/pretrained/backbone' \
    OUTPUT_DIR 'path/to/output/folder'

test the model

python3 tools/run_net.py --cfg config/AVA/SLOWFAST_32x12_R50_LFB.yaml \
    AVA.FEATURE_BANK_PATH 'path/to/feature/bank/folder' \
    OUTPUT_DIR 'path/to/output/folder' \
    TRAIN.ENABLE False \ 
    TEST.ENABLE True

If you want to start the DDP training from command line with torch.distributed.launch, please set start_method='cmd' in tools/run_net.py

Resource

The codebase provide following resources for fast training and validation

Pretrained backbone on Kinetics

backbone	dataset	model type	link
ResNet50	Kinetics400	Caffe2	Google Drive/Baidu Disk (Code: y1wl)
ResNet101	Kinetics600	Caffe2	Google Drive/Baidu Disk (Code: slde)

Extracted long term feature bank

backbone	feature bank (LMDB)	dimension
ResNet50	Google Drive	1280
ResNet101	Google Drive	2304

Checkpoint file

backbone	checkpoint	model type
ResNet50	Google Drive/Baidu Disk (Code: fi0s)	pytorch
ResNet101	Google Drive/Baidu Disk (Code: g63o)	pytorch

Acknowledgement

This codebase is built upon PySlowFast.

Citation

If you find this repository helps your research, please refer following paper

@InProceedings{Yuxi_2021_ACM,
  author = {Li, Yuxi and Zhang, Boshen and Li, Jian and Wang, Yabiao and Wang, Chengjie and Li, Jilin and Huang, Feiyue and Lin, Weiyao},
  title = {LSTC: Boosting Atomic Action Detection with Long-Short-Term Context},
  booktitle = {ACM Conference on Multimedia},
  month = {October},
  year = {2021}
}

LSTC: Boosting Atomic Action Detection with Long-Short-Term Context

Related tags

Overview

LSTC: Boosting Atomic Action Detection with Long-Short-Term Context

Installation

Data

Run the code

Resource

Pretrained backbone on Kinetics

Extracted long term feature bank

Checkpoint file

Acknowledgement

Citation

Owner

Tencent YouTu Research

Official pytorch implementation of Rainbow Memory (CVPR 2021)

Exploring Classification Equilibrium in Long-Tailed Object Detection, ICCV2021

We evaluate our method on different datasets (including ShapeNet, CUB-200-2011, and Pascal3D+) and achieve state-of-the-art results, outperforming all the other supervised and unsupervised methods and 3D representations, all in terms of performance, accuracy, and training time.

Improving Transferability of Representations via Augmentation-Aware Self-Supervision

Labels4Free: Unsupervised Segmentation using StyleGAN

KakaoBrain KoGPT (Korean Generative Pre-trained Transformer)

The implementation of DeBERTa

KwaiRec: A Fully-observed Dataset for Recommender Systems (Density: Almost 100%)

Neuralnetwork - Basic Multilayer Perceptron Neural Network for deep learning

Code for Deep Single-image Portrait Image Relighting

Official pytorch implementation of paper Dual-Level Collaborative Transformer for Image Captioning (AAAI 2021).

Self-Supervised Document-to-Document Similarity Ranking via Contextualized Language Models and Hierarchical Inference

Official Pytorch implementation of 6DRepNet: 6D Rotation representation for unconstrained head pose estimation.

A Demo server serving Bert through ONNX with GPU written in Rust with <3

COVID-Net Open Source Initiative

Codes for "CSDI: Conditional Score-based Diffusion Models for Probabilistic Time Series Imputation"

Implementation of "Unsupervised Domain Adaptive 3D Detection with Multi-Level Consistency"

Tensorflow implementation of Character-Aware Neural Language Models.

Graph-based community clustering approach to extract protein domains from a predicted aligned error matrix

PyTorch implementation for "Sharpness-aware Quantization for Deep Neural Networks".