OadTR

Code for our ICCV2021 paper: "OadTR: Online Action Detection with Transformers" ["Paper"]

Update

July 28, 2021: Our Paper "OadTR: Online Action Detection with Transformers" was accepted by ICCV2021. At the same time, we released THUMOS14-Kinetics feature.

Dependencies

pytorch==1.6.0
json
numpy
tensorboard-logger
torchvision==0.7.0

Prepare

Unzip the anno file "./data/anno_thumos.zip"
Download the feature THUMOS14-Anet feature (Note: HDD and TVSeries are available by contacting the authors of the datasets and signing agreements due to the copyrights. You can use this Repo to extract features.)

Training

python main.py --num_layers 3 --decoder_layers 5 --enc_layers 64 --output_dir models/en_3_decoder_5_lr_drop_1

Validation

python main.py --num_layers 3 --decoder_layers 5 --enc_layers 64 --output_dir models/en_3_decoder_5_lr_drop_1 --eval --resume models/en_3_decoder_5_lr_drop_1/checkpoint000{}.pth

Citing OadTR

Please cite our paper in your publications if it helps your research:

@article{wang2021oadtr,
  title={OadTR: Online Action Detection with Transformers},
  author={Wang, Xiang and Zhang, Shiwei and Qing, Zhiwu and Shao, Yuanjie and Zuo, Zhengrong and Gao, Changxin and Sang, Nong},
  journal={arXiv preprint arXiv:2106.11149},
  year={2021}
}

Code for our ICCV 2021 Paper "OadTR: Online Action Detection with Transformers".

Related tags

Overview

OadTR

Update

Dependencies

Prepare

Training

Validation

Citing OadTR

Owner

M3DSSD: Monocular 3D Single Stage Object Detector

Fusion-DHL: WiFi, IMU, and Floorplan Fusion for Dense History of Locations in Indoor Environments

Tutorial to set up TensorFlow Object Detection API on the Raspberry Pi

Supervised Classification from Text (P)

ProjectOxford-ClientSDK - This repo has moved :house: Visit our website for the latest SDKs & Samples

Official implementation of Self-supervised Graph Attention Networks (SuperGAT), ICLR 2021.

This is an official implementation of "Polarized Self-Attention: Towards High-quality Pixel-wise Regression"

Code for our ICASSP 2021 paper: SA-Net: Shuffle Attention for Deep Convolutional Neural Networks

Linescanning - Package for (pre)processing of anatomical and (linescanning) fMRI data

Predicting future trajectories of people in cameras of novel scenarios and views.

A lightweight Python-based 3D network multi-agent simulator. Uses a cell-based congestion model. Calculates risk, loudness and battery capacities of the agents. Suitable for 3D network optimization tasks.

Python library containing BART query generation and BERT-based Siamese models for neural retrieval.

Machine learning Bot detection technique, based on United States election dataset

Structure Information is the Key: Self-Attention RoI Feature Extractor in 3D Object Detection

efficient neural audio synthesis in the waveform domain

This is the official repository for our paper: ''Pruning Self-attentions into Convolutional Layers in Single Path''.

Neural Re-rendering for Full-frame Video Stabilization

TransVTSpotter: End-to-end Video Text Spotter with Transformer

免费获取http代理并生成proxifier配置文件

Weakly Supervised 3D Object Detection from Point Cloud with Only Image Level Annotation