Modeling Temporal Concept Receptive Field Dynamically for Untrimmed Video Analysis

Last update: Jul 08, 2021

This is a PyTorch implementation of the model described in our paper:

Z. Qi, S. Wang, C. Su, L. Su, W. Zhang, and Q. Huang. Modeling Temporal Concept Receptive Field Dynamically for Untrimmed Video Analysis. ACM MM 2020.

Dependencies

PyTorch 1.2.0
CUDA 9.2.148
cuDNN 7.6.2
opencv-python 4.2.0.34
Python 3.6.9

Data

Dataset Preparation

Download the pre-trained concept detector weights from Baidu (password 'wv0e') or Google Drive and put them in the folder weights/.
Download the FCVID dataset from http://bigvid.fudan.edu.cn/FCVID/.
The annotations for each dataset are provided in the folder data/FCVID/video_labels.
Extract the frames of each video and put them in the folder data/FCVID/frames/.
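
The frame extraction step can be sketched with opencv-python, which is already listed under Dependencies. This is a minimal sketch, not the authors' script: the helper names `frame_path` and `extract_frames`, the `every_n` parameter, and the output layout (one sub-folder per video, 6-digit JPEG file names starting at 1) are all assumptions. Check the repository's data loader for the layout and naming it actually expects before using it.

```python
import os


def frame_path(frames_root, video_path, index):
    """Return the output path for frame `index` of `video_path`.

    Assumed layout (not confirmed by the repository):
    <frames_root>/<video file name without extension>/<6-digit index>.jpg
    """
    video_id = os.path.splitext(os.path.basename(video_path))[0]
    return os.path.join(frames_root, video_id, "{:06d}.jpg".format(index))


def extract_frames(video_path, frames_root="data/FCVID/frames", every_n=1):
    """Decode `video_path` and save every `every_n`-th frame as a JPEG.

    Returns the number of frames written.
    """
    import cv2  # opencv-python; imported here so frame_path works without it

    capture = cv2.VideoCapture(video_path)
    if not capture.isOpened():
        raise IOError("Cannot open video: {}".format(video_path))

    read_count = 0  # frames decoded so far
    saved = 0       # frames written so far (also the 1-based file index)
    while True:
        ok, frame = capture.read()
        if not ok:  # end of stream or decode failure
            break
        if read_count % every_n == 0:
            saved += 1
            out_path = frame_path(frames_root, video_path, saved)
            os.makedirs(os.path.dirname(out_path), exist_ok=True)
            cv2.imwrite(out_path, frame)
        read_count += 1

    capture.release()
    return saved
```

Calling `extract_frames("path/to/video.mp4")` writes the frames under data/FCVID/frames/video/. Setting `every_n` above 1 subsamples the video, which may save disk space on long untrimmed videos, though whether the model expects subsampled frames is not stated here.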

For the ActivityNet dataset (http://activity-net.org/), we use the latest released version (v1.3).

Train

python main.py --gpu_ids 0,1 --model_name tdcmn_si_soa --dataset FCVID --no_test

For other hyperparameters, please refer to opts.py.

Test

Pre-trained model weights are available from Baidu (password 'szlk') or Google Drive.
Download the pre-trained weights and put them in the folder results/.
python main.py --gpu_ids 0,1 --model_name tdcmn_si_soa --dataset FCVID --resume_path pretrained_model/tdcmn_si_soa.pth --no_train --test_crop_number 1
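
Before launching the command above, it may help to confirm that the folders named in this README and the checkpoint passed to --resume_path actually exist. The following is a small sketch under assumptions: the helper name `missing_paths` is hypothetical and not part of the repository, and it only checks that the paths exist, not that their contents are valid.

```python
import os


def missing_paths(repo_root, resume_path=None):
    """Return the expected paths that do not exist under `repo_root`.

    The folder names come from this README; `resume_path` is whatever
    value will be passed to --resume_path (relative to `repo_root`).
    """
    expected = [
        "weights",                                      # concept detector weights
        os.path.join("data", "FCVID", "video_labels"),  # annotations
        os.path.join("data", "FCVID", "frames"),        # extracted frames
    ]
    if resume_path is not None:
        expected.append(resume_path)
    return [p for p in expected
            if not os.path.exists(os.path.join(repo_root, p))]


if __name__ == "__main__":
    # Run from the repository root; prints one line per missing path.
    for path in missing_paths(".", "pretrained_model/tdcmn_si_soa.pth"):
        print("missing:", path)
```

An empty result means every listed path was found. Any path it prints should be created or downloaded first, following the steps in this README.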

Citation

Please cite our paper if you use this code in your own work:

@inproceedings{qi2020modeling,
  title={Modeling Temporal Concept Receptive Field Dynamically for Untrimmed Video Analysis},
  author={Qi, Zhaobo and Wang, Shuhui and Su, Chi and Su, Li and Zhang, Weigang and Huang, Qingming},
  booktitle={Proceedings of the 28th ACM International Conference on Multimedia},
  pages={3798--3806},
  year={2020}
}

Contact

If you have any problems with our code, feel free to contact

[email protected]

Owner

qzhb