Temporal Segment Networks (TSN) in PyTorch

Last update: Jan 03, 2023

Overview

TSN-Pytorch

We have released MMAction, a full-fledged action understanding toolbox based on PyTorch. It includes implementation for TSN as well as other STOA frameworks for various tasks. The lessons we learned in this repo are incorporated into MMAction to make it bettter. We highly recommend you switch to it. This repo will remain here for historical references.

Note: always use git clone --recursive https://github.com/yjxiong/tsn-pytorch to clone this project. Otherwise you will not be able to use the inception series CNN archs.

This is a reimplementation of temporal segment networks (TSN) in PyTorch. All settings are kept identical to the original caffe implementation.

For optical flow extraction and video list generation, you still need to use the original TSN codebase.

Training

To train a new model, use the main.py script.

The command to reproduce the original TSN experiments of RGB modality on UCF101 can be

python main.py ucf101 RGB <ucf101_rgb_train_list> <ucf101_rgb_val_list> \
   --arch BNInception --num_segments 3 \
   --gd 20 --lr 0.001 --lr_steps 30 60 --epochs 80 \
   -b 128 -j 8 --dropout 0.8 \
   --snapshot_pref ucf101_bninception_

For flow models:

python main.py ucf101 Flow <ucf101_flow_train_list> <ucf101_flow_val_list> \
   --arch BNInception --num_segments 3 \
   --gd 20 --lr 0.001 --lr_steps 190 300 --epochs 340 \
   -b 128 -j 8 --dropout 0.7 \
   --snapshot_pref ucf101_bninception_ --flow_pref flow_

For RGB-diff models:

python main.py ucf101 RGBDiff <ucf101_rgb_train_list> <ucf101_rgb_val_list> \
   --arch BNInception --num_segments 7 \
   --gd 40 --lr 0.001 --lr_steps 80 160 --epochs 180 \
   -b 128 -j 8 --dropout 0.8 \
   --snapshot_pref ucf101_bninception_

Testing

After training, there will checkpoints saved by pytorch, for example ucf101_bninception_rgb_checkpoint.pth.

Use the following command to test its performance in the standard TSN testing protocol:

python test_models.py ucf101 RGB <ucf101_rgb_val_list> ucf101_bninception_rgb_checkpoint.pth \
   --arch BNInception --save_scores <score_file_name>

Or for flow models:

python test_models.py ucf101 Flow <ucf101_rgb_val_list> ucf101_bninception_flow_checkpoint.pth \
   --arch BNInception --save_scores <score_file_name> --flow_pref flow_

Temporal Segment Networks (TSN) in PyTorch

Related tags

Overview

TSN-Pytorch

Training

Testing

Owner

Revisiting Contrastive Methods for Unsupervised Learning of Visual Representations. [2021]

Yet another video caption

🔥 Cogitare - A Modern, Fast, and Modular Deep Learning and Machine Learning framework for Python

Reinforcement Learning via Supervised Learning

Unofficial pytorch implementation for Self-critical Sequence Training for Image Captioning. and others.

DeepLab is a state-of-art deep learning system for semantic image segmentation built on top of Caffe.

PyTorch implementation for NED. It can be used to manipulate the facial emotions of actors in videos based on emotion labels or reference styles.

Paddle Graph Learning (PGL) is an efficient and flexible graph learning framework based on PaddlePaddle

Self-attentive task GAN for space domain awareness data augmentation.

MILK: Machine Learning Toolkit

BERTMap: A BERT-Based Ontology Alignment System

MakeItTalk: Speaker-Aware Talking-Head Animation

Preparation material for Dropbox interviews

🎓Automatically Update CV Papers Daily using Github Actions (Update at 12:00 UTC Every Day)

Check out the StyleGAN repo and place it in the same directory hierarchy as the present repo

[CVPR 2019 Oral] Multi-Channel Attention Selection GAN with Cascaded Semantic Guidance for Cross-View Image Translation

Data cleaning, missing value handle, EDA use in this project

Asymmetric Bilateral Motion Estimation for Video Frame Interpolation, ICCV2021

A CV toolkit for my papers.

An implementation for the loss function proposed in Decoupled Contrastive Loss paper.