CT-Net: Channel Tensorization Network for Video Classification

Last update: Nov 15, 2022

Related tags

Overview

[ICLR2021] CT-Net: Channel Tensorization Network for Video Classification

@inproceedings{
li2021ctnet,
title={{\{}CT{\}}-Net: Channel Tensorization Network for Video Classification},
author={Kunchang Li and Xianhang Li and Yali Wang and Jun Wang and Yu Qiao},
booktitle={International Conference on Learning Representations},
year={2021},
url={https://openreview.net/forum?id=UoaQUQREMOs}
}

Overview

[2021/6/3] We release the PyTorch code of CT-Net. More details and models will be available.

Model Zoo

More models will be released in a month...

Now we release the model for visualization, please download it from here and put it in ./model. (passward: t3to)

Install

pip install -r requirements.txt

Dataset

In our paper, we conduct experiments on Kinetics-400, Something-Something V1&V2, UCF101, and HMDB51. Please refer to TSM repo for the detailed guide of data pre-processing.

Training and Testing

Please refer to scripts/train.sh and scripts/test.sh, more details can be found in the appendix of our paper.

Setting environment

source ./init.sh

Training

We use dense sampling and uniform sampling for Kinetics and Something-Something respecitively.

CUDA_VISIBLE_DEVICES=0,1,2,3,4,5,6,7 \
python3 main.py something RGB \
     --root-log ./log \
     --root-model ./model \
     --arch resnet50 --model CT_Net --num-segments 8 \
     --gd 20 --lr 0.02 --unfrozen-epoch 0 --lr-type cos \
     --warmup 10 --tune-epoch 10 --tune-lr 0.02 --epochs 45 \
     --batch-size 8 -j 24 --dropout 0.3 --consensus-type=avg \
     --npb --num-total 7 --full-res --gpus 0 1 2 3 4 5 6 7 --suffix 2021

Testing

CUDA_VISIBLE_DEVICES=0,1,2,3,4,5,6,7 \
python3 test_acc.py something RGB \
     --arch resnet50 --model CT_Net --num-segments 8 \
     --batch-size 64 -j 8 --consensus-type=avg \
     --resume ./model/ct_net_8f_r50.pth.tar \
     --npb --num-total 7 --evaluate --test-crops 1 --full-res --gpus 0 1 2 3 4 5 6 7

Demo and visiualization

See demo/show_cam.ipynb，

source ./init.sh
cd demo
jupyter notebook

CT-Net: Channel Tensorization Network for Video Classification

Related tags

Overview

[ICLR2021] CT-Net: Channel Tensorization Network for Video Classification

Overview

Model Zoo

Install

Dataset

Training and Testing

Setting environment

Training

Testing

Demo and visiualization

Owner

Code to generate datasets used in "How Useful is Self-Supervised Pretraining for Visual Tasks?"

Unified Interface for Constructing and Managing Workflows on different workflow engines, such as Argo Workflows, Tekton Pipelines, and Apache Airflow.

Pretrained Cost Model for Distributed Constraint Optimization Problems

Bi-level feature alignment for versatile image translation and manipulation (Under submission of TPAMI)

Housing Price Prediction

Simple-System-Convert--C--F - Simple System Convert With Python

Official implementation for CVPR 2021 paper: Adaptive Class Suppression Loss for Long-Tail Object Detection

All of the figures and notebooks for my deep learning book, for free!

Official repository for the paper, MidiBERT-Piano: Large-scale Pre-training for Symbolic Music Understanding.

Self-describing JSON-RPC services made easy

AntroPy: entropy and complexity of (EEG) time-series in Python

Code for the paper "Improved Techniques for Training GANs"

A Review of Deep Learning Techniques for Markerless Human Motion on Synthetic Datasets

Code and data to accompany the camera-ready version of "Cross-Attention is All You Need: Adapting Pretrained Transformers for Machine Translation" in EMNLP 2021

Code for our paper 'Generalized Category Discovery'

UltraPose: Synthesizing Dense Pose with 1 Billion Points by Human-body Decoupling 3D Model

My personal code and solution to the Synacor Challenge from 2012 OSCON.

Hamiltonian Dynamics with Non-Newtonian Momentum for Rapid Sampling

This repository contains the code for: RerrFact model for SciVer shared task

Face Identity Disentanglement via Latent Space Mapping [SIGGRAPH ASIA 2020]