Implementation of our paper "Video Playback Rate Perception for Self-supervised Spatio-Temporal Representation Learning".

Last update: Dec 29, 2022

Related tags

Deep Learning PRP

Overview

PRP

Introduction

This is the implementation of our paper "Video Playback Rate Perception for Self-supervised Spatio-Temporal Representation Learning".

Getting started

Install

Our experiments run on Python 3.6.1 and PyTorch 0.4.1. All dependencies can be installed using pip:
```
python -m pip install -r requirements.txt
```

Data preparation

We construct experiments on UCF101 and HMDB51 (the split1 of UCF101 for pre-training and the rest for fine-tuning). The expected dataset directory hierarchy is as follow:

├── UCF101/HMDB51
│   ├── split
│   │   ├── classInd.txt
│   │   ├── testlist01.txt
│   │   ├── trainlist01.txt
│   │   └── ...
│   └── video
│       ├── ApplyEyeMakeup
│       │   └── *.avi
│       └── ...
└── ...

Train and Test Pre-training on Pretext Task

python train_predict.py --gpu 0 --epoch 300 --model_name c3d/r21d/r3d

Action Recognition

python ft_classfy.py --gpu 0 --model_name c3d/r21d/r3d --pre_path [your pre-trained model] --split 1/2/3
python test_classify.py

Video Retrieval

Please refer to the code video_retrieval_samples.py of VCOP.

Model zoo

Models

Pre-trained PRP model on the split1 of UCF101: C3D(OneDrive); R3D(OneDrive); R(2+1)D(OneDrive)
Action Recognition Results

Architecture UCF101(%) HMDB51(%)

C3D 69.1 34.5

R3D 66.5 29.7

R(2+1)D 72.1 35.0

Architecture	UCF101(%)	HMDB51(%)
C3D	69.1	34.5
R3D	66.5	29.7
R(2+1)D	72.1	35.0

License

This project is released under the Apache 2.0 license.

Citation

Please cite the following paper if you feel RSPNet useful to your research

@InProceedings{Yao_2020_CVPR,  
author = {Yao, Yuan and Liu, Chang and Luo, Dezhao and Zhou, Yu and Ye, Qixiang},  
title = {Video Playback Rate Perception for Self-Supervised Spatio-Temporal Representation Learning},  
booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)},  
month = {June},  
year = {2020}  
}

Implementation of our paper "Video Playback Rate Perception for Self-supervised Spatio-Temporal Representation Learning".

Related tags

Overview

PRP

Introduction

Getting started

Model zoo

License

Citation

Owner

yuanyao366

This project is based on RIFE and aims to make RIFE more practical for users by adding various features and design new models

95.47% on CIFAR10 with PyTorch

Tensorflow-Project-Template - A best practice for tensorflow project template architecture.

Ground truth data for the Optical Character Recognition of Historical Classical Commentaries.

A Pytree Module system for Deep Learning in JAX

PyTorch implementation of DeepDream algorithm

Density-aware Single Image De-raining using a Multi-stream Dense Network (CVPR 2018)

Re-implementation of the vector capsule with dynamic routing

RL and distillation in CARLA using a factorized world model

IGCN : Image-to-graph convolutional network

Official pytorch implementation of Rainbow Memory (CVPR 2021)

Pytorch implementation code for [Neural Architecture Search for Spiking Neural Networks]

Curvlearn, a Tensorflow based non-Euclidean deep learning framework.

Repository for open research on optimizers.

Official code repository of the paper Learning Associative Inference Using Fast Weight Memory by Schlag et al.

you can add any codes in any language by creating its respective folder (if already not available).

Food recognition model using convolutional neural network & computer vision

Training a Resilient Q-Network against Observational Interference, Causal Inference Q-Networks

Multi-task head pose estimation in-the-wild

GNNAdvisor: An Efficient Runtime System for GNN Acceleration on GPUs