code for our ECCV-2020 paper: Self-supervised Video Representation Learning by Pace Prediction

Last update: Dec 14, 2022

Overview

Video_Pace

This repository contains the code for the following paper:

Jiangliu Wang, Jianbo Jiao and Yunhui Liu, "Self-Supervised Video Representation Learning by Pace Prediction", In: ECCV (2020).

Main idea:

Framework:

Requirements

pytroch >= 1.3.0
tensorboardX
cv2
scipy

Usage

Data preparation

UCF101 dataset

Download the original UCF101 dataset from the official website. And then extarct RGB images from videos.
Or direclty download the pre-processed RGB data of UCF101 here provided by feichtenhofer.

Pre-train

Train with pace prediction task on S3D-G, the default clip length is 64 and input video size is 224 x 224.

python train.py --rgb_prefix RGB_DIR --gpu 0,1,2,3 --bs 32 --lr 0.001 --height 256 --width 256 --crop_sz 224 --clip_len 64

Train with pace prediction task on c3d/r3d/r21d, the default clip length is 16 and input video size is 112 x 112.

python train.py --rgb_prefix RGB_DIR --gpu 0 --bs 30 --lr 0.001 --model c3d/r3d/r21d --height 128 --width 171 --crop_sz 112 --clip_len 16

Evaluation

To be updated...

Citation

If you find this work useful or use our code, please consider citing:

@InProceedings{Wang20,
  author       = "Jiangliu Wang and Jianbo Jiao and Yunhui Liu",
  title        = "Self-Supervised Video Representation Learning by Pace Prediction",
  booktitle    = "European Conference on Computer Vision",
  year         = "2020",
}

Acknowlegement

Part of our codes are adapted from S3D-G HowTO100M, we thank the authors for their contributions.

code for our ECCV-2020 paper: Self-supervised Video Representation Learning by Pace Prediction

Related tags

Overview

Video_Pace

Main idea:

Framework:

Requirements

Usage

Data preparation

Pre-train

Evaluation

Citation

Acknowlegement

Owner

Jiangliu Wang

PyTorch implementation of Towards Accurate Alignment in Real-time 3D Hand-Mesh Reconstruction (ICCV 2021).

A minimal TPU compatible Jax implementation of NeRF: Representing Scenes as Neural Radiance Fields for View Synthesis

Official repository for Automated Learning Rate Scheduler for Large-Batch Training (8th ICML Workshop on AutoML)

A Bayesian cognition approach for belief updating of correlation judgement through uncertainty visualizations

Official code for "Maximum Likelihood Training of Score-Based Diffusion Models", NeurIPS 2021 (spotlight)

Optimizing Value-at-Risk and Conditional Value-at-Risk of Black Box Functions with Lacing Values (LV)

Official implementation of the paper DeFlow: Learning Complex Image Degradations from Unpaired Data with Conditional Flows

Disentangled Lifespan Face Synthesis

Source code for CIKM 2021 paper for Relation-aware Heterogeneous Graph for User Profiling

Neural Message Passing for Computer Vision

A diff tool for language models

Radar-to-Lidar: Heterogeneous Place Recognition via Joint Learning

SpineAI Bilsky Grading With Python

Implementation of ConvMixer in TensorFlow and Keras

[ICCV21] Self-Calibrating Neural Radiance Fields

Repo for the Tutorials of Day1-Day3 of the Nordic Probabilistic AI School 2021 (https://probabilistic.ai/)

PyTorch implementation of DeepDream algorithm

In this project we combine techniques from neural voice cloning and musical instrument synthesis to achieve good results from as little as 16 seconds of target data.

Madanalysis5 - A package for event file analysis and recasting of LHC results

HarDNeXt: Official HarDNeXt repository