Learning Correspondence from the Cycle-consistency of Time (CVPR 2019)

Last update: Nov 29, 2022

Overview

TimeCycle

Code for Learning Correspondence from the Cycle-consistency of Time (CVPR 2019, Oral). The code was developed with PyTorch 0.4 and Python 2, and it also runs with PyTorch 1.0. This repo includes training code for learning semi-dense correspondence from unlabeled videos, and testing code that applies the learned correspondence to segmentation mask tracking in videos.

Citation

If you use our code in your research or wish to refer to the baseline results, please use the following BibTeX entry.

@inproceedings{CVPR2019_CycleTime,
    Author = {Xiaolong Wang and Allan Jabri and Alexei A. Efros},
    Title = {Learning Correspondence from the Cycle-Consistency of Time},
    Booktitle = {CVPR},
    Year = {2019},
}

Model and Result

Our trained model can be downloaded from here. The tracking performance on DAVIS-2017 for this model (without training on DAVIS-2017) is:

cropSize	J_mean	J_recall	J_decay	F_mean	F_recall	F_decay
320 x 320	0.419	0.409	0.272	0.394	0.336	0.328
400 x 400	0.430	0.437	0.296	0.426	0.413	0.356
480 x 480	0.464	0.500	0.332	0.500	0.480	0.379

Note that results improve at test time as the input image size "cropSize" in the script is increased, as the table above shows. The training and testing procedures for this model are described below.
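For readers unfamiliar with the metric columns, here is a minimal sketch of the region similarity behind J_mean, assuming the table follows the standard DAVIS definition (J is the intersection-over-union of the predicted and ground-truth masks, averaged over frames). The function name `jaccard` and the toy masks are illustrative only and are not taken from this repo's evaluation code.

```python
def jaccard(pred, gt):
    """Intersection-over-union of two binary masks (nested lists of 0/1)."""
    inter = 0
    union = 0
    for pred_row, gt_row in zip(pred, gt):
        for p, g in zip(pred_row, gt_row):
            if p and g:
                inter += 1
            if p or g:
                union += 1
    # Assumed convention: two empty masks count as a perfect match.
    if union == 0:
        return 1.0
    # float() keeps the division exact under Python 2 as well.
    return float(inter) / union


# Toy 2 x 3 masks (hypothetical data, not from DAVIS).
pred = [[1, 1, 0],
        [0, 1, 0]]
gt = [[1, 0, 0],
      [0, 1, 1]]
score = jaccard(pred, gt)  # 2 shared pixels out of 4 in the union
```

In practice the masks would be NumPy arrays and the per-frame scores would be averaged over each sequence; the official DAVIS evaluation code remains the reference for the reported numbers.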

Converting Our Model to Standard PyTorch ResNet-50

Please see convert_model.ipynb for converting our model to the standard PyTorch ResNet-50 model format.
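As a rough illustration of what such a conversion usually involves, the sketch below renames checkpoint keys by stripping a wrapper prefix so that they match the parameter names of a plain ResNet-50. This is an assumption about the general technique, not a description of convert_model.ipynb; the helper `strip_prefix`, the prefix `module.encoder.`, and the toy checkpoint are hypothetical, and the notebook remains the authoritative procedure.

```python
def strip_prefix(state_dict, prefix):
    """Keep entries whose key starts with `prefix`, with the prefix removed."""
    converted = {}
    for key, value in state_dict.items():
        if key.startswith(prefix):
            converted[key[len(prefix):]] = value
    return converted


# Hypothetical checkpoint: real values would be torch tensors, and the real
# key names may differ from the ones used here.
checkpoint = {
    "module.encoder.conv1.weight": "w0",
    "module.encoder.layer1.0.conv1.weight": "w1",
    "module.head.fc.weight": "w2",  # not part of the backbone, so dropped
}
converted = strip_prefix(checkpoint, "module.encoder.")
```

With real tensors, a dictionary like `converted` could then be passed to `torchvision.models.resnet50().load_state_dict(converted, strict=False)`, where `strict=False` tolerates layers that the checkpoint does not provide.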

Dataset Preparation

Please read DATASET.md for downloading and preparing the VLOG dataset for training and DAVIS dataset for testing.

Training

In train_video_cycle_simple.py in the home folder, set the input list to:

    params['filelist'] = 'YOUR_DATASET_FOLDER/vlog_frames_12fps.txt'

Then run the following code:

    python train_video_cycle_simple.py --checkpoint pytorch_checkpoints/release_model_simple

Testing

In test_davis.py in the home folder, set the input list to:

    params['filelist'] = 'YOUR_DATASET_FOLDER/davis/DAVIS/vallist.txt'

Set the dataset path YOUR_DATASET_FOLDER in run_test.sh. Then run the testing and evaluation code together:

    sh run_test.sh

Acknowledgements

weakalign by Ignacio Rocco, Relja Arandjelović and Josef Sivic.

inflated_convnets_pytorch by Yana Hasson.

pytorch-classification by Wei Yang.

Owner

Xiaolong Wang
