Official PyTorch Implementation of Unsupervised Learning of Scene Flow Estimation Fusing with Local Rigidity

Last update: Nov 16, 2022

Overview

UnRigidFlow

This is the official PyTorch implementation of UnRigidFlow (IJCAI2019).

Here are two sample results of our unsupervised models (each GIF is about 10 MB), one on KITTI 15 and one on Cityscapes.

If you find this repo useful in your research, please consider citing:

@inproceedings{Liu:2019:unrigid, 
title = {Unsupervised Learning of Scene Flow Estimation Fusing with Local Rigidity}, 
author = {Liang Liu and Guangyao Zhai and Wenlong Ye and Yong Liu},
booktitle = {International Joint Conference on Artificial Intelligence, IJCAI}, 
year = {2019}
}

Requirements

This codebase was developed and tested with Python 3.5, PyTorch >= 0.4.1, OpenCV 3.4, CUDA 9.0, and Ubuntu 16.04.

Most of the Python packages can be installed with:

pip3 install -r requirements.txt

In addition, the optimized correlation layer with its CUDA kernel must be compiled manually:

cd <correlation_package>
python3 setup.py install

Then add <correlation_package> to $PYTHONPATH.
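To confirm that the compiled extension is visible to Python, a small check like the one below may help. It is only a sketch: `add_to_path` and `is_importable` are hypothetical helpers, not part of this repository, and the real module name depends on what the correlation package's `setup.py` builds.

```python
import importlib.util
import sys


def add_to_path(directory):
    """Prepend a directory to sys.path for this process, avoiding duplicates."""
    if directory not in sys.path:
        sys.path.insert(0, directory)


def is_importable(module_name):
    """Return True if a top-level module with this name can be located."""
    return importlib.util.find_spec(module_name) is not None
```

For example, call `add_to_path(...)` with the directory of the correlation package, then `is_importable(...)` with the extension's module name. Both values are left as placeholders here because the README does not state them. Setting $PYTHONPATH in the shell has the same effect for every process, whereas `add_to_path` only affects the current interpreter.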

Note that if you use PyTorch >= 1.0, you need to make a few changes before compiling; see NVIDIA/flownet2-pytorch#98.

Replace #include <torch/torch.h> with #include <torch/extension.h>, add #include <ATen/cuda/CUDAContext.h>, and then replace every at::globalContext().getCurrentCUDAStream() with at::cuda::getCurrentCUDAStream().
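The substitutions above can also be applied with a short script instead of by hand. The following is a minimal sketch, not part of this repository: `patch_source` and `patch_tree` are hypothetical helpers, the list of file suffixes is an assumption, and the extra header is only added to files that contain the torch include.

```python
from pathlib import Path

OLD_INCLUDE = "#include <torch/torch.h>"
EXT_INCLUDE = "#include <torch/extension.h>"
CUDA_INCLUDE = "#include <ATen/cuda/CUDAContext.h>"
OLD_STREAM = "at::globalContext().getCurrentCUDAStream()"
NEW_STREAM = "at::cuda::getCurrentCUDAStream()"

# Assumption: the sources to patch use one of these suffixes.
SOURCE_SUFFIXES = {".cc", ".cpp", ".cu", ".h", ".cuh"}


def patch_source(text):
    """Apply the three substitutions to the text of one source file."""
    text = text.replace(OLD_INCLUDE, EXT_INCLUDE)
    # Add the CUDA context header once, right after the extension header.
    if EXT_INCLUDE in text and CUDA_INCLUDE not in text:
        text = text.replace(EXT_INCLUDE, EXT_INCLUDE + "\n" + CUDA_INCLUDE, 1)
    return text.replace(OLD_STREAM, NEW_STREAM)


def patch_tree(root):
    """Patch every matching file under root in place; return the changed paths."""
    changed = []
    for path in sorted(Path(root).rglob("*")):
        if path.is_file() and path.suffix in SOURCE_SUFFIXES:
            original = path.read_text()
            patched = patch_source(original)
            if patched != original:
                path.write_text(patched)
                changed.append(path)
    return changed
```

Running `patch_tree(...)` on the correlation package directory rewrites the affected files in place, so keeping a backup or a clean checkout is advisable. Applying it a second time leaves the files unchanged.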

Training and Evaluation

We mainly focus on the KITTI benchmark. To train the model, you need to download all of the KITTI raw data and calibration files. To validate the models, you also need the KITTI 2012 and KITTI 2015 training files with their calibration files [1], [2].

Complete training consists of three steps:

Train the flow model separately:

python3 train.py -c configs/KITTI_flow.json

Train the depth model separately:

python3 train.py -c configs/KITTI_depth_stereo.json

Train the flow and depth models jointly:

python3 train.py -c configs/KITTI_rigid_flow_stereo.json

For evaluation, add the --e option to the commands above and set the corresponding model path.
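The three training commands, and their evaluation variants with --e, can be chained from a small driver script. This is a sketch under stated assumptions: `build_command` and `run_stages` are hypothetical helpers rather than part of this repository, the script is assumed to run from the repository root, and it does not edit the model paths inside the JSON configs.

```python
import subprocess

# Config paths copied from the three training commands above.
STAGE_CONFIGS = (
    "configs/KITTI_flow.json",
    "configs/KITTI_depth_stereo.json",
    "configs/KITTI_rigid_flow_stereo.json",
)


def build_command(config, evaluate=False, python="python3"):
    """Return the argv list for one training (or evaluation) run."""
    cmd = [python, "train.py", "-c", config]
    if evaluate:
        cmd.append("--e")
    return cmd


def run_stages(configs=STAGE_CONFIGS, evaluate=False, dry_run=True):
    """Run the stages in order, stopping at the first failure.

    With dry_run=True the commands are only printed and returned.
    """
    commands = [build_command(c, evaluate=evaluate) for c in configs]
    for cmd in commands:
        print(" ".join(cmd))
        if not dry_run:
            # check=True raises CalledProcessError if a stage fails.
            subprocess.run(cmd, check=True)
    return commands
```

Calling `run_stages()` only prints the commands; `run_stages(dry_run=False)` executes them one after another. The default is a dry run so that a long training job is never started by accident.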

Pre-trained Models

You can download our pre-trained models. We provide the following:

KITTI_flow: The optical flow network trained separately on KITTI raw data (from scratch).
KITTI_stereo_depth: The stereo depth network trained on KITTI raw data.
KITTI_flow_joint: The optical flow network trained jointly with stereo depth on KITTI raw data.

Acknowledgement

This repository borrows snippets from several great works, including PWC-Net, monodepth, UnFlow, UnDepthFlow, and DF-Net. Although most of these are TensorFlow implementations, we are grateful to the authors for sharing their work, which saved us a lot of time.

Owner

Liang Liu
