Code for "PV-RAFT: Point-Voxel Correlation Fields for Scene Flow Estimation of Point Clouds", CVPR 2021

Last update: Dec 05, 2022

Related tags

Overview

PV-RAFT

This repository contains the PyTorch implementation for paper "PV-RAFT: Point-Voxel Correlation Fields for Scene Flow Estimation of Point Clouds" (CVPR 2021)[arXiv]

Installation

Prerequisites

Python 3.8
PyTorch 1.8
torch-scatter
CUDA 10.2
RTX 2080 Ti
tqdm, tensorboard, scipy, imageio, png

conda create -n pvraft python=3.8
conda install pytorch torchvision torchaudio cudatoolkit=10.2 -c pytorch
conda install tqdm tensorboard scipy imageio
pip install pypng
pip install torch-scatter -f https://pytorch-geometric.com/whl/torch-1.8.0+cu102.html

Usage

Data Preparation

We follow HPLFlowNet to prepare FlyingThings3D and KITTI datasets. Please refer to repo. Make sure the project structure look like this:

RAFT_SceneFlow/
    data/
        FlyingThings3D_subset_processed_35m/
        kitti_processed/
    data_preprocess/
    datasets/
    experiments/
    model/
    modules/
    tools/

After downloading datasets, we need to preprocess them.

FlyingThings3D Dataset

python process_flyingthings3d_subset.py --raw_data_path=path_src/FlyingThings3D_subset --save_path=path_dst/FlyingThings3D_subset_processed_35m --only_save_near_pts

You should replace raw_data_path and save_path with your own setting.

KITTI Dataset

python process_kitti.py --raw_data_path=path_src/kitti --save_path=path_dst/kitti_processed --calib_path=calib_folder_path

You should replace raw_data_path, save_path and calib_path with your own setting.

Train

python train.py --exp_path=pv_raft --batch_size=2 --gpus=0,1 --num_epochs=20 --max_points=8192 --iters=8  --root=./

where exp_path is the experiment folder name and root is the project root path. These 20 epochs take about 53 hours on two RTX 2080 Ti.

If you want to train the refine model, please add --refine and specify --weights parameter as the directory name of the pre-trained model. For example,

python train.py --refine --exp_path=pv_raft_finetune --batch_size=2 --gpus=0,1 --num_epochs=10 --max_points=8192 --iters=32 --root=./ --weights=./experiments/pv_raft/checkpoints/best_checkpoint.params

These 10 epochs take about 38 hours on two RTX 2080 Ti.

Test

python test.py --dataset=KITTI --exp_path=pv_raft --gpus=1 --max_points=8192 --iters=8 --root=./ --weights=./experiments/pv_raft/checkpoints/best_checkpoint.params

where dataset should be chosen from FT3D/KITTI, and weights is the absolute path of checkpoint file.

If you want to test the refine model, please add --refine. For example,

python test.py --refine --dataset=KITTI --exp_path=pv_raft_finetune --gpus=1 --max_points=8192 --iters=32 --root=./ --weights=./experiments/pv_raft_finetune/checkpoints/best_checkpoint.params

Reproduce results

You can download the checkpoint of refined model here.

Acknowledgement

Our code is based on FLOT. We also refer to RAFT and HPLFlowNet.

Citation

If you find our work useful in your research, please consider citing:

@inproceedings{wei2020pv,
  title={{PV-RAFT: Point-Voxel Correlation Fields for Scene Flow Estimation of Point Clouds}},
  author={Wei, Yi and Wang, Ziyi and Rao, Yongming and Lu, Jiwen and Zhou, Jie},
  booktitle={CVPR},
  year={2021}
}

Code for "PV-RAFT: Point-Voxel Correlation Fields for Scene Flow Estimation of Point Clouds", CVPR 2021

Related tags

Overview

PV-RAFT

Installation

Prerequisites

Usage

Data Preparation

FlyingThings3D Dataset

KITTI Dataset

Train

Test

Reproduce results

Acknowledgement

Citation

Owner

Yi Wei

PyTorch Implementation for Fracture Detection in Wrist Bone X-ray Images

A Fast Knowledge Distillation Framework for Visual Recognition

Dahua Camera and Doorbell Home Assistant Integration

GUI for TOAD-GAN, a PCG-ML algorithm for Token-based Super Mario Bros. Levels.

Get started learning C# with C# notebooks powered by .NET Interactive and VS Code.

Code for "Optimizing risk-based breast cancer screening policies with reinforcement learning"

Optimal Camera Position for a Practical Application of Gaze Estimation on Edge Devices,

Rede Neural Convolucional feita durante o processo seletivo do Laboratório de Inteligência Artificial da FACOM (UFMS)

Faster Convex Lipschitz Regression

A 3D sparse LBM solver implemented using Taichi

Investigating automatic navigation towards standard US views integrating MARL with the virtual US environment developed in CT2US simulation

Consensus Learning from Heterogeneous Objectives for One-Class Collaborative Filtering

DEMix Layers for Modular Language Modeling

机器学习、深度学习、自然语言处理等人工智能基础知识总结。

Machine learning Bot detection technique, based on United States election dataset

My tensorflow implementation of "A neural conversational model", a Deep learning based chatbot

Code for "Learning Structural Edits via Incremental Tree Transformations" (ICLR'21)

World Models with TensorFlow 2

Revisiting Weakly Supervised Pre-Training of Visual Perception Models

Keras Image Embeddings using Contrastive Loss