Code for the paper "Spatio-temporal Self-Supervised Representation Learning for 3D Point Clouds" (ICCV 2021)

Last update: Jan 05, 2023

Related tags

Overview

Spatio-temporal Self-Supervised Representation Learning for 3D Point Clouds

This is the official code implementation for the paper "Spatio-temporal Self-Supervised Representation Learning for 3D Point Clouds" (ICCV 2021) paper

Checklist

Self-supervised Pre-training Framework

BYOL
SimCLR

Downstream Tasks

Shape Classification
Semantic Segmentation
Indoor Object Detection
Outdoor Object Detection

Installation

The code was tested with the following environment: Ubuntu 18.04, python 3.7, pytorch 1.7.1, torchvision 0.8.2 and CUDA 11.1.

For self-supervised pre-training, run the following command:

git clone https://github.com/yichen928/STRL.git
cd STRL
pip install -r requirements.txt

For downstream tasks, please refer to the Downstream Tasks section.

Datasets

Please download the used dataset with the following links:

ShapeNet: https://drive.google.com/uc?id=1sJd5bdCg9eOo3-FYtchUVlwDgpVdsbXB
ModelNet40: https://shapenet.cs.stanford.edu/media/modelnet40_normal_resampled.zip
ScanNet (subset): Please follow the instruction in their official website. The 25k frames subset is enough for our model.

Make sure to put the files in the following structure:

|-- ROOT
|	|-- BYOL
|		|-- data
|			|-- modelnet40_normal_resampled_cache
|			|-- shapenet57448xyzonly.npz
|			|-- scannet
|				|-- scannet_frames_25k

Pre-training

BYOL framework

Please run the following command:

python BYOL/train.py

You need to edit the config file BYOL/config/config.yaml to switch different backbone architectures (currently including BYOL-pointnet-cls, BYOL-dgcnn-cls, BYOL-dgcnn-semseg, BYOL-votenet-detection).

Pre-trained Models

You can find the checkpoints of the pre-training and downstream tasks in our Google Drive.

Linear Evaluation

For PointNet or DGCNN classification backbones, you may evaluate the learnt representation with linear SVM classifier by running the following command:

For PointNet:

python BYOL/evaluate_pointnet.py -w /path/to/your/pre-trained/checkpoints

For DGCNN:

python BYOL/evaluate_dgcnn.py -w /path/to/your/pre-trained/checkpoints

Downstream Tasks

Checkpoints Transformation

You can transform the pre-trained checkpoints to different downstream tasks by running:

For VoteNet:

python BYOL/transform_ckpt_votenet.py --input_path /path/to/your/pre-trained/checkpoints --output_path /path/to/the/transformed/checkpoints

For other backbones:

python BYOL/transform_ckpt.py --input_path /path/to/your/pre-trained/checkpoints --output_path /path/to/the/transformed/checkpoints

Fine-tuning and Evaluation for Downstream Tasks

For the fine-tuning and evaluation of downstream tasks, please refer to other corresponding repos. We sincerely thank all these authors for their nice work!

Classification: WangYueFt/dgcnn
Semantic Segmentation: AnTao97/dgcnn.pytorch
Indoor Object Detection: facebookresearch/votenet

Citation

If you found our paper or code useful for your research, please cite the following paper:

@article{huang2021spatio,
  title={Spatio-temporal Self-Supervised Representation Learning for 3D Point Clouds},
  author={Huang, Siyuan and Xie, Yichen and Zhu, Song-Chun and Zhu, Yixin},
  journal={arXiv preprint arXiv:2109.00179},
  year={2021}
}

Code for the paper "Spatio-temporal Self-Supervised Representation Learning for 3D Point Clouds" (ICCV 2021)

Related tags

Overview

Spatio-temporal Self-Supervised Representation Learning for 3D Point Clouds

Checklist

Self-supervised Pre-training Framework

Downstream Tasks

Installation

Datasets

Pre-training

BYOL framework

Pre-trained Models

Linear Evaluation

Downstream Tasks

Checkpoints Transformation

Fine-tuning and Evaluation for Downstream Tasks

Citation

Owner

Hesper

Seeing Dynamic Scene in the Dark: High-Quality Video Dataset with Mechatronic Alignment (ICCV2021)

FedMM: Saddle Point Optimization for Federated Adversarial Domain Adaptation

tsai is an open-source deep learning package built on top of Pytorch & fastai focused on state-of-the-art techniques for time series classification, regression and forecasting.

Optimal Adaptive Allocation using Deep Reinforcement Learning in a Dose-Response Study

The code is the training example of AAAI2022 Security AI Challenger Program Phase 8: Data Centric Robot Learning on ML models.

Distributional Sliced-Wasserstein distance code

Official Keras Implementation for UNet++ in IEEE Transactions on Medical Imaging and DLMIA 2018

Discovering and Achieving Goals via World Models

[AAAI22] Reliable Propagation-Correction Modulation for Video Object Segmentation

Rewrite ultralytics/yolov5 v6.0 opencv inference code based on numpy, no need to rely on pytorch

A repository that finds a person who looks like you by using face recognition technology.

Fully Adaptive Bayesian Algorithm for Data Analysis (FABADA) is a new approach of noise reduction methods. In this repository is shown the package developed for this new method based on \citepaper.

免费获取http代理并生成proxifier配置文件

Dynamic Bottleneck for Robust Self-Supervised Exploration

🔎 Super-scale your images and run experiments with Residual Dense and Adversarial Networks.

CountDown to New Year and shoot fireworks

CVPR 2021 Official Pytorch Code for UC2: Universal Cross-lingual Cross-modal Vision-and-Language Pre-training

A framework for annotating 3D meshes using the predictions of a 2D semantic segmentation model.

Simulator for FRC 2022 challenge: Rapid React

ppo_pytorch_cpp - an implementation of the proximal policy optimization algorithm for the C++ API of Pytorch