[CVPR 2021] Few-shot 3D Point Cloud Semantic Segmentation

Last update: Dec 27, 2022

Related tags

Overview

Few-shot 3D Point Cloud Semantic Segmentation

Created by Na Zhao from National University of Singapore

Introduction

This repository contains the PyTorch implementation for our CVPR 2021 Paper "Few-shot 3D Point Cloud Semantic Segmentation" by Na Zhao, Tat-Seng Chua, Gim Hee Lee.

Many existing approaches for point cloud semantic segmentation are fully supervised. These fully supervised approaches heavily rely on a large amount of labeled training data that is difficult to obtain and can not generalize to unseen classes after training. To mitigate these limitations, we propose a novel attention-aware multi-prototype transductive few-shot point cloud semantic segmentation method to segment new classes given a few labeled examples. Specifically, each class is represented by multiple prototypes to model the complex data distribution of 3D point clouds. Subsequently, we employ a transductive label propagation method to exploit the affinities between labeled multi-prototypes and unlabeled query points, and among the unlabeled query points. Furthermore, we design an attention-aware multi-level feature learning network to learn the discriminative features that capture the semantic correlations and geometric dependencies between points. Our proposed method shows significant and consistent improvements compared to the baselines in different few-shot point cloud segmentation settings (i.e. 2/3-way 1/5-shot) on two benchmark datasets.

Installation

Install python --This repo is tested with python 3.6.8.
Install pytorch with CUDA -- This repo is tested with torch 1.4.0, CUDA 10.1. It may work with newer versions, but that is not gauranteed.
Install faiss with cpu version

Install 'torch-cluster' with the corrreponding torch and cuda version

 pip install torch-cluster==latest+cu101 -f https://pytorch-geometric.com/whl/torch-1.5.0.html

Install dependencies

pip install tensorboard h5py transforms3d

Usage

Data preparation

S3DIS

Download S3DIS Dataset Version 1.2.
Re-organize raw data into npy files by running
```
cd ./preprocess
python collect_s3dis_data.py --data_path $path_to_S3DIS_raw_data
```
The generated numpy files are stored in ./datasets/S3DIS/scenes/ by default.
To split rooms into blocks, run

python ./preprocess/room2blocks.py --data_path ./datasets/S3DIS/scenes/

One folder named blocks_bs1_s1 will be generated under ./datasets/S3DIS/ by default.

ScanNet

Download ScanNet V2.
Re-organize raw data into npy files by running
```
cd ./preprocess
python collect_scannet_data.py --data_path $path_to_ScanNet_raw_data
```
The generated numpy files are stored in ./datasets/ScanNet/scenes/ by default.
To split rooms into blocks, run

python ./preprocess/room2blocks.py --data_path ./datasets/ScanNet/scenes/ --dataset scannet

One folder named blocks_bs1_s1 will be generated under ./datasets/ScanNet/ by default.

Running

Training

First, pretrain the segmentor which includes feature extractor module on the available training set:

cd scripts
bash pretrain_segmentor.sh

Second, train our method:

bash train_attMPTI.sh

Evaluation

bash eval_attMPTI.sh

Note that the above scripts are used for 2-way 1-shot on S3DIS (S^0). You can modified the corresponding hyperparameters to conduct experiments on other settings.

Citation

Please cite our paper if it is helpful to your research:

@inproceedings{zhao2021few,
  title={Few-shot 3D Point Cloud Semantic Segmentation},
  author={Zhao, Na and Chua, Tat-Seng and Lee, Gim Hee},
  booktitle={Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition},
  year={2021}
}

Acknowledgement

We thank DGCNN (pytorch) for sharing their source code.

[CVPR 2021] Few-shot 3D Point Cloud Semantic Segmentation

Related tags

Overview

Few-shot 3D Point Cloud Semantic Segmentation

Introduction

Installation

Usage

Data preparation

S3DIS

ScanNet

Running

Training

Evaluation

Citation

Acknowledgement

Owner

The source code of the paper "Understanding Graph Neural Networks from Graph Signal Denoising Perspectives"

A collection of metrics for evaluating timbre dissimilarity using the TorchMetrics API

Notebook and code to synthesize complex and highly dimensional datasets using Gretel APIs.

SuperSDR: multiplatform KiwiSDR + CAT transceiver integrator

A PyTorch implementation of " EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks."

Apply our monocular depth boosting to your own network!

The official implementation of the CVPR2021 paper: Decoupled Dynamic Filter Networks

Official Pytorch implementation of "Unbiased Classification Through Bias-Contrastive and Bias-Balanced Learning (NeurIPS 2021)

Source code for paper "Deep Superpixel-based Network for Blind Image Quality Assessment"

Combinatorial model of ligand-receptor binding

Multi-Target Adversarial Frameworks for Domain Adaptation in Semantic Segmentation

Scalable machine learning based time series forecasting

Use CLIP to represent video for Retrieval Task

An open source implementation of CLIP.

Code for the TPAMI paper: "Syntax Customized Video Captioning by Imitating Exemplar Sentences"

Convolutional 2D Knowledge Graph Embeddings resources

A multi-mode modulator for multi-domain few-shot classification (ICCV)

MSG-Transformer: Exchanging Local Spatial Information by Manipulating Messenger Tokens

Deep learning PyTorch library for time series forecasting, classification, and anomaly detection