[ICCV, 2021] Cloud Transformers: A Universal Approach To Point Cloud Processing Tasks

Last update: Dec 15, 2022

Related tags

Overview

Cloud Transformers: A Universal Approach To Point Cloud Processing Tasks

This is an official PyTorch code repository of the paper "Cloud Transformers: A Universal Approach To Point Cloud Processing Tasks " (ICCV, 2021).

Here, we present a versatile point cloud processing block that yields state-of-the-art results on many tasks.
The key idea is to process point clouds with many cheap low-dimensional different projections followed by standard convolutions. And we do so both in parallel and sequentially.

Datasets

We provide links to the datasets we used to train/evaluate. After unpacking and preparation, please edit the dataset path (data:path field) in configs/*.yaml

Pre-trained models

We provide our pre-trained models' weights in a single archive.

Building Dependencies

To install and build all the modules required, please run:

bash ./install_deps.sh

Code Structure

In layers/cloud_transform.py the core operations are implemented (rasterization Splat and de-rasterization Slice). While in layers\mutihead_ct_*.py we provide slightly different versions of Multi-Headed Cloud Transform (MHCT).

The model zoo is situated in model_zoo, where the models for corresponding tasks are constructed of Multi-Headed Cloud Transforms.

Run

We train our models in multi-GPU setting using DistributedDataParallel. To train on n GPUs, please run the following commands:

python train_${SCRIPT_NAME}.py ${EXP_NAME} -c configs/${CONFIG_NAME}.yaml --master localhost:3315 --rank 0 --num_nodes n
...
python train_${SCRIPT_NAME}.py ${EXP_NAME} -c configs/${CONFIG_NAME}.yaml --master localhost:3315 --rank  --num_nodes n

The semantics for evaluation scripts is almost the same:

python eval_${SCRIPT_NAME}.py ${EXP_NAME} -c configs/eval/${CONFIG_NAME}.yaml

Cite

If you find our work helpful, please do not hesitate to cite us.

@inproceedings{mazur2021cloudtransformers,
  title={Cloud Transformers: A Universal Approach To Point Cloud Processing Tasks},
  author={Mazur, Kirill and Lempitsky, Victor},
  booktitle={International Conference on Computer Vision (ICCV)},
  year={2021}
}

[ICCV, 2021] Cloud Transformers: A Universal Approach To Point Cloud Processing Tasks

Related tags

Overview

Cloud Transformers: A Universal Approach To Point Cloud Processing Tasks

Datasets

Pre-trained models

Building Dependencies

Code Structure

Run

Cite

Owner

Visual Understanding Lab @ Samsung AI Center Moscow

This is a GUI for scrapping PDFs with the help of optical character recognition making easier than ever to scrape PDFs.

Perspective recovery of text using transformed ellipses

Convolutional Recurrent Neural Network (CRNN) for image-based sequence recognition.

A simple document layout analysis using Python-OpenCV

Distort a video using Seam Carving (video) and Vibrato effect (sound)

Python-based tools for document analysis and OCR

Program created with opencv that allows you to automatically count your repetitions on several fitness exercises.

An official PyTorch implementation of the paper "Learning by Aligning: Visible-Infrared Person Re-identification using Cross-Modal Correspondences", ICCV 2021.

Table recognition inside douments using neural networks

Code for the paper "DewarpNet: Single-Image Document Unwarping With Stacked 3D and 2D Regression Networks" (ICCV '19)

This is a Computer vision package that makes its easy to run Image processing and AI functions. At the core it uses OpenCV and Mediapipe libraries.

([email protected]) Boosting Co-teaching with Compression Regularization for Label Noise

PyNeuro is designed to connect NeuroSky's MindWave EEG device to Python and provide Callback functionality to provide data to your application in real time.

Text recognition (optical character recognition) with deep learning methods.

Motion Detection Squid Game with OpenCV Python

In this project we will be using the live feed coming from the webcam to create a virtual mouse with complete functionalities.

Primary QPDF source code and documentation

Fast style transfer

Code for the head detector (HeadHunter) proposed in our CVPR 2021 paper Tracking Pedestrian Heads in Dense Crowd.

The CIS OCR PostCorrectionTool