Image2PCL

Enter the metaverse with 2D image to 3D projections!
This is an implementation of an algorithm that projects 2D images into 3D space as point clouds.

The published code is inspired by the following works:
Monodepth2: https://www.github.com/nianticlabs/monodepth2
MMSegmentation: https://www.github.com/open-mmlab/mmsegmentation

Setup

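Assuming you have already set up an Anaconda environment with Python, PyTorch and CUDA, install the additional dependencies with the commands below. Replace {mmcv_version} with the mmcv-full version you need, and adjust the cu113/torch1.10.0 part of the URL to match your CUDA and PyTorch versions: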

pip install open3d
pip install mmcv-full=={mmcv_version} -f https://download.openmmlab.com/mmcv/dist/cu113/torch1.10.0/index.html
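The index URL in the mmcv-full command encodes a CUDA version and a PyTorch version. As a minimal sketch, the helper below rebuilds that URL from the two version strings, following the pattern of the URL shown above. The function name `mmcv_index_url` is hypothetical and not part of this project or of mmcv, and not every CUDA/PyTorch pair necessarily has published wheels, so check that the resulting index exists before relying on it.

```python
def mmcv_index_url(cuda_version: str, torch_version: str) -> str:
    """Build an mmcv-full wheel index URL for a CUDA/PyTorch pair.

    Hypothetical helper: it only mirrors the URL pattern used in the
    install command above and does not check that the index exists.
    """
    # "11.3" -> "cu113"
    cuda_tag = "cu" + cuda_version.replace(".", "")
    return (
        "https://download.openmmlab.com/mmcv/dist/"
        f"{cuda_tag}/torch{torch_version}/index.html"
    )


# In an environment with PyTorch installed, the two inputs can be read
# from torch.__version__ and torch.version.cuda.
url = mmcv_index_url("11.3", "1.10.0")
print(url)
```

With the versions from the command above, this prints the same URL that is passed to `-f`.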

Clone the mmsegmentation repository into your working directory:

git clone https://github.com/open-mmlab/mmsegmentation

Create a 'models' folder to store the trained models used for testing:

mkdir models

Trained KITTI models can be downloaded from the monodepth2 repository. This code was tested with the 'mono_640x192' model.
I also provide a custom-trained nuScenes model for testing with nuScenes images. This is helpful for multi-view point cloud rendering.

Test

To run a test, use images from a dataset with known camera intrinsics where possible. This implementation uses two datasets:

KITTI Raw for single-image testing
nuScenes for multi-view image testing
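Camera intrinsics matter because they determine how each pixel and its predicted depth map to a 3D point. The sketch below shows the standard pinhole back-projection with NumPy. It is not taken from img2pcl.py: the function name `backproject`, the toy intrinsic matrix `K` and the constant depth map are all assumptions made for illustration, and depth is assumed to be measured along the camera's z axis.

```python
import numpy as np


def backproject(depth: np.ndarray, K: np.ndarray) -> np.ndarray:
    """Lift an (H, W) depth map to an (H*W, 3) array of camera-frame points.

    Standard pinhole model: X = depth * K^-1 @ [u, v, 1]^T.
    """
    h, w = depth.shape
    # Pixel grid: u runs along columns (x), v along rows (y).
    u, v = np.meshgrid(np.arange(w), np.arange(h))
    pixels = np.stack([u, v, np.ones_like(u)], axis=0).reshape(3, -1)
    # Rays through each pixel, normalised so that z = 1.
    rays = np.linalg.inv(K) @ pixels
    # Scale each ray by the depth of its pixel.
    points = rays * depth.reshape(1, -1)
    return points.T


# Toy example (assumed values): 4x2 image, focal length 100 px,
# principal point at pixel (2, 1), constant depth of 5 units.
K = np.array([[100.0, 0.0, 2.0],
              [0.0, 100.0, 1.0],
              [0.0, 0.0, 1.0]])
depth = np.full((2, 4), 5.0)
points = backproject(depth, K)
print(points.shape)
```

The pixel at the principal point lands on the optical axis, so its point has x = y = 0 and z equal to its depth. With wrong intrinsics the same depth map would produce a stretched or skewed point cloud.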

To test on KITTI, run the following command, replacing each "<...>" placeholder (brackets included) with the actual value:

python img2pcl.py \
--image_path <path to single image file or folder containing single image> \
--model_path <path to trained KITTI model> \
--data_type kitti_raw

To test on nuScenes and view a 360-degree 3D point cloud, run the following command, replacing each "<...>" placeholder (brackets included) with the actual value:

python img2pcl.py \
--image_path <path to folder containing nuScenes multi-cam images> \
--model_path <path to trained nuScenes model> \
--data_type nuscenes \
--nusc_camera_parameters <path to a JSON file containing nuScenes camera intrinsics and extrinsics>
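The extrinsics are what allow point clouds from several cameras to be placed in one shared frame. The sketch below illustrates that step under stated assumptions: each camera's extrinsics are given as a 4x4 camera-to-common-frame matrix, and each view has already been lifted to an (N, 3) point array. The layout of the JSON file expected by `--nusc_camera_parameters` is not described here, so the sketch takes matrices directly. The names `to_common_frame` and `merge_views` are hypothetical and do not come from img2pcl.py.

```python
import numpy as np


def to_common_frame(points: np.ndarray, cam_to_common: np.ndarray) -> np.ndarray:
    """Apply a 4x4 rigid transform to an (N, 3) array of points."""
    # Append a column of ones to get homogeneous coordinates (N, 4).
    homogeneous = np.hstack([points, np.ones((points.shape[0], 1))])
    return (homogeneous @ cam_to_common.T)[:, :3]


def merge_views(clouds, transforms) -> np.ndarray:
    """Transform each per-camera cloud and stack them into one (M, 3) array."""
    return np.vstack(
        [to_common_frame(p, T) for p, T in zip(clouds, transforms)]
    )


# Toy example (assumed values): two cameras that each see one point
# 5 units ahead; the second camera sits 1 unit along x.
cloud = np.array([[0.0, 0.0, 5.0]])
first_cam = np.eye(4)
second_cam = np.eye(4)
second_cam[0, 3] = 1.0
merged = merge_views([cloud, cloud], [first_cam, second_cam])
print(merged)
```

In this toy case the merged cloud holds the first camera's point unchanged and the second camera's point shifted by one unit along x.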


Owner

Benjamin Ho
