3D-Reconstruction 基于深度学习方法的单目多视图三维重建

Last update: Dec 26, 2022

Related tags

Deep Learning 3D-Reconstruction

Overview

基于深度学习方法的单目多视图三维重建

Part I 三维重建

代码：Part1

技术文档：[Markdown] [PDF]

原始图像：Original Images

点云结果：Point Cloud Results-1

效果图：

Part II 基于计算机视觉方法的点云到点云窗户识别

代码：Part2

技术文档：[Markdown] [PDF]

点云结果：Point Cloud Results-2

算法流程图：

Part III 基于ResNest的图像到点云的语义分割

代码：Part3

技术文档：[Markdown] [PDF]

语义分割结果：Semantic Segmentation Results

点云结果：Point Cloud Results-3

效果图：

参考文献

AA-RMVSNet [arXiv] [CVF] [PDF]

Wei Z, Zhu Q, Min C, et al. Aa-rmvsnet: Adaptive aggregation recurrent multi-view stereo network[C]//Proceedings of the IEEE/CVF International Conference on Computer Vision. 2021: 6187-6196.

Cascade-MVSNet [arXiv] [CVF] [PDF]

Gu X, Fan Z, Zhu S, et al. Cascade cost volume for high-resolution multi-view stereo and stereo matching[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2020: 2495-2504.

TransMVSNet [arXiv] [PDF]

Ding Y, Yuan W, Zhu Q, et al. TransMVSNet: Global Context-aware Multi-view Stereo Network with Transformers[J]. arXiv preprint arXiv:2111.14600, 2021.

LoFTR [arXiv] [CVF] [PDF]

Sun J, Shen Z, Wang Y, et al. LoFTR: Detector-free local feature matching with transformers[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2021: 8922-8931.

PatchmatchNet [arXiv] [CVF] [PDF]

Wang F, Galliani S, Vogel C, et al. PatchmatchNet: Learned Multi-View Patchmatch Stereo[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2021: 14194-14203.

ResNeSt [arXiv] [PDF]

Zhang H, Wu C, Zhang Z, et al. Resnest: Split-attention networks[J]. arXiv preprint arXiv:2004.08955, 2020.

致谢

稀疏重建部分使用Colmap完成相机参数的获取。

稠密重建部分的代码主要来源于AA-RMVSNet。

点云切割与可视化使用CloudCompare及Meshlab完成。

调用Open3D进行表面重建。

Cascade+Transformer的代码主要基于kwea123实现的pytorch-lightning版本的Cascade-MVSNetl以及LoFTR进行实现。

窗户识别算法中部分思路参考了Color Space的矩形识别算法，图像处理技术主要基于冈萨雷斯的数字图像处理（第三版）。

语义分割部分调用了PyTorch-Encoding。

Implementation for Paper "Inverting Generative Adversarial Renderer for Face Reconstruction"

StyleGAR TODO: add arxiv link Implementation of Inverting Generative Adversarial Renderer for Face Reconstruction TODO: for test Currently, some model

155 Oct 27, 2022

Code release for paper: The Boombox: Visual Reconstruction from Acoustic Vibrations

The Boombox: Visual Reconstruction from Acoustic Vibrations Boyuan Chen, Mia Chiquier, Hod Lipson, Carl Vondrick Columbia University Project Website |

12 Nov 30, 2022

[WACV 2020] Reducing Footskate in Human Motion Reconstruction with Ground Contact Constraints

Reducing Footskate in Human Motion Reconstruction with Ground Contact Constraints Official implementation for Reducing Footskate in Human Motion Recon

38 Nov 1, 2022

TSDF++: A Multi-Object Formulation for Dynamic Object Tracking and Reconstruction

TSDF++: A Multi-Object Formulation for Dynamic Object Tracking and Reconstruction TSDF++ is a novel multi-object TSDF formulation that can encode mult

130 Dec 29, 2022

Research code for CVPR 2021 paper "End-to-End Human Pose and Mesh Reconstruction with Transformers"

MeshTransformer ✨ This is our research code of End-to-End Human Pose and Mesh Reconstruction with Transformers. MEsh TRansfOrmer is a simple yet effec

473 Dec 31, 2022

Official implementation of "SinIR: Efficient General Image Manipulation with Single Image Reconstruction" (ICML 2021)

SinIR (Official Implementation) Requirements To install requirements: pip install -r requirements.txt We used Python 3.7.4 and f-strings which are in

47 Oct 11, 2022

MonoRec: Semi-Supervised Dense Reconstruction in Dynamic Environments from a Single Moving Camera

494 Jan 6, 2023

The code for the CVPR 2021 paper Neural Deformation Graphs, a novel approach for globally-consistent deformation tracking and 3D reconstruction of non-rigid objects.

Neural Deformation Graphs Project Page | Paper | Video Neural Deformation Graphs for Globally-consistent Non-rigid Reconstruction Aljaž Božič, Pablo P

134 Dec 16, 2022

Code for "LASR: Learning Articulated Shape Reconstruction from a Monocular Video". CVPR 2021.

LASR Installation Build with conda conda env create -f lasr.yml conda activate lasr # install softras cd third_party/softras; python setup.py install;

157 Dec 26, 2022

Releases(7)

7(Feb 16, 2022)

White mesh generated by Neus
Source code(tar.gz)
Source code(zip)
dongbeiya_neus.ply(11.21 MB)
gym_north_neus.ply(21.28 MB)
gym_south_neus.ply(16.59 MB)
6(Feb 16, 2022)

White mesh generated by Colmap and Meshlab
Source code(tar.gz)
Source code(zip)
dongbeiya.ply(19.11 MB)
dongbeiya.png(8.45 MB)
gym_north.ply(31.93 MB)
gym_north.png(8.73 MB)
gym_south.ply(26.97 MB)
gym_south.png(9.32 MB)
5(Dec 29, 2021)

Original images for reconstruction
Source code(tar.gz)
Source code(zip)
PIC2.zip(755.68 MB)
PIC2.z01(900.00 MB)
PIC2.z02(900.00 MB)
dby.zip(735.16 MB)
dby.z02(900.00 MB)
dby.z01(900.00 MB)
4(Dec 19, 2021)

Semantic Segmentation Results of Problem 3
Source code(tar.gz)
Source code(zip)
filtered_segmentation_result_dongbeiya.zip(661.17 MB)
filtered_segmentation_result_gym.zip(786.65 MB)
segmentation_result_dongbeiya.zip(64.31 MB)
segmentation_result_dongbeiya_block.zip(53.27 MB)
segmentation_result_gym.zip(4.72 MB)
3(Dec 19, 2021)

Point Cloud Results of Problem 3
Source code(tar.gz)
Source code(zip)
2(Dec 19, 2021)

Point Cloud Results of Problem 2
Source code(tar.gz)
Source code(zip)
gym_south_window.ply(627.30 MB)
gym_north_window.ply(808.62 MB)
dongbeiya_window.ply(1800.53 MB)
gym_window.ply(1603.31 MB)
1(Dec 19, 2021)

Point Cloud Results of Problem 1
Source code(tar.gz)
Source code(zip)
dongbeiya.ply(731.13 MB)
gym_south.ply(696.19 MB)
gym_north.ply(707.89 MB)
gym.ply(1404.08 MB)

Owner

HMT_Curo

GitHub Repository

Official code for "EagerMOT: 3D Multi-Object Tracking via Sensor Fusion" [ICRA 2021]

EagerMOT: 3D Multi-Object Tracking via Sensor Fusion Read our ICRA 2021 paper here. Check out the 3 minute video for the quick intro or the full prese

276 Dec 30, 2022

The repository for freeCodeCamp's YouTube course, Algorithmic Trading in Python

Algorithmic Trading in Python This repository Course Outline Section 1: Algorithmic Trading Fundamentals What is Algorithmic Trading? The Differences

1.8k Jan 02, 2023

Official code for MPG2: Multi-attribute Pizza Generator: Cross-domain Attribute Control with Conditional StyleGAN

This is the official code for Multi-attribute Pizza Generator (MPG2): Cross-domain Attribute Control with Conditional StyleGAN. Paper Demo Setup Envir

5 Sep 01, 2022

[ICRA 2022] CaTGrasp: Learning Category-Level Task-Relevant Grasping in Clutter from Simulation

This is the official implementation of our paper: Bowen Wen, Wenzhao Lian, Kostas Bekris, and Stefan Schaal. "CaTGrasp: Learning Category-Level Task-R

199 Jan 04, 2023

李云龙二次元风格化!打滚卖萌，使用了animeGANv2进行了视频的风格迁移

李云龙二次元风格化！一键star、fork，你也可以生成这样的团长！打滚卖萌求star求fork! 0.效果展示视频效果前往B站观看效果最佳：李云龙二次元风格化： github开源repo：李云龙二次元风格化百度AIstudio开源地址,一键fork即可运行: 李云龙二次元风格化！一键fork

44 Dec 04, 2022

Honours project, on creating a depth estimation map from two stereo images of featureless regions

image-processing This module generates depth maps for shape-blocked-out images Install If working with anaconda, then from the root directory: conda e

2 Oct 17, 2022

Trainable Bilateral Filter Layer (PyTorch)

Trainable Bilateral Filter Layer (PyTorch) This repository contains our GPU-accelerated trainable bilateral filter layer (three spatial and one range

26 Dec 25, 2022

Pytorch implementation of paper Semi-supervised Knowledge Transfer for Deep Learning from Private Training Data

31 Nov 20, 2022

An open source AutoML toolkit for automate machine learning lifecycle, including feature engineering, neural architecture search, model compression and hyper-parameter tuning.

NNI Doc | 简体中文 NNI (Neural Network Intelligence) is a lightweight but powerful toolkit to help users automate Feature Engineering, Neural Architecture

12.4k Dec 31, 2022

A Vision Transformer approach that uses concatenated query and reference images to learn the relationship between query and reference images directly.

24 Dec 13, 2022

3D-Reconstruction 基于深度学习方法的单目多视图三维重建

Related tags

Overview

基于深度学习方法的单目多视图三维重建

Part I 三维重建

Part II 基于计算机视觉方法的点云到点云窗户识别

Part III 基于ResNest的图像到点云的语义分割

参考文献

致谢

You might also like...

Implementation for Paper "Inverting Generative Adversarial Renderer for Face Reconstruction"

Code release for paper: The Boombox: Visual Reconstruction from Acoustic Vibrations

[WACV 2020] Reducing Footskate in Human Motion Reconstruction with Ground Contact Constraints

TSDF++: A Multi-Object Formulation for Dynamic Object Tracking and Reconstruction

Research code for CVPR 2021 paper "End-to-End Human Pose and Mesh Reconstruction with Transformers"

Official implementation of "SinIR: Efficient General Image Manipulation with Single Image Reconstruction" (ICML 2021)

MonoRec: Semi-Supervised Dense Reconstruction in Dynamic Environments from a Single Moving Camera

The code for the CVPR 2021 paper Neural Deformation Graphs, a novel approach for globally-consistent deformation tracking and 3D reconstruction of non-rigid objects.

Code for "LASR: Learning Articulated Shape Reconstruction from a Monocular Video". CVPR 2021.

Releases(7)

7(Feb 16, 2022)

6(Feb 16, 2022)

5(Dec 29, 2021)

4(Dec 19, 2021)

3(Dec 19, 2021)

2(Dec 19, 2021)

1(Dec 19, 2021)

Owner

HMT_Curo

Official code for "EagerMOT: 3D Multi-Object Tracking via Sensor Fusion" [ICRA 2021]

The repository for freeCodeCamp's YouTube course, Algorithmic Trading in Python

Official code for MPG2: Multi-attribute Pizza Generator: Cross-domain Attribute Control with Conditional StyleGAN

[ICRA 2022] CaTGrasp: Learning Category-Level Task-Relevant Grasping in Clutter from Simulation

李云龙二次元风格化!打滚卖萌，使用了animeGANv2进行了视频的风格迁移

Honours project, on creating a depth estimation map from two stereo images of featureless regions

Trainable Bilateral Filter Layer (PyTorch)

Pytorch implementation of paper Semi-supervised Knowledge Transfer for Deep Learning from Private Training Data

An open source AutoML toolkit for automate machine learning lifecycle, including feature engineering, neural architecture search, model compression and hyper-parameter tuning.

Code from PropMix, accepted at BMVC'21

SAFL: A Self-Attention Scene Text Recognizer with Focal Loss

Official Repository for Machine Learning class - Physics Without Frontiers 2021

Implementation of Learning Gradient Fields for Molecular Conformation Generation (ICML 2021).

Face Recognition plus identification simply and fast | Python

Code for paper: Group-CAM: Group Score-Weighted Visual Explanations for Deep Convolutional Networks

Command-line tool for downloading and extending the RedCaps dataset.

Simple (but Strong) Baselines for POMDPs

Deep learning models for classification of 15 common weeds in the southern U.S. cotton production systems.

The official implementation of VAENAR-TTS, a VAE based non-autoregressive TTS model.

A Vision Transformer approach that uses concatenated query and reference images to learn the relationship between query and reference images directly.