MVSDF - Learning Signed Distance Field for Multi-view Surface Reconstruction

Last update: Dec 20, 2022

Related tags

Overview

MVSDF - Learning Signed Distance Field for Multi-view Surface Reconstruction

This is the official implementation for the ICCV 2021 paper Learning Signed Distance Field for Multi-view Surface Reconstruction

In this work, we introduce a novel neural surface reconstruction framework that leverages the knowledge of stereo matching and feature consistency to optimize the implicit surface representation. More specifically, we apply a signed distance field (SDF) and a surface light field to represent the scene geometry and appearance respectively. The SDF is directly supervised by geometry from stereo matching, and is refined by optimizing the multi-view feature consistency and the fidelity of rendered images. Our method is able to improve the robustness of geometry estimation and support reconstruction of complex scene topologies. Extensive experiments have been conducted on DTU, EPFL and Tanks and Temples datasets. Compared to previous state-of-the-art methods, our method achieves better mesh reconstruction in wide open scenes without masks as input.

How to Use

Environment Setup

The code is tested in the following environment (manually installed packages only). The newer version of the packages should also be fine.

dependencies:
  - cudatoolkit=10.2.89
  - numpy=1.19.2
  - python=3.8.8
  - pytorch=1.7.1
  - tqdm=4.60.0
  - pip:
    - cvxpy==1.1.12
    - gputil==1.4.0
    - imageio==2.9.0
    - open3d==0.13.0
    - opencv-python==4.5.1.48
    - pyhocon==0.3.57
    - scikit-image==0.18.3
    - scikit-learn==0.24.2
    - trimesh==3.9.13
    - pybind11==2.9.0

Data Preparation

Download preprocessed DTU datasets from here

Training

cd code
python training/exp_runner.py --data_dir <DATA_DIR>/scan<SCAN>/imfunc4 --batch_size 8 --nepoch 1800 --expname dtu_<SCAN>

The results will be written in exps/mvsdf_dtu_.

Trained Models

Download trained models and put them in exps folder. This set of models achieve the following results.

	Chamfer	PSNR
24	0.846	24.67
37	1.894	20.15
40	0.895	25.15
55	0.435	23.19
63	1.067	26.24
65	0.903	26.9
69	0.746	26.54
83	1.241	25.15
97	1.009	25.71
105	1.320	26.48
106	0.867	28.81
110	0.842	23.16
114	0.340	27.51
118	0.472	28.46
122	0.466	27.71
Mean	0.890	25.72

Testing

python evaluation/eval.py --data_dir <DATA_DIR>/scan<SCAN>/imfunc4 --expname dtu_<SCAN> [--eval_rendering]

add --eval_rendering flag to generate and evaluate rendered images. The results will be written in evals/mvsdf_dtu_.

Trimming

cd mesh_cut
python setup.py build_ext -i  # compile
python mesh_cut.py 
    
    
      [--thresh 15 --smooth 10]

Note that this part of code can only be used for research purpose. Please refer to mesh_cut/IBFS/license.txt

Evaluation

Apart from the official implementation, you can also use my re-implemented evaluation script.

Citation

If you find our work useful in your research, please kindly cite

@article{zhang2021learning,
	title={Learning Signed Distance Field for Multi-view Surface Reconstruction},
	author={Zhang, Jingyang and Yao, Yao and Quan, Long},
	journal={International Conference on Computer Vision (ICCV)},
	year={2021}
}

MVSDF - Learning Signed Distance Field for Multi-view Surface Reconstruction

Related tags

Overview

MVSDF - Learning Signed Distance Field for Multi-view Surface Reconstruction

How to Use

Environment Setup

Data Preparation

Training

Trained Models

Testing

Trimming

Evaluation

Citation

Owner

code for TCL: Vision-Language Pre-Training with Triple Contrastive Learning, CVPR 2022

[NeurIPS 2021] "Delayed Propagation Transformer: A Universal Computation Engine towards Practical Control in Cyber-Physical Systems"

Pytorch Implementation of paper "Noisy Natural Gradient as Variational Inference"

Unified MultiWOZ evaluation scripts for the context-to-response task.

PyTorch implementation of Barlow Twins.

DrNAS: Dirichlet Neural Architecture Search

Official implementation of "Not only Look, but also Listen: Learning Multimodal Violence Detection under Weak Supervision" ECCV2020

Walk with fastai

Continuous Augmented Positional Embeddings (CAPE) implementation for PyTorch

Official PyTorch implementation of the Fishr regularization for out-of-distribution generalization

Resources for the "Evaluating the Factual Consistency of Abstractive Text Summarization" paper

Learning the Beauty in Songs: Neural Singing Voice Beautifier; ACL 2022 (Main conference); Official code

A Lightweight Experiment & Resource Monitoring Tool 📺

PyTorch Implementation of CycleGAN and SSGAN for Domain Transfer (Minimal)

[CVPR 2022 Oral] TubeDETR: Spatio-Temporal Video Grounding with Transformers

Library for machine learning stacking generalization.

Related resources for our EMNLP 2021 paper

Alias-Free Generative Adversarial Networks (StyleGAN3) Official PyTorch implementation

An implementation of EWC with PyTorch

Implementation of H-Transformer-1D, Hierarchical Attention for Sequence Learning