Code for AA-RMVSNet: Adaptive Aggregation Recurrent Multi-view Stereo Network (ICCV 2021).

Last update: Dec 30, 2022

Overview

AA-RMVSNet

Code for AA-RMVSNet: Adaptive Aggregation Recurrent Multi-view Stereo Network (ICCV 2021) in PyTorch.

paper link: arXiv | CVF

Change Log

Jun 17, 2021: Initialize repo
Jun 27, 2021: Update code
Aug 10, 2021: Update paper link
Oct 14, 2021: Update bibtex

Data Preparation

Download the preprocessed DTU training data (also available at BaiduYun, PW: s2v2).
For other datasets, please follow the practice in Yao Yao's MVSNet repo.
The pretrained model is provided. Place it under ./checkpoints/.
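
As a minimal sketch of the last step, assuming the pretrained model downloads as a single file, it can be copied into place as shown below. The helper name `place_checkpoint` and the filename `model.ckpt` are hypothetical; use whatever filename the download actually provides.

```shell
# Copy a downloaded checkpoint into ./checkpoints/ (created if missing).
# The helper name and the example filename are illustrative, not part of the repo.
place_checkpoint() {
    src="$1"
    dest_dir="${2:-./checkpoints}"
    if [ ! -f "$src" ]; then
        echo "checkpoint not found: $src" >&2
        return 1
    fi
    mkdir -p "$dest_dir" && cp "$src" "$dest_dir/"
}

# Example (hypothetical filename):
# place_checkpoint ~/Downloads/model.ckpt
```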

How to run

Install required dependencies:

```
conda create -n drmvsnet python=3.6
conda activate drmvsnet
conda install pytorch==1.1.0 torchvision==0.3.0 cudatoolkit=10.0 -c pytorch
conda install -c conda-forge py-opencv plyfile tensorboardx
```

Set the dataset root directories as environment variables in env.sh.
Train AA-RMVSNet on DTU dataset (note that training requires a large amount of GPU memory):
```
./scripts/train_dtu.sh
```
Predict depth maps and fuse them to get point clouds of DTU:
```
./scripts/eval_dtu.sh
./scripts/fusion_dtu.sh
```
Predict depth maps and fuse them to get point clouds of Tanks and Temples:
```
./scripts/eval_tnt.sh
./scripts/fusion_tnt.sh
```

Note: if permission issues are encountered, try chmod +x <script_filename> to allow execution.
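
For the dataset-root step above, the following is a rough sketch of what an env.sh-style file might contain, together with a small sanity check to run before launching the scripts. The variable names (`DTU_TRAINING`, `DTU_TESTING`, `TNT_TESTING`), the paths, and the helper `check_dataset_roots` are assumptions for illustration only; the names that matter are the ones the scripts under ./scripts/ actually read.

```shell
# Sketch of an env.sh-style file. The variable names and paths are
# placeholders; use the names the scripts in ./scripts actually read.
export DTU_TRAINING="/path/to/dtu_training"
export DTU_TESTING="/path/to/dtu_testing"
export TNT_TESTING="/path/to/tanksandtemples"

# Report any listed variable that is unset or does not point to a directory.
check_dataset_roots() {
    status=0
    for name in "$@"; do
        eval "value=\${$name:-}"
        if [ -z "$value" ] || [ ! -d "$value" ]; then
            echo "dataset root $name is unset or not a directory: '$value'" >&2
            status=1
        fi
    done
    return "$status"
}
```

Calling `check_dataset_roots DTU_TRAINING DTU_TESTING TNT_TESTING` after sourcing the file returns a non-zero status and prints the offending variable if any path is missing, which is cheaper than discovering a bad path partway through a training run.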

Citation

```
@inproceedings{wei2021aa,
  title={AA-RMVSNet: Adaptive Aggregation Recurrent Multi-view Stereo Network},
  author={Wei, Zizhuang and Zhu, Qingtian and Min, Chen and Chen, Yisong and Wang, Guoping},
  booktitle={Proceedings of the IEEE/CVF International Conference on Computer Vision},
  pages={6187--6196},
  year={2021}
}
```

Acknowledgements

This repository is heavily based on Xiaoyang Guo's PyTorch implementation.

Owner

Qingtian Zhu
