Official PyTorch Implementation of paper "Deep 3D Mask Volume for View Synthesis of Dynamic Scenes", ICCV 2021.

Last update: Oct 12, 2022

Related tags

Deep Learning deep-3dmask

Overview

Deep 3D Mask Volume for View Synthesis of Dynamic Scenes

Official PyTorch Implementation of paper "Deep 3D Mask Volume for View Synthesis of Dynamic Scenes", ICCV 2021.

Kai-En Lin¹, Lei Xiao², Feng Liu², Guowei Yang¹, Ravi Ramamoorthi¹

¹University of California, San Diego, ²Facebook Reality Labs

Requirements

Install required packages

Make sure you have up-to-date NVIDIA drivers supporting CUDA 11.1 (10.2 could work but need to change cudatoolkit package accordingly)

Run

conda env create -f environment.yml
conda activate video_viewsynth

Usage

Rendering

Download our pretrained checkpoint and testing data. Extract the content to [path_to_data_directory]. It contains frames and background folders, as well as poses_bounds.npy.
In configs, setup data path by changing render_video.txt

root_dir should point to the frames folder mentioned in 1. and bg_dir should point to background folder.

out_dir can be your desired output folder.

ckpt_path should be the pretrained checkpoint path.
Run python render_llff_video.py --config [config_file_path]

e.g. python render_llff_video.py --config ../configs/render_video.txt

(Optional) For your own data, please run prepare_data.sh

sh render.sh [frame_folder] [starting_frame] [ending_frame] [output_folder_name]

Make sure your data is in this structure before running
```
[frame_folder] --- cam00 --- 00000.jpg
                |         |- 00001.jpg
                |         ...
                |- cam01
                |- cam02
                ...
                |- poses_bounds.npy
```
e.g. sh render.sh ~/deep_3d_data/frames 0 20 qual

Training

Train MPI

Download RealEstate10K dataset and extract the frames. There are scripts in preprocessing folder which can be used to generate the data.

The order should be download_data.py -> extract_frames.py -> compress_data.py.

Remember to change the path in compress_data.py.
Change the paths in config file train_realestate10k.txt

Run

cd train_mpi
python train.py --config ../configs/train_realestate10k.txt

Train Mask

Once MPI is trained, we can use the checkpoint to train 3D mask network.

Download dataset
Change the paths in config file train_mask.txt

Run

cd train_mask
python train.py --config ../configs/train_mask.txt

Citation

@inproceedings {lin2021deep,
    title = {Deep 3D Mask Volume for View Synthesis of Dynamic Scenes},
    author = {Kai-En Lin and Lei Xiao and Feng Liu and Guowei Yang and Ravi Ramamoorthi},
    booktitle = {ICCV},
    year = {2021},
}

Official PyTorch Implementation of paper "Deep 3D Mask Volume for View Synthesis of Dynamic Scenes", ICCV 2021.

Related tags

Overview

Deep 3D Mask Volume for View Synthesis of Dynamic Scenes

Requirements

Install required packages

Usage

Rendering

Training

Train MPI

Train Mask

Citation

Owner

Ken Lin

Differentiable Surface Triangulation

Back to Basics: Efficient Network Compression via IMP

Automated Melanoma Recognition in Dermoscopy Images via Very Deep Residual Networks

Official PyTorch implementation of "ArtFlow: Unbiased Image Style Transfer via Reversible Neural Flows"

(ICCV 2021) Official code of "Dressing in Order: Recurrent Person Image Generation for Pose Transfer, Virtual Try-on and Outfit Editing."

ReferFormer - Official Implementation of ReferFormer

Code for paper "Context-self contrastive pretraining for crop type semantic segmentation"

A large-scale video dataset for the training and evaluation of 3D human pose estimation models

Inference code for "StylePeople: A Generative Model of Fullbody Human Avatars" paper. This code is for the part of the paper describing video-based avatars.

Official Implementation for HyperStyle: StyleGAN Inversion with HyperNetworks for Real Image Editing

Code to use Augmented Shapiro Wilks Stopping, as well as code for the paper "Statistically Signifigant Stopping of Neural Network Training"

Reaction SMILES-AA mapping via language modelling

Creating a custom CNN hypertunned architeture for the Fashion MNIST dataset with Python, Keras and Tensorflow.

DiSECt: Differentiable Simulator for Robotic Cutting

CVPR2022 paper "Dense Learning based Semi-Supervised Object Detection"

Fast mesh denoising with data driven normal filtering using deep variational autoencoders

Supervised domain-agnostic prediction framework for probabilistic modelling

pyhsmm - library for approximate unsupervised inference in Bayesian Hidden Markov Models (HMMs) and explicit-duration Hidden semi-Markov Models (HSMMs), focusing on the Bayesian Nonparametric extensions, the HDP-HMM and HDP-HSMM, mostly with weak-limit approximations.

Pytorch Implementation of "Diagonal Attention and Style-based GAN for Content-Style disentanglement in image generation and translation" (ICCV 2021)

Mall-Customers-Segmentation - Customer Segmentation Using K-Means Clustering