Code release for NeX: Real-time View Synthesis with Neural Basis Expansion

Last update: Dec 20, 2022

Related tags

Overview

NeX: Real-time View Synthesis with Neural Basis Expansion

Project Page | Video | Paper | COLAB | Shiny Dataset

We present NeX, a new approach to novel view synthesis based on enhancements of multiplane image (MPI) that can reproduce NeXt-level view-dependent effects---in real time. Unlike traditional MPI that uses a set of simple RGBα planes, our technique models view-dependent effects by instead parameterizing each pixel as a linear combination of basis functions learned from a neural network. Moreover, we propose a hybrid implicit-explicit modeling strategy that improves upon fine detail and produces state-of-the-art results. Our method is evaluated on benchmark forward-facing datasets as well as our newly-introduced dataset designed to test the limit of view-dependent modeling with significantly more challenging effects such as the rainbow reflections on a CD. Our method achieves the best overall scores across all major metrics on these datasets with more than 1000× faster rendering time than the state of the art.

TL;DR
Installation
Dataset
Training
Rendering
Citation

Getting started

conda env create -f environment.yml
./download_demo_data.sh
conda activate nex
python train.py -scene data/crest_demo -model_dir crest -http
tensorboard --logdir runs/

Installation

We provide environment.yml to help you setup a conda environment.

conda env create -f environment.yml

Dataset

Shiny dataset

Download: Shiny dataset.

We provide 2 directories named shiny and shiny_extended.

shiny contains benchmark scenes used to report the scores in our paper.
shiny_extended contains additional challenging scenes used on our website project page and video

NeRF's real forward-facing dataset

Download: Undistorted front facing dataset

For real forward-facing dataset, NeRF is trained with the raw images, which may contain lens distortion. But we use the undistorted images provided by COLMAP.

However, you can try running other scenes from Local lightfield fusion (Eg. airplant) without any changes in the dataset files. In this case, the images are not automatically undistorted.

Deepview's spaces dataset

Download: Modified spaces dataset

We slightly modified the file structure of Spaces dataset in order to determine the plane placement and split train/test sets.

Using your own images.

Running NeX on your own images. You need to install COLMAP on your machine.

Then, put your images into a directory following this structure

<scene_name>
|-- images
     | -- image_name1.jpg
     | -- image_name2.jpg
     ...

The training code will automatically prepare a scene for you. You may have to tune planes.txt to get better reconstruction (see dataset explaination)

Training

Run with the paper's config

python train.py -scene ${PATH_TO_SCENE} -model_dir ${MODEL_TO_SAVE_CHECKPOINT} -http

This implementation uses scikit-image to resize images during training by default. The results and scores in the paper are generated using OpenCV's resize function. If you want the same behavior, please add -cv2resize argument.

Note that this code is tested on an Nvidia V100 32GB and 4x RTX 2080Ti GPU.

For a GPU/GPUs with less memory (e.g., a single RTX 2080Ti), you can run using the following command:

python train.py -scene ${PATH_TO_SCENE} -model_dir ${MODEL_TO_SAVE_CHECKPOINT} -http -layers 12 -sublayers 6 -hidden 256

Note that when your GPU runs ouut of memeory, you can try reducing the number of layers, sublayers, and sampled rays.

Rendering

To generate a WebGL viewer and a video result.

python train.py -scene ${scene} -model_dir ${MODEL_TO_SAVE_CHECKPOINT} -predict -http

Video rendering

To generate a video that matches the real forward-facing rendering path, add -nice_llff argument, or -nice_shiny for shiny dataset

Citation

@inproceedings{Wizadwongsa2021NeX,
    author = {Wizadwongsa, Suttisak and Phongthawee, Pakkapon and Yenphraphai, Jiraphon and Suwajanakorn, Supasorn},
    title = {NeX: Real-time View Synthesis with Neural Basis Expansion},
    booktitle = {IEEE Conference on Computer Vision and Pattern Recognition (CVPR)}, 
    year = {2021},
}

Code release for NeX: Real-time View Synthesis with Neural Basis Expansion

Related tags

Overview

NeX: Real-time View Synthesis with Neural Basis Expansion

Project Page | Video | Paper | COLAB | Shiny Dataset

Table of contents

Getting started

Installation

Dataset

Shiny dataset

NeRF's real forward-facing dataset

Deepview's spaces dataset

Using your own images.

Training

Rendering

Video rendering

Citation

Visit us 🦉

Owner

PAIRED in PyTorch 🔥

TextBPN Adaptive Boundary Proposal Network for Arbitrary Shape Text Detection

Official code for our EMNLP2021 Outstanding Paper MindCraft: Theory of Mind Modeling for Situated Dialogue in Collaborative Tasks

Official PyTorch implementation of DD3D: Is Pseudo-Lidar needed for Monocular 3D Object detection? (ICCV 2021), Dennis Park, Rares Ambrus, Vitor Guizilini, Jie Li, and Adrien Gaidon.

More than a hundred strange attractors

WSDM2022 "A Simple but Effective Bidirectional Extraction Framework for Relational Triple Extraction"

PyTorch implementation of DARDet: A Dense Anchor-free Rotated Object Detector in Aerial Images

[ICCV-2021] An Empirical Study of the Collapsing Problem in Semi-Supervised 2D Human Pose Estimation

An elaborate and exhaustive paper list for Named Entity Recognition (NER)

Deep Inertial Prediction (DIPr)

Official Python implementation of the FuzionCoin protocol

Sequence to Sequence Models with PyTorch

PaRT: Parallel Learning for Robust and Transparent AI

The source code of the paper "SHGNN: Structure-Aware Heterogeneous Graph Neural Network"

Interactive dimensionality reduction for large datasets

official code for dynamic convolution decomposition

Motion Planner Augmented Reinforcement Learning for Robot Manipulation in Obstructed Environments (CoRL 2020)

Python scripts for performing lane detection using the LSTR model in ONNX

Sparse Progressive Distillation: Resolving Overfitting under Pretrain-and-Finetune Paradigm

Free course that takes you from zero to Reinforcement Learning PRO 🦸🏻‍🦸🏽

Code release for NeX: Real-time View Synthesis with Neural Basis Expansion

Related tags

Overview

NeX: Real-time View Synthesis with Neural Basis Expansion

Project Page | Video | Paper | COLAB | Shiny Dataset

Table of contents

Getting started

Installation

Dataset

Shiny dataset

NeRF's real forward-facing dataset

Deepview's spaces dataset

Using your own images.

Training

Rendering

Video rendering

Citation

Visit us 🦉

Owner

PAIRED in PyTorch 🔥

TextBPN Adaptive Boundary Proposal Network for Arbitrary Shape Text Detection

Official code for our EMNLP2021 Outstanding Paper MindCraft: Theory of Mind Modeling for Situated Dialogue in Collaborative Tasks

Official PyTorch implementation of DD3D: Is Pseudo-Lidar needed for Monocular 3D Object detection? (ICCV 2021), Dennis Park*, Rares Ambrus*, Vitor Guizilini, Jie Li, and Adrien Gaidon.

More than a hundred strange attractors

WSDM2022 "A Simple but Effective Bidirectional Extraction Framework for Relational Triple Extraction"

PyTorch implementation of DARDet: A Dense Anchor-free Rotated Object Detector in Aerial Images

[ICCV-2021] An Empirical Study of the Collapsing Problem in Semi-Supervised 2D Human Pose Estimation

An elaborate and exhaustive paper list for Named Entity Recognition (NER)

Deep Inertial Prediction (DIPr)

Official Python implementation of the FuzionCoin protocol

Sequence to Sequence Models with PyTorch

PaRT: Parallel Learning for Robust and Transparent AI

The source code of the paper "SHGNN: Structure-Aware Heterogeneous Graph Neural Network"

Interactive dimensionality reduction for large datasets

official code for dynamic convolution decomposition

Motion Planner Augmented Reinforcement Learning for Robot Manipulation in Obstructed Environments (CoRL 2020)

Python scripts for performing lane detection using the LSTR model in ONNX

Sparse Progressive Distillation: Resolving Overfitting under Pretrain-and-Finetune Paradigm

Free course that takes you from zero to Reinforcement Learning PRO 🦸🏻‍🦸🏽

Official PyTorch implementation of DD3D: Is Pseudo-Lidar needed for Monocular 3D Object detection? (ICCV 2021), Dennis Park, Rares Ambrus, Vitor Guizilini, Jie Li, and Adrien Gaidon.