VolumeGAN - 3D-aware Image Synthesis via Learning Structural and Textural Representations

Last update: Dec 26, 2022

Related tags

Overview

VolumeGAN - 3D-aware Image Synthesis via Learning Structural and Textural Representations

3D-aware Image Synthesis via Learning Structural and Textural Representations
Yinghao Xu, Sida Peng, Ceyuan Yang, Yujun Shen, Bolei Zhou
arXiv preprint arXiv:

[Paper] [Project Page] [Demo]

This paper aims at achieving high-fidelity 3D-aware images synthesis. We propose a novel framework, termed as VolumeGAN, for synthesizing images under different camera views, through explicitly learning a structural representation and a textural representation. We first learn a feature volume to represent the underlying structure, which is then converted to a feature field using a NeRF-like model. The feature field is further accumulated into a 2D feature map as the textural representation, followed by a neural renderer for appearance synthesis. Such a design enables independent control of the shape and the appearance. Extensive experiments on a wide range of datasets show that our approach achieves sufficiently higher image quality and better 3D control than the previous methods.

Qualitative Results

Independent control of structure (shape) and texture (appearance).

Comparison to prior work on various datasets.

Code Coming Soon

BibTeX

@article{xu2021volumegan,
  title   = {3D-aware Image Synthesis via Learning Structural and Textural Representations},
  author  = {Xu, Yinghao and Peng, Sida and Yang, Ceyuan and Shen, Yujun and Zhou, Bolei},
  article = {arXiv preprint arXiv:2112.10759},
  year    = {2021}
}

VolumeGAN - 3D-aware Image Synthesis via Learning Structural and Textural Representations

Related tags

Overview

VolumeGAN - 3D-aware Image Synthesis via Learning Structural and Textural Representations

Qualitative Results

Code Coming Soon

BibTeX

Owner

GenForce: May Generative Force Be with You

Implementation of fast algorithms for Maximum Spanning Tree (MST) parsing that includes fast ArcMax+Reweighting+Tarjan algorithm for single-root dependency parsing.

CIFS: Improving Adversarial Robustness of CNNs via Channel-wise Importance-based Feature Selection

Machine Learning Toolkit for Kubernetes

1st Solution For NeurIPS 2021 Competition on ML4CO Dual Task

This repository contains the needed resources to build the HIRID-ICU-Benchmark dataset

Consensus Learning from Heterogeneous Objectives for One-Class Collaborative Filtering

PyTorch implementation of Higher Order Recurrent Space-Time Transformer

Official PyTorch implementation of PS-KD

Implementation of the GVP-Transformer, which was used in the paper "Learning inverse folding from millions of predicted structures" for de novo protein design alongside Alphafold2

The code repository for EMNLP 2021 paper "Vision Guided Generative Pre-trained Language Models for Multimodal Abstractive Summarization".

Transfer Learning Remote Sensing

This repository contains an overview of important follow-up works based on the original Vision Transformer (ViT) by Google.

Type4Py: Deep Similarity Learning-Based Type Inference for Python

This is project is the implementation of the DeepShift: Towards Multiplication-Less Neural Networks paper

The code uses SegFormer for Semantic Segmentation on Drone Dataset.

C3D is a modified version of BVLC caffe to support 3D ConvNets.

Official implementation of the paper Momentum Capsule Networks (MoCapsNet)

BEAMetrics: Benchmark to Evaluate Automatic Metrics in Natural Language Generation

GT China coal model

NFNets and Adaptive Gradient Clipping for SGD implemented in PyTorch