VolumeGAN - 3D-aware Image Synthesis via Learning Structural and Textural Representations

Last update: Dec 26, 2022

Related tags

Overview

VolumeGAN - 3D-aware Image Synthesis via Learning Structural and Textural Representations

3D-aware Image Synthesis via Learning Structural and Textural Representations
Yinghao Xu, Sida Peng, Ceyuan Yang, Yujun Shen, Bolei Zhou
arXiv preprint arXiv:

[Paper] [Project Page] [Demo]

This paper aims at achieving high-fidelity 3D-aware images synthesis. We propose a novel framework, termed as VolumeGAN, for synthesizing images under different camera views, through explicitly learning a structural representation and a textural representation. We first learn a feature volume to represent the underlying structure, which is then converted to a feature field using a NeRF-like model. The feature field is further accumulated into a 2D feature map as the textural representation, followed by a neural renderer for appearance synthesis. Such a design enables independent control of the shape and the appearance. Extensive experiments on a wide range of datasets show that our approach achieves sufficiently higher image quality and better 3D control than the previous methods.

Qualitative Results

Independent control of structure (shape) and texture (appearance).

Comparison to prior work on various datasets.

Code Coming Soon

BibTeX

@article{xu2021volumegan,
  title   = {3D-aware Image Synthesis via Learning Structural and Textural Representations},
  author  = {Xu, Yinghao and Peng, Sida and Yang, Ceyuan and Shen, Yujun and Zhou, Bolei},
  article = {arXiv preprint arXiv:2112.10759},
  year    = {2021}
}

VolumeGAN - 3D-aware Image Synthesis via Learning Structural and Textural Representations

Related tags

Overview

VolumeGAN - 3D-aware Image Synthesis via Learning Structural and Textural Representations

Qualitative Results

Code Coming Soon

BibTeX

Owner

GenForce: May Generative Force Be with You

Cortex-compatible model server for Python and TensorFlow

i-RevNet Pytorch Code

Delving into Localization Errors for Monocular 3D Object Detection, CVPR'2021

Implementation of neural class expression synthesizers

Conflict-aware Inference of Python Compatible Runtime Environments with Domain Knowledge Graph, ICSE 2022

Implementation of parameterized soft-exponential activation function.

AnimationKit: AI Upscaling & Interpolation using Real-ESRGAN+RIFE

ADB-IP-ROTATION - Use your mobile phone to gain a temporary IP address using ADB and data tethering

Official implementation of the ICCV 2021 paper: "The Power of Points for Modeling Humans in Clothing".

Implementation of the state-of-the-art vision transformers with tensorflow

Gym-TORCS is the reinforcement learning (RL) environment in TORCS domain with OpenAI-gym-like interface.

Milano is a tool for automating hyper-parameters search for your models on a backend of your choice.

BYOL for Audio: Self-Supervised Learning for General-Purpose Audio Representation

Examples of how to create colorful, annotated equations in Latex using Tikz.

Text-to-Music Retrieval using Pre-defined/Data-driven Emotion Embeddings

A short and easy PyTorch implementation of E(n) Equivariant Graph Neural Networks

Implementation of the Triangle Multiplicative module, used in Alphafold2 as an efficient way to mix rows or columns of a 2d feature map, as a standalone package for Pytorch

Classification of EEG data using Deep Learning

A repository for generating stylized talking 3D and 3D face

[CVPR 2021] Counterfactual VQA: A Cause-Effect Look at Language Bias