This repository contains a PyTorch implementation of "AD-NeRF: Audio Driven Neural Radiance Fields for Talking Head Synthesis".

Last update: Dec 29, 2022

Related tags

Deep Learning AD-NeRF

Overview

AD-NeRF: Audio Driven Neural Radiance Fields for Talking Head Synthesis

| Project Page | Paper |

PyTorch implementation for the paper "AD-NeRF: Audio Driven Neural Radiance Fields for Talking Head Synthesis"

Prerequisites

You can create an anaconda environment called adnerf with:

conda env create -f environment.yml
conda activate adnerf

PyTorch3D

Recommend install from a local clone

git clone https://github.com/facebookresearch/pytorch3d.git
cd pytorch3d && pip install -e .

Basel Face Model 2009

Put "01_MorphableModel.mat" to data_util/face_tracking/3DMM/; cd data_util/face_tracking; run
```
python convert_BFM.py
```

Train AD-NeRF

Data Preprocess ($id Obama for example)
```
bash process_data.sh Obama
```
- Input: A portrait video at 25fps containing voice audio. (dataset/vids/$id.mp4)
- Output: folder dataset/$id that contains all files for training
Train Two NeRFs (Head-NeRF and Torso-NeRF)
- Train Head-NeRF with command
```
python NeRFs/HeadNeRF/run_nerf.py --config dataset/$id/HeadNeRF_config.txt
```
- Copy latest trainied model from dataset/$id/logs/$id_head to dataset/$id/logs/$id_com
- Train Torso-NeRF with command
```
python NeRFs/TorsoNeRF/run_nerf.py --config dataset/$id/TorsoNeRF_config.txt
```

Run AD-NeRF for rendering

Reconstruct original video with audio input

python NeRFs/TorsoNeRF/run_nerf.py --config dataset/$id/TorsoNeRFTest_config.txt --aud_file=dataset/$id/aud.npy --test_size=300

Drive the target person with another audio input

python NeRFs/TorsoNeRF/run_nerf.py --config dataset/$id/TorsoNeRFTest_config.txt --aud_file=${deepspeechfile.npy} --test_size=-1

Acknowledgments

We use face-parsing.PyTorch for parsing head and torso maps, and DeepSpeech for audio feature extraction. The NeRF model is implemented based on NeRF-pytorch.

This repository contains a PyTorch implementation of "AD-NeRF: Audio Driven Neural Radiance Fields for Talking Head Synthesis".

Related tags

Overview

AD-NeRF: Audio Driven Neural Radiance Fields for Talking Head Synthesis

| Project Page | Paper |

Prerequisites

Train AD-NeRF

Run AD-NeRF for rendering

Acknowledgments

Owner

UPSNet: A Unified Panoptic Segmentation Network

One-Shot Neural Ensemble Architecture Search by Diversity-Guided Search Space Shrinking

MonoRCNN is a monocular 3D object detection method for automonous driving

Negative Sample is Negative in Its Own Way: Tailoring Negative Sentences forImage-Text Retrieval

Code & Data for Enhancing Photorealism Enhancement

Semantic Segmentation in Pytorch. Network include: FCN、FCN_ResNet、SegNet、UNet、BiSeNet、BiSeNetV2、PSPNet、DeepLabv3_plus、 HRNet、DDRNet

bespoke tooling for offensive security's Windows Usermode Exploit Dev course (OSED)

Fit Fast, Explain Fast

Generative Adversarial Networks for High Energy Physics extended to a multi-layer calorimeter simulation

Addition of pseudotorsion caclulation eta, theta, eta', and theta' to barnaba package

Source code for CVPR2022 paper "Abandoning the Bayer-Filter to See in the Dark"

nfelo: a power ranking, prediction, and betting model for the NFL

Using python and scikit-learn to make stock predictions

This repository is a basic Machine Learning train & validation Template (Using PyTorch)

FedTorch is an open-source Python package for distributed and federated training of machine learning models using PyTorch distributed API

Solving SMPL/MANO parameters from keypoint coordinates.

Code of Periodic Activation Functions Induce Stationarity

[ICCV21] Code for RetrievalFuse: Neural 3D Scene Reconstruction with a Database

Builds a LoRa radio frequency fingerprint identification (RFFI) system based on deep learning techiniques

Project page of the paper 'Analyzing Perception-Distortion Tradeoff using Enhanced Perceptual Super-resolution Network' (ECCVW 2018)