DFA-NeRF: Personalized Talking Head Generation via Disentangled Face Attributes Neural Rendering

Yao, Shunyu; Zhong, RuiZhe; Yan, Yichao; Zhai, Guangtao; Yang, Xiaokang

Computer Science > Computer Vision and Pattern Recognition

arXiv:2201.00791 (cs)

[Submitted on 3 Jan 2022]

Title:DFA-NeRF: Personalized Talking Head Generation via Disentangled Face Attributes Neural Rendering

Authors:Shunyu Yao, RuiZhe Zhong, Yichao Yan, Guangtao Zhai, Xiaokang Yang

View PDF

Abstract:While recent advances in deep neural networks have made it possible to render high-quality images, generating photo-realistic and personalized talking head remains challenging. With given audio, the key to tackling this task is synchronizing lip movement and simultaneously generating personalized attributes like head movement and eye blink. In this work, we observe that the input audio is highly correlated to lip motion while less correlated to other personalized attributes (e.g., head movements). Inspired by this, we propose a novel framework based on neural radiance field to pursue high-fidelity and personalized talking head generation. Specifically, neural radiance field takes lip movements features and personalized attributes as two disentangled conditions, where lip movements are directly predicted from the audio inputs to achieve lip-synchronized generation. In the meanwhile, personalized attributes are sampled from a probabilistic model, where we design a Transformer-based variational autoencoder sampled from Gaussian Process to learn plausible and natural-looking head pose and eye blink. Experiments on several benchmarks demonstrate that our method achieves significantly better results than state-of-the-art methods.

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2201.00791 [cs.CV]
	(or arXiv:2201.00791v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2201.00791

Submission history

From: Shunyu Yao [view email]
[v1] Mon, 3 Jan 2022 18:23:38 UTC (21,230 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.CV

< prev | next >

new | recent | 2022-01

Change to browse by:

References & Citations

DBLP - CS Bibliography

listing | bibtex

Shunyu Yao
Yichao Yan
Guangtao Zhai
Xiaokang Yang

export BibTeX citation

Computer Science > Computer Vision and Pattern Recognition

Title:DFA-NeRF: Personalized Talking Head Generation via Disentangled Face Attributes Neural Rendering

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:DFA-NeRF: Personalized Talking Head Generation via Disentangled Face Attributes Neural Rendering

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators