One-Shot Free-View Neural Talking-Head Synthesis for Video Conferencing

Wang, Ting-Chun; Mallya, Arun; Liu, Ming-Yu

arXiv:2011.15126v2 (cs)

[Submitted on 30 Nov 2020 (v1), revised 30 Mar 2021 (this version, v2), latest version 2 Apr 2021 (v3)]

Abstract: We propose a neural talking-head video synthesis model and demonstrate its application to video conferencing. Our model learns to synthesize a talking-head video using a source image containing the target person's appearance and a driving video that dictates the motion in the output. Our motion is encoded based on a novel keypoint representation, where the identity-specific and motion-related information is decomposed unsupervisedly. Extensive experimental validation shows that our model outperforms competing methods on benchmark datasets. Moreover, our compact keypoint representation enables a video conferencing system that achieves the same visual quality as the commercial H.264 standard while only using one-tenth of the bandwidth. Besides, we show our keypoint representation allows the user to rotate the head during synthesis, which is useful for simulating face-to-face video conferencing experiences.

Comments:	CVPR 2021 camera ready (oral).
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2011.15126 [cs.CV]
	(or arXiv:2011.15126v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2011.15126

Submission history

From: Ting-Chun Wang
[v1] Mon, 30 Nov 2020 18:56:35 UTC (6,645 KB)
[v2] Tue, 30 Mar 2021 17:54:33 UTC (7,621 KB)
[v3] Fri, 2 Apr 2021 23:37:06 UTC (7,619 KB)
