WildAvatar: Web-scale In-the-wild Video Dataset for 3D Avatar Creation

Huang, Zihao; Hu, Shoukang; Wang, Guangcong; Liu, Tianqi; Zang, Yuhang; Cao, Zhiguo; Li, Wei; Liu, Ziwei

arXiv:2407.02165 (cs)

[Submitted on 2 Jul 2024 (v1), last revised 14 Jul 2024 (this version, v3)]

Abstract: Existing human datasets for avatar creation are typically limited to laboratory environments, where high-quality annotations (e.g., SMPL estimates from 3D scans or multi-view images) can be obtained under ideal conditions. However, their annotation requirements are impractical for real-world images or videos, which poses challenges for applying current avatar creation methods in real-world settings. To this end, we propose WildAvatar, a web-scale in-the-wild human avatar creation dataset extracted from YouTube, with $10,000+$ different human subjects and scenes. WildAvatar is at least $10\times$ richer than previous datasets for 3D human avatar creation. We evaluate several state-of-the-art avatar creation methods on our dataset, highlighting unexplored challenges in real-world avatar creation. We also demonstrate the potential for generalizability of avatar creation methods when provided with data at scale. We publicly release our data source links and annotations to push forward 3D human avatar creation and related fields for real-world applications.

Comments:	Project page: this https URL
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2407.02165 [cs.CV]
	(or arXiv:2407.02165v3 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2407.02165

Submission history

From: Zihao Huang
[v1] Tue, 2 Jul 2024 11:17:48 UTC (3,042 KB)
[v2] Wed, 10 Jul 2024 09:20:39 UTC (6,793 KB)
[v3] Sun, 14 Jul 2024 08:15:12 UTC (6,793 KB)
