HumanRef: Single Image to 3D Human Generation via Reference-Guided Diffusion

Zhang, Jingbo; Li, Xiaoyu; Zhang, Qi; Cao, Yanpei; Shan, Ying; Liao, Jing

Computer Science > Computer Vision and Pattern Recognition

arXiv:2311.16961 (cs)

[Submitted on 28 Nov 2023]

Title:HumanRef: Single Image to 3D Human Generation via Reference-Guided Diffusion

Authors:Jingbo Zhang, Xiaoyu Li, Qi Zhang, Yanpei Cao, Ying Shan, Jing Liao

View PDF

Abstract:Generating a 3D human model from a single reference image is challenging because it requires inferring textures and geometries in invisible views while maintaining consistency with the reference image. Previous methods utilizing 3D generative models are limited by the availability of 3D training data. Optimization-based methods that lift text-to-image diffusion models to 3D generation often fail to preserve the texture details of the reference image, resulting in inconsistent appearances in different views. In this paper, we propose HumanRef, a 3D human generation framework from a single-view input. To ensure the generated 3D model is photorealistic and consistent with the input image, HumanRef introduces a novel method called reference-guided score distillation sampling (Ref-SDS), which effectively incorporates image guidance into the generation process. Furthermore, we introduce region-aware attention to Ref-SDS, ensuring accurate correspondence between different body regions. Experimental results demonstrate that HumanRef outperforms state-of-the-art methods in generating 3D clothed humans with fine geometry, photorealistic textures, and view-consistent appearances.

Comments:	Homepage: this https URL
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2311.16961 [cs.CV]
	(or arXiv:2311.16961v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2311.16961

Submission history

From: Jingbo Zhang [view email]
[v1] Tue, 28 Nov 2023 17:06:28 UTC (12,132 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:HumanRef: Single Image to 3D Human Generation via Reference-Guided Diffusion

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:HumanRef: Single Image to 3D Human Generation via Reference-Guided Diffusion

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators