Diffusion Guided Domain Adaptation of Image Generators

Song, Kunpeng; Han, Ligong; Liu, Bingchen; Metaxas, Dimitris; Elgammal, Ahmed

Computer Science > Computer Vision and Pattern Recognition

arXiv:2212.04473 (cs)

[Submitted on 8 Dec 2022 (v1), last revised 9 Dec 2022 (this version, v2)]

Title:Diffusion Guided Domain Adaptation of Image Generators

Authors:Kunpeng Song, Ligong Han, Bingchen Liu, Dimitris Metaxas, Ahmed Elgammal

View PDF

Abstract:Can a text-to-image diffusion model be used as a training objective for adapting a GAN generator to another domain? In this paper, we show that the classifier-free guidance can be leveraged as a critic and enable generators to distill knowledge from large-scale text-to-image diffusion models. Generators can be efficiently shifted into new domains indicated by text prompts without access to groundtruth samples from target domains. We demonstrate the effectiveness and controllability of our method through extensive experiments. Although not trained to minimize CLIP loss, our model achieves equally high CLIP scores and significantly lower FID than prior work on short prompts, and outperforms the baseline qualitatively and quantitatively on long and complicated prompts. To our best knowledge, the proposed method is the first attempt at incorporating large-scale pre-trained diffusion models and distillation sampling for text-driven image generator domain adaptation and gives a quality previously beyond possible. Moreover, we extend our work to 3D-aware style-based generators and DreamBooth guidance.

Comments:	Project website: this https URL
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2212.04473 [cs.CV]
	(or arXiv:2212.04473v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2212.04473

Submission history

From: Kunpeng Song [view email]
[v1] Thu, 8 Dec 2022 18:46:19 UTC (36,494 KB)
[v2] Fri, 9 Dec 2022 08:58:13 UTC (36,494 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Diffusion Guided Domain Adaptation of Image Generators

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Diffusion Guided Domain Adaptation of Image Generators

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators