DetDiffusion: Synergizing Generative and Perceptive Models for Enhanced Data Generation and Perception

Wang, Yibo; Gao, Ruiyuan; Chen, Kai; Zhou, Kaiqiang; Cai, Yingjie; Hong, Lanqing; Li, Zhenguo; Jiang, Lihui; Yeung, Dit-Yan; Xu, Qiang; Zhang, Kai

Computer Science > Computer Vision and Pattern Recognition

arXiv:2403.13304 (cs)

[Submitted on 20 Mar 2024]

Title:DetDiffusion: Synergizing Generative and Perceptive Models for Enhanced Data Generation and Perception

Authors:Yibo Wang, Ruiyuan Gao, Kai Chen, Kaiqiang Zhou, Yingjie Cai, Lanqing Hong, Zhenguo Li, Lihui Jiang, Dit-Yan Yeung, Qiang Xu, Kai Zhang

View PDF HTML (experimental)

Abstract:Current perceptive models heavily depend on resource-intensive datasets, prompting the need for innovative solutions. Leveraging recent advances in diffusion models, synthetic data, by constructing image inputs from various annotations, proves beneficial for downstream tasks. While prior methods have separately addressed generative and perceptive models, DetDiffusion, for the first time, harmonizes both, tackling the challenges in generating effective data for perceptive models. To enhance image generation with perceptive models, we introduce perception-aware loss (P.A. loss) through segmentation, improving both quality and controllability. To boost the performance of specific perceptive models, our method customizes data augmentation by extracting and utilizing perception-aware attribute (P.A. Attr) during generation. Experimental results from the object detection task highlight DetDiffusion's superior performance, establishing a new state-of-the-art in layout-guided generation. Furthermore, image syntheses from DetDiffusion can effectively augment training data, significantly enhancing downstream detection performance.

Comments:	Accepted to CVPR 2024
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2403.13304 [cs.CV]
	(or arXiv:2403.13304v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2403.13304

Submission history

From: Yibo Wang [view email]
[v1] Wed, 20 Mar 2024 04:58:03 UTC (4,648 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:DetDiffusion: Synergizing Generative and Perceptive Models for Enhanced Data Generation and Perception

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:DetDiffusion: Synergizing Generative and Perceptive Models for Enhanced Data Generation and Perception

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators