The Bias Amplification Paradox in Text-to-Image Generation

Seshadri, Preethi; Singh, Sameer; Elazar, Yanai

Computer Science > Machine Learning

arXiv:2308.00755 (cs)

[Submitted on 1 Aug 2023 (v1), last revised 15 Nov 2023 (this version, v2)]

Title:The Bias Amplification Paradox in Text-to-Image Generation

Authors:Preethi Seshadri, Sameer Singh, Yanai Elazar

View PDF

Abstract:Bias amplification is a phenomenon in which models exacerbate biases or stereotypes present in the training data. In this paper, we study bias amplification in the text-to-image domain using Stable Diffusion by comparing gender ratios in training vs. generated images. We find that the model appears to amplify gender-occupation biases found in the training data (LAION) considerably. However, we discover that amplification can be largely attributed to discrepancies between training captions and model prompts. For example, an inherent difference is that captions from the training data often contain explicit gender information while our prompts do not, which leads to a distribution shift and consequently inflates bias measures. Once we account for distributional differences between texts used for training and generation when evaluating amplification, we observe that amplification decreases drastically. Our findings illustrate the challenges of comparing biases in models and their training data, and highlight confounding factors that impact analyses.

Subjects:	Machine Learning (cs.LG); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Computers and Society (cs.CY)
Cite as:	arXiv:2308.00755 [cs.LG]
	(or arXiv:2308.00755v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2308.00755

Submission history

From: Preethi Seshadri [view email]
[v1] Tue, 1 Aug 2023 18:00:08 UTC (28,634 KB)
[v2] Wed, 15 Nov 2023 08:51:11 UTC (36,376 KB)

Computer Science > Machine Learning

Title:The Bias Amplification Paradox in Text-to-Image Generation

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:The Bias Amplification Paradox in Text-to-Image Generation

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators