Bias in Automated Speaker Recognition

Hutiri, Wiebke Toussaint; Ding, Aaron

doi:10.1145/3531146.3533089

Computer Science > Sound

arXiv:2201.09486 (cs)

[Submitted on 24 Jan 2022 (v1), last revised 20 Jun 2022 (this version, v2)]

Title:Bias in Automated Speaker Recognition

Authors:Wiebke Toussaint Hutiri, Aaron Ding

View PDF

Abstract:Automated speaker recognition uses data processing to identify speakers by their voice. Today, automated speaker recognition is deployed on billions of smart devices and in services such as call centres. Despite their wide-scale deployment and known sources of bias in related domains like face recognition and natural language processing, bias in automated speaker recognition has not been studied systematically. We present an in-depth empirical and analytical study of bias in the machine learning development workflow of speaker verification, a voice biometric and core task in automated speaker recognition. Drawing on an established framework for understanding sources of harm in machine learning, we show that bias exists at every development stage in the well-known VoxCeleb Speaker Recognition Challenge, including data generation, model building, and implementation. Most affected are female speakers and non-US nationalities, who experience significant performance degradation. Leveraging the insights from our findings, we make practical recommendations for mitigating bias in automated speaker recognition, and outline future research directions.

Subjects:	Sound (cs.SD); Computation and Language (cs.CL); Computers and Society (cs.CY); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
Cite as:	arXiv:2201.09486 [cs.SD]
	(or arXiv:2201.09486v2 [cs.SD] for this version)
	https://doi.org/10.48550/arXiv.2201.09486
Journal reference:	2022 ACM Conference on Fairness, Accountability, and Transparency (FAccT '22)
Related DOI:	https://doi.org/10.1145/3531146.3533089

Submission history

From: Wiebke Toussaint Hutiri [view email]
[v1] Mon, 24 Jan 2022 06:48:57 UTC (4,872 KB)
[v2] Mon, 20 Jun 2022 00:34:09 UTC (1,980 KB)

Computer Science > Sound

Title:Bias in Automated Speaker Recognition

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Sound

Title:Bias in Automated Speaker Recognition

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators