Learning to Split for Automatic Bias Detection

Bao, Yujia; Barzilay, Regina

arXiv:2204.13749v1 (cs)

[Submitted on 28 Apr 2022 (this version), latest version 20 Jul 2022 (v2)]

Abstract: Classifiers are biased when trained on biased datasets. As a remedy, we propose Learning to Split (ls), an algorithm for automatic bias detection. Given a dataset with input-label pairs, ls learns to split this dataset so that predictors trained on the training split generalize poorly to the testing split. This performance gap provides a proxy for measuring the degree of bias in the learned features and can therefore be used to reduce biases. Identifying non-generalizable splits is challenging because we do not have any explicit annotations about how to split. In this work, we show that the prediction correctness of each testing example can be used as a source of weak supervision: generalization performance will drop if we move examples that are predicted correctly away from the testing split, leaving only those that are mispredicted. We evaluate our approach on Beer Review, Waterbirds, CelebA and MNLI. Empirical results show that ls is able to generate astonishingly challenging splits that correlate with human-identified biases. Moreover, we demonstrate that combining robust learning algorithms (such as group DRO) with splits identified by ls enables automatic de-biasing. Compared with the previous state of the art, we substantially improve the worst-group performance (23.4% on average) when the source of biases is unknown during training and validation.
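
The weak-supervision idea in the abstract can be illustrated with a small toy sketch. This is not the authors' algorithm: the abstract says ls learns how to split, whereas the code below is a hand-written greedy loop that only mimics the stated intuition (move correctly predicted test examples into the training split and keep the mispredicted ones). The toy data, the majority-vote predictor, and every function name (`fit_majority`, `accuracy`, `refine_split`) are assumptions made for illustration.

```python
from collections import Counter, defaultdict

def fit_majority(examples):
    """Toy predictor (an assumption, not the paper's model):
    map each feature value to its majority label in `examples`."""
    votes = defaultdict(Counter)
    for feature, label in examples:
        votes[feature][label] += 1
    return {f: c.most_common(1)[0][0] for f, c in votes.items()}

def accuracy(model, examples, default=0):
    """Fraction of examples whose label matches the model's prediction."""
    if not examples:
        return 0.0
    hits = sum(model.get(f, default) == y for f, y in examples)
    return hits / len(examples)

def refine_split(train, test, min_test=1, max_rounds=10):
    """Greedy stand-in for a learned splitter: repeatedly move correctly
    predicted test examples into train, keeping only mispredicted ones."""
    train, test = list(train), list(test)
    for _ in range(max_rounds):
        model = fit_majority(train)
        correct = [ex for ex in test if model.get(ex[0], 0) == ex[1]]
        wrong = [ex for ex in test if model.get(ex[0], 0) != ex[1]]
        # Stop when nothing can be moved or the test split would get too small.
        if not correct or len(wrong) < min_test:
            break
        train, test = train + correct, wrong
    return train, test

# Hypothetical data: (spurious_feature, label). The feature agrees with the
# label for 80 examples and disagrees for 20 "minority" examples.
data = [(0, 0)] * 40 + [(1, 1)] * 40 + [(0, 1)] * 10 + [(1, 0)] * 10
init_train, init_test = data[::2], data[1::2]

new_train, new_test = refine_split(init_train, init_test)

before = accuracy(fit_majority(init_train), init_test)
after = accuracy(fit_majority(new_train), new_test)
print(f"test accuracy before: {before:.2f}, after: {after:.2f}")
print("examples left in test split:", sorted(set(new_test)))
```

In this toy setting the refined test split ends up holding only examples where the spurious feature disagrees with the label, so a predictor that relies on that feature generalizes poorly to it. The real method would also need to guard against degenerate splits, which this sketch handles only crudely through `min_test`.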

Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2204.13749 [cs.LG]
	(or arXiv:2204.13749v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2204.13749

Submission history

From: Yujia Bao
[v1] Thu, 28 Apr 2022 19:41:08 UTC (1,512 KB)
[v2] Wed, 20 Jul 2022 23:44:01 UTC (1,656 KB)
