Equalizing Gender Biases in Neural Machine Translation with Word Embeddings Techniques

Font, Joel Escudé; Costa-jussà, Marta R.

Computer Science > Computation and Language

arXiv:1901.03116 (cs)

[Submitted on 10 Jan 2019 (v1), last revised 2 Jun 2019 (this version, v2)]

Title:Equalizing Gender Biases in Neural Machine Translation with Word Embeddings Techniques

Authors:Joel Escudé Font, Marta R. Costa-jussà

View PDF

Abstract:Neural machine translation has significantly pushed forward the quality of the field. However, there are remaining big issues with the output translations and one of them is fairness. Neural models are trained on large text corpora which contain biases and stereotypes. As a consequence, models inherit these social biases. Recent methods have shown results in reducing gender bias in other natural language processing tools such as word embeddings. We take advantage of the fact that word embeddings are used in neural machine translation to propose a method to equalize gender biases in neural machine translation using these representations. Specifically, we propose, experiment and analyze the integration of two debiasing techniques over GloVe embeddings in the Transformer translation architecture. We evaluate our proposed system on the WMT English-Spanish benchmark task, showing gains up to one BLEU point. As for the gender bias evaluation, we generate a test set of occupations and we show that our proposed system learns to equalize existing biases from the baseline system.

Comments:	This paper has been accepted for publication at the 1st ACL Workshop on Gender Bias for Natural Language Processing (2019)
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:1901.03116 [cs.CL]
	(or arXiv:1901.03116v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.1901.03116

Submission history

From: Joel Escudé Font [view email]
[v1] Thu, 10 Jan 2019 12:06:31 UTC (113 KB)
[v2] Sun, 2 Jun 2019 22:20:06 UTC (102 KB)

Computer Science > Computation and Language

Title:Equalizing Gender Biases in Neural Machine Translation with Word Embeddings Techniques

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Equalizing Gender Biases in Neural Machine Translation with Word Embeddings Techniques

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators