This repo contains the PyTorch implementation of the ACL 2020 paper *End-to-End Bias Mitigation by Modelling Biases in Corpora*.
To download all datasets used in this work, run:

```bash
cd data
bash get_datasets.sh
```
To download the GloVe (v1) embeddings, run:

```bash
cd data
bash get_glove.sh
```
- The Product of Experts (PoE), Debiased Focal Loss (DFL), and RUBi loss implementations are provided in `src/losses.py`.
- The code for the BERT baseline is provided in `src/BERT/`, with scripts to reproduce the results in `src/BERT/scripts/`.
- The code for the InferSent baseline is provided in `src/InferSent/`, with scripts to reproduce the results in `src/InferSent/scripts/`.
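For reference, here is a minimal sketch of how the PoE and DFL objectives are typically formulated, combining the main model's logits with those of a bias-only model. The function names and the `gamma` default are illustrative; the authoritative implementations are in `src/losses.py`.

```python
import torch
import torch.nn.functional as F

def poe_loss(main_logits, bias_logits, labels):
    # Product of Experts: sum the log-probabilities of the main model
    # and the bias-only model (a product in probability space), then
    # take cross-entropy over the renormalized combination.
    combined = F.log_softmax(main_logits, dim=-1) + F.log_softmax(bias_logits, dim=-1)
    return F.cross_entropy(combined, labels)

def dfl_loss(main_logits, bias_logits, labels, gamma=2.0):
    # Debiased Focal Loss: down-weight examples that the bias-only
    # model already classifies confidently, by the factor (1 - p_bias)^gamma.
    bias_probs = F.softmax(bias_logits, dim=-1)
    p_bias = bias_probs.gather(1, labels.unsqueeze(1)).squeeze(1)
    log_p = F.log_softmax(main_logits, dim=-1).gather(1, labels.unsqueeze(1)).squeeze(1)
    return -((1.0 - p_bias) ** gamma * log_p).mean()
```

With a uniform bias-only model, PoE reduces to plain cross-entropy on the main model; the bias-only model only reshapes the loss where it is confident.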
Requirements:

- pytorch-transformers 1.1.0
- transformers 2.5.0
- pytorch 1.2.0
- pytorch-pretrained-bert 0.6.2
To download the MNLI Mismatched/Matched development sets from the ACL 2020 paper End-to-End Bias Mitigation by Modelling Biases in Corpora, use these links: mismatched, matched.
Running the `get_datasets.sh` script downloads the generated files under the names `MNLIMismatchedHardWithHardTest` and `MNLIMatchedHardWithHardTest`.
Each dataset has three files:
- `s1.test`: each line contains a premise
- `s2.test`: each line contains a hypothesis
- `labels.test`: each line contains a label
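Since the three files are line-aligned, a split can be loaded with a few lines of Python. The `load_split` helper below is a hypothetical convenience function, not part of the repo:

```python
from pathlib import Path

def load_split(data_dir):
    # Each split consists of three line-aligned files:
    # s1.test (premises), s2.test (hypotheses), labels.test (labels).
    data_dir = Path(data_dir)
    premises = (data_dir / "s1.test").read_text().splitlines()
    hypotheses = (data_dir / "s2.test").read_text().splitlines()
    labels = (data_dir / "labels.test").read_text().splitlines()
    assert len(premises) == len(hypotheses) == len(labels), "files must be line-aligned"
    return list(zip(premises, hypotheses, labels))
```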
If you find this repo useful, please cite our paper:

```bibtex
@inproceedings{karimi2020endtoend,
  title={End-to-End Bias Mitigation by Modelling Biases in Corpora},
  author={Karimi Mahabadi, Rabeeh and Belinkov, Yonatan and Henderson, James},
  booktitle={Annual Meeting of the Association for Computational Linguistics},
  year={2020}
}
```
We hope this repo is useful for your research. For any questions, please open an issue or email [email protected], and we will get back to you as soon as possible.