This repository has been archived by the owner on Dec 16, 2022. It is now read-only.

Bias Mitigation and Direction Methods #5130

Merged · 25 commits · May 11, 2021

Conversation

ArjunSubramonian (Contributor):

Additions proposed in this pull request:

  • Added four bias direction methods: PCABiasDirection, PairedPCABiasDirection, TwoMeansBiasDirection, ClassificationNormalBiasDirection
  • Added four bias mitigation methods: LinearBiasMitigator, HardBiasMitigator, INLPBiasMitigator, OSCaRBiasMitigator

@ArjunSubramonian ArjunSubramonian self-assigned this Apr 20, 2021

with torch.set_grad_enabled(self.requires_grad):
    # pca_lowrank centers the embeddings by default
    _, _, V = torch.pca_lowrank(seed_embeddings, q=2)
Contributor:

Why do we set q=2?

Contributor (Author):

I followed the VERB implementation and paper. The intuition is that applying PCA to definitionally-gendered words yields two salient components: (1) the gender direction and (2) all remaining variation, with the gender direction being the principal component.
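The intuition above can be sketched as a small function; this is a simplified illustration of the idea (the helper name is ours), not the actual PCABiasDirection implementation:

```python
import torch

def pca_bias_direction(seed_embeddings: torch.Tensor) -> torch.Tensor:
    # pca_lowrank centers the embeddings by default; q=2 keeps the two
    # leading components: the (principal) bias direction plus one
    # residual direction, per the VERB paper's intuition.
    _, _, V = torch.pca_lowrank(seed_embeddings, q=2)
    # The first principal component captures the dominant variation,
    # taken here to be the bias direction.
    direction = V[:, 0]
    return direction / torch.linalg.norm(direction)

# Toy usage: 6 "definitionally gendered" embeddings of dimension 4.
direction = pca_bias_direction(torch.randn(6, 4))
```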

Contributor (Author):

Added a comment in the file itself

bias_direction : `torch.Tensor`
A unit tensor of size (dim, ) representing the concept subspace. The words
that are used to define the bias direction are considered definitionally
gendered and not modified.
Contributor:

"definitionally gendered" is for the specific example of concept "gender", right? Words like "king", "queen", "he", "she", etc.?

Contributor (Author):

Yes!
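For intuition, the two-means family builds the direction straight from two such definitionally-gendered groups (e.g. {"king", "he"} vs. {"queen", "she"}). A hypothetical sketch of the idea, not the exact TwoMeansBiasDirection API:

```python
import torch

def two_means_direction(group1: torch.Tensor, group2: torch.Tensor) -> torch.Tensor:
    # Unit vector pointing from the centroid of one definitionally-gendered
    # word group to the centroid of the other.
    diff = group1.mean(dim=0) - group2.mean(dim=0)
    return diff / torch.linalg.norm(diff)

# Toy usage: two groups of three 4-dimensional embeddings each.
direction = two_means_direction(torch.randn(3, 4), torch.randn(3, 4))
```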


class HardBiasMitigator(BiasMitigator):
"""
Hard bias mitigator. Mitigates bias in embeddings by:
Contributor:

Perhaps we should mention explicitly that this is applicable for binary concepts?

Contributor (Author):

Added note at top of both mitigator and direction files.


2. Equalizing: ensuring that protected variable-related words are averaged
out to have the same norm.
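In code, neutralizing and equalizing can be sketched as follows, in the spirit of hard debiasing (Bolukbasi et al.); a simplified illustration assuming unit-norm embeddings and a unit bias direction, not the actual HardBiasMitigator implementation:

```python
import torch

def neutralize(v: torch.Tensor, g: torch.Tensor) -> torch.Tensor:
    # Remove the component of v along the unit bias direction g, so a
    # bias-neutral word (e.g. "doctor") carries no gender component.
    return v - (v @ g) * g

def equalize(a: torch.Tensor, b: torch.Tensor, g: torch.Tensor):
    # Make a definitionally-gendered pair (e.g. "king"/"queen") share the
    # same bias-free component and the same norm, differing only by
    # opposite-sign components along g.
    mu = (a + b) / 2
    nu = neutralize(mu, g)                       # shared bias-free part
    z = torch.sqrt(torch.clamp(1.0 - nu @ nu, min=0.0))
    a_bias = (a @ g) * g - (mu @ g) * g          # a's offset along g
    a_new = nu + z * a_bias / torch.linalg.norm(a_bias)
    b_new = nu - z * a_bias / torch.linalg.norm(a_bias)
    return a_new, b_new

# Toy usage with a unit bias direction along the first coordinate.
g = torch.tensor([1.0, 0.0, 0.0, 0.0])
king = torch.tensor([0.6, 0.8, 0.0, 0.0])
queen = torch.tensor([-0.6, 0.8, 0.0, 0.0])
king_eq, queen_eq = equalize(king, queen, g)
```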

Contributor:

Can we add some conceptual examples of what "Neutralizing" and "Equalizing" mean? It makes sense mathematically, but for someone getting started and looking to use this, it might be more helpful to give practical examples for making it "click". The examples in the VERB paper are good.

Contributor (Author):

For each mitigation method, I just linked the appropriate figure in the VERB paper, as I think the pictures are the most helpful.

All tensors are expected to be on the same device.

!!! Note
This bias direction method is NOT differentiable.
Contributor:

If we intend to allow users to specify bias direction (and mitigator) methods in config, perhaps we should make "is_differentiable" a field, so that the list of methods which can be used can be obtained programmatically?

Contributor (Author):

Yes, this is part of the bias mitigators and direction wrappers PR - this PR is just the functional API.
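As a hypothetical sketch of the idea (illustrative names only, not the wrapper API from that PR), an `is_differentiable` class attribute would make the usable methods discoverable:

```python
# Hypothetical base class exposing differentiability as data, so a config
# system could filter methods programmatically. Class names are invented
# stand-ins, not the real allennlp wrappers.
class BiasDirectionMethod:
    is_differentiable: bool = True

class PCADirection(BiasDirectionMethod):
    pass

class ClassificationNormalDirection(BiasDirectionMethod):
    # Relies on a non-differentiable classifier fit.
    is_differentiable = False

def differentiable_methods(methods):
    return [m for m in methods if m.is_differentiable]

names = [m.__name__ for m in differentiable_methods(
    [PCADirection, ClassificationNormalDirection])]
```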

expected_bias_mitigated_embeddings
).reshape(2, 2, -1)

def teardown_method(self):
Contributor:

Why do we do this?

Contributor (Author):

we shouldn't :) I just forgot to call the parent setup_method(), so the tmp dir wasn't being deleted.
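The fix amounts to calling the parent's setup_method so its temp-dir bookkeeping runs; a minimal stand-alone sketch, where BaseTestCase is a stand-in for the project's real test base class:

```python
import pathlib
import shutil
import tempfile

class BaseTestCase:
    # Stand-in for the real test base class that manages a temp dir.
    def setup_method(self):
        self.TEST_DIR = pathlib.Path(tempfile.mkdtemp())

    def teardown_method(self):
        shutil.rmtree(self.TEST_DIR)

class BiasMitigatorsTest(BaseTestCase):
    def setup_method(self):
        # Without this call, teardown_method has no TEST_DIR to remove
        # and the temp dir leaks.
        super().setup_method()
        self.embeddings = [1.0, 2.0]

t = BiasMitigatorsTest()
t.setup_method()
exists_before = t.TEST_DIR.exists()
t.teardown_method()
exists_after = t.TEST_DIR.exists()
```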

# Want to adjust first 2 coordinates and leave d - 2
# other orthogonal components fixed
fixed_rotated_evaluation_embeddings = rotated_evaluation_embeddings[..., 2:]
# Restrict attention to subspace S
Contributor:

where subspace S is ...?

Contributor (Author):

the subspace spanned by the bias directions (made a comment in the file)
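The restriction to S can be sketched as splitting each embedding into its coordinates inside the plane spanned by the two bias directions plus a fixed orthogonal remainder; a simplified illustration of that step, not the full OSCaR implementation:

```python
import torch

def split_into_subspace(emb: torch.Tensor, d1: torch.Tensor, d2: torch.Tensor):
    # Orthonormal basis for S via Gram-Schmidt on the two bias directions.
    b1 = d1 / torch.linalg.norm(d1)
    v = d2 - (d2 @ b1) * b1
    b2 = v / torch.linalg.norm(v)
    basis = torch.stack([b1, b2], dim=1)   # (d, 2)
    in_S = emb @ basis                     # coordinates inside S, to be rotated
    fixed = emb - in_S @ basis.T           # d - 2 orthogonal components, untouched
    return in_S, fixed, basis

# Toy usage: 5 embeddings of dimension 4, two random bias directions.
emb = torch.randn(5, 4)
in_S, fixed, basis = split_into_subspace(emb, torch.randn(4), torch.randn(4))
# Any rotation applied to in_S before adding `fixed` back leaves the
# orthogonal complement unchanged; with no rotation we recover the input.
recon = in_S @ basis.T + fixed
```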


AkshitaB commented May 8, 2021

@ArjunSubramonian I've left some comments; mostly regarding docs (which are fairly extensive, btw; great job!)

@AkshitaB AkshitaB merged commit d9b19b6 into main May 11, 2021
@AkshitaB AkshitaB deleted the arjuns/post-processing-debiasing branch May 11, 2021 17:44
Abhishek-P pushed a commit to Abhishek-P/allennlp that referenced this pull request Aug 11, 2021
* added linear and hard debiasers

* worked on documentation

* committing changes before branch switch

* committing changes before switching branch

* finished bias direction, linear and hard debiasers, need to write tests

* finished bias direction test

* Commiting changes before switching branch

* finished hard and linear debiasers

* finished OSCaR

* bias mitigators tests and bias metrics remaining

* added bias mitigator tests

* added bias mitigator tests

* finished tests for bias mitigation methods

* fixed gpu issues

* fixed gpu issues

* fixed gpu issues

* resolve issue with count_nonzero not being differentiable

* added more references

* responded to Akshita's comments

Co-authored-by: Arjun Subramonian <[email protected]>
Co-authored-by: Michael Schmitz <[email protected]>
Co-authored-by: Akshita Bhagia <[email protected]>