Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Pythia Experiment 3: Gender bias #14

Closed
1 of 6 tasks
haileyschoelkopf opened this issue Nov 9, 2022 · 2 comments
Closed
1 of 6 tasks

Pythia Experiment 3: Gender bias #14

haileyschoelkopf opened this issue Nov 9, 2022 · 2 comments
Assignees

Comments

@haileyschoelkopf
Copy link
Collaborator

haileyschoelkopf commented Nov 9, 2022

  • Collect cooccurrence statistics on deduped and non-deduped Pile
  • Cast WinoBias as a multiple choice classification task
  • Convert corpus stats into Pandas dataset + visualize distribution
  • Run WinoBias on all models
  • Collect correlations between WinoBias scores and data statistics
  • Create fine-tuning data points for insertion into last 3 ckpt retraining
@haileyschoelkopf haileyschoelkopf self-assigned this Nov 9, 2022
@haileyschoelkopf
Copy link
Collaborator Author

The cooccurrence statistics were too uniformly noisy to derive meaningful conclusions from at large timescales.

Updated todos:

  • train 1.3B on (M prn -> F prn) data but same data + order otherwise, for last 5k steps of training
  • Run Winobias, AXG, crows_pairs, jigsaw on model before + after intervention
  • ???

@haileyschoelkopf
Copy link
Collaborator Author

We aren't doing anything with the above described data, and ended up doing something else to fulfill this experiment.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
Status: Done
Development

No branches or pull requests

1 participant