Fix train pipeline #89

Merged: 6 commits into main from fix_train_pipeline on Jan 25, 2021

@sdtblck (Contributor) commented Jan 24, 2021

Fix pipeline training scripts.

Some of the fixes are in the DeepSpeed library itself, so setup needs to install our DeepSpeed fork (https://github.com/EleutherAI/DeepSpeed) for training to work properly.

@StellaAthena (Member) left a comment

Works for several people locally and on the CW servers! A big win 👍

This was linked to issues Jan 25, 2021
@StellaAthena merged commit cb37b36 into main Jan 25, 2021
@StellaAthena deleted the fix_train_pipeline branch January 25, 2021 04:38
Successfully merging this pull request may close these issues.

gpt3small is broken
Implement Gradient Checkpointing