Integrate DeepSpeed #4

Closed
StellaAthena opened this issue Dec 23, 2020 · 0 comments · Fixed by #9 or #10
Labels
feature request: New feature or request
Comments

@StellaAthena
Member

To get this code running as efficiently as we need, we should use Microsoft's DeepSpeed library. It has a lot of bells and whistles and optimization options.

@StellaAthena added the "feature request" label Dec 23, 2020
@StellaAthena added this to "To do" in 1T or BUST via automation Dec 23, 2020
@StellaAthena moved this from "To do" to "In progress" in 1T or BUST Dec 24, 2020
This was linked to pull requests Dec 24, 2020
1T or BUST automation moved this from "In progress" to "Done" Dec 26, 2020
vaibhav016 pushed a commit to vaibhav016/gpt-neox that referenced this issue Jun 27, 2024