Implement the MPU from Megatron #75

Closed
StellaAthena opened this issue Jan 21, 2021 · 1 comment

Comments

@StellaAthena
Member

The Megatron code contains an "MPU" library. MPU stands for "model parallelism unit." The purpose of an MPU is to allow custom tensor slicing across GPUs. DeepSpeed lets you plug in an MPU, but doesn't provide one. The goal is to port the MPU from Megatron to GPT-NeoX. This is a modified clone of Megatron: https://github.com/EleutherAI/MegatronPipeline

You may find the (minimalistic) descriptions DeepSpeed provides helpful:
https://www.deepspeed.ai/features/#model-parallelism
https://www.deepspeed.ai/tutorials/megatron/

The full DeepSpeed docs can be found here: https://deepspeed.readthedocs.io/en/latest/index.html
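
To make "custom tensor slicing across GPUs" concrete, here is a minimal sketch of the rank bookkeeping an MPU does. It is not the Megatron or GPT-NeoX implementation. `ToyMPU` and `shard_bounds` are hypothetical names, the accessor names follow the `get_model_parallel_*` / `get_data_parallel_*` pattern as I understand DeepSpeed expects it on an `mpu` object, and the group layout (contiguous ranks form a model-parallel group) is an assumption. A real MPU would return `torch.distributed` process groups; this sketch returns plain lists of ranks so it runs without GPUs.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass(frozen=True)
class ToyMPU:
    """Hypothetical stand-in for an MPU: tracks which ranks share a model
    (model-parallel group) and which hold replicas (data-parallel group)."""

    rank: int
    world_size: int
    model_parallel_size: int

    def __post_init__(self) -> None:
        if self.world_size % self.model_parallel_size != 0:
            raise ValueError("world_size must be divisible by model_parallel_size")
        if not 0 <= self.rank < self.world_size:
            raise ValueError("rank must be in [0, world_size)")

    # Model-parallel accessors (assumed layout: contiguous blocks of ranks).
    def get_model_parallel_world_size(self) -> int:
        return self.model_parallel_size

    def get_model_parallel_rank(self) -> int:
        return self.rank % self.model_parallel_size

    def get_model_parallel_group(self) -> List[int]:
        start = (self.rank // self.model_parallel_size) * self.model_parallel_size
        return list(range(start, start + self.model_parallel_size))

    # Data-parallel accessors: ranks holding the same shard of different replicas.
    def get_data_parallel_world_size(self) -> int:
        return self.world_size // self.model_parallel_size

    def get_data_parallel_rank(self) -> int:
        return self.rank // self.model_parallel_size

    def get_data_parallel_group(self) -> List[int]:
        offset = self.rank % self.model_parallel_size
        return list(range(offset, self.world_size, self.model_parallel_size))


def shard_bounds(dim: int, mpu: ToyMPU) -> Tuple[int, int]:
    """Return the [start, stop) slice of a dimension of size `dim`
    that this rank would own under an even model-parallel split."""
    mp = mpu.get_model_parallel_world_size()
    if dim % mp != 0:
        raise ValueError("dim must be divisible by the model-parallel size")
    per_rank = dim // mp
    start = mpu.get_model_parallel_rank() * per_rank
    return start, start + per_rank


if __name__ == "__main__":
    mpu = ToyMPU(rank=5, world_size=8, model_parallel_size=2)
    print(mpu.get_model_parallel_group())  # [4, 5]
    print(mpu.get_data_parallel_group())   # [1, 3, 5, 7]
    print(shard_bounds(1024, mpu))         # (512, 1024)
```

With 8 GPUs and a model-parallel size of 2, rank 5 shares a model with rank 4 and holds the second half of each split dimension; ranks 1, 3, 5, and 7 hold the same half across the four replicas.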

StellaAthena added the "feature request" label on Jan 21, 2021
StellaAthena added this to To do in 1T or BUST via automation on Jan 21, 2021
StellaAthena moved this from To do to In progress in 1T or BUST on Jan 23, 2021
StellaAthena linked a pull request that will close this issue on Jan 23, 2021
@StellaAthena
Member Author

Superseded by codebase refactoring.

1T or BUST automation moved this from In progress to Done Feb 15, 2021
2 participants