Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Integrate ZeRO-Powered Data Parallelism #20

Closed
StellaAthena opened this issue Jan 1, 2021 · 1 comment
Closed

Integrate ZeRO-Powered Data Parallelism #20

StellaAthena opened this issue Jan 1, 2021 · 1 comment
Labels
feature request New feature or request
Projects

Comments

@StellaAthena
Copy link
Member

Per DeepSpeed

We developed ZeRO to conquer the limitations of data parallelism and model parallelism while achieving the merits of both. ZeRO removes the memory redundancies across data-parallel processes by partitioning the model states—parameters, gradients, and optimizer state—across data parallel processes instead of replicating them. It uses a dynamic communication schedule during training to share the necessary state across distributed devices to retain the computational granularity and communication volume of data parallelism

We call this ZeRO-powered data parallelism, which allows per-device memory usage to scale linearly with the degree of data parallelism and incurs similar communication volume as data parallelism. ZeRO-powered data parallelism can fit models of arbitrary size—as long as the aggregated device memory is large enough to share the model states.

@StellaAthena StellaAthena added the feature request New feature or request label Jan 1, 2021
@StellaAthena StellaAthena added this to To do in 1T or BUST via automation Jan 1, 2021
@sdtblck
Copy link
Contributor

sdtblck commented Jan 5, 2021

this is handled automatically by deepspeed.initialize

@sdtblck sdtblck closed this as completed Jan 5, 2021
1T or BUST automation moved this from To do to Done Jan 5, 2021
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
feature request New feature or request
Projects
Development

No branches or pull requests

2 participants