Tags: EleutherAI/DeeperSpeed

v2.0

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature. The key has expired.
[zero] prevent poor configs from running w. zero-offload (microsoft#2971)

v1.0

Verified

Add a runner for MosaicML cloud (#44)

* add mosaic launcher string constant

* first attempt at mosaic multinode runner for gptneox

* typo

* actually add mosaic runner

* string cast

* debug

* fix

* actually fix?

* strip extra space

* debugging

* debugging

* correctly set env vars

* drop cd

* add env vars via env instead of export commands

* fix

* try using slurms arg parsing

* debug print

* debug print

* print debug

* cleanup

* try getting world info from the hostfile

* add missing init arg

* more cleanup

* remove more prints

v0.3.10

version bump to 0.3.10

v0.3.9

Verified

Elastic training support (microsoft#602)

Co-authored-by: Samyam Rajbhandari <[email protected]>

v0.3.8

Verified

bump to 0.3.8

grad-norm-test

calculate grad norm wrt sub partitions

v0.3.7

bump to 0.3.7

v0.3.6

Verified

bump to 0.3.6 and fix manifest to include reqs (microsoft#561)

v0.3.5

bump to 0.3.5

v0.3.4

bump version 0.3.4