Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Mup Support #704

Merged
merged 45 commits into from
Dec 10, 2022
Merged

Mup Support #704

merged 45 commits into from
Dec 10, 2022

Conversation

nsarka
Copy link
Contributor

@nsarka nsarka commented Oct 19, 2022

This PR implements support for mup: https://github.com/microsoft/mup

I'm able to train with this patch. Studies on its effectiveness still need to be done. For that, I still need to add args for generating the coord check plots they describe in the readme.

@nsarka nsarka mentioned this pull request Oct 19, 2022
7 tasks
configs/small.yml Outdated Show resolved Hide resolved
@Quentin-Anthony
Copy link
Member

I left a set of comments. Let's resolve those, test changes with PP, get some coord check plots, and test a few other mup settings + neox config combinations before merging.

@nsarka nsarka marked this pull request as ready for review December 8, 2022 00:18
@nsarka nsarka requested a review from a team as a code owner December 8, 2022 00:18
@nsarka nsarka requested review from StellaAthena and removed request for a team December 8, 2022 00:18
@Quentin-Anthony Quentin-Anthony merged commit 0535bfb into deepspeed_main Dec 10, 2022
@Quentin-Anthony Quentin-Anthony deleted the nsarka/mup-support branch December 10, 2022 17:30
@StellaAthena StellaAthena added this to the Release V2 milestone Dec 20, 2022
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

3 participants