-
Notifications
You must be signed in to change notification settings - Fork 5.7k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[RLlib] DreamerV3: Make 200M (XL model) work; mixed float16 option #38461
[RLlib] DreamerV3: Make 200M (XL model) work; mixed float16 option #38461
Conversation
Signed-off-by: sven1977 <[email protected]>
Signed-off-by: sven1977 <[email protected]>
Signed-off-by: sven1977 <[email protected]>
Signed-off-by: sven1977 <[email protected]>
Signed-off-by: sven1977 <[email protected]>
Signed-off-by: sven1977 <[email protected]>
Signed-off-by: sven1977 <[email protected]>
Signed-off-by: sven1977 <[email protected]>
Signed-off-by: sven1977 <[email protected]>
Signed-off-by: sven1977 <[email protected]>
Signed-off-by: sven1977 <[email protected]>
Signed-off-by: sven1977 <[email protected]>
Signed-off-by: sven1977 <[email protected]>
Signed-off-by: sven1977 <[email protected]>
…mer_v3_06_make_200M_work
Signed-off-by: sven1977 <[email protected]>
Signed-off-by: sven1977 <[email protected]>
Signed-off-by: sven1977 <[email protected]>
Signed-off-by: sven1977 <[email protected]>
Signed-off-by: sven1977 <[email protected]>
Signed-off-by: sven1977 <[email protected]>
Signed-off-by: sven1977 <[email protected]>
Signed-off-by: sven1977 <[email protected]>
Signed-off-by: sven1977 <[email protected]>
…mer_v3_06_make_200M_work
Signed-off-by: sven1977 <[email protected]>
* computing all gradients * NOT postprocessing * NOT applying Signed-off-by: sven1977 <[email protected]>
…AND apply them as well
Signed-off-by: sven1977 <[email protected]>
Signed-off-by: sven1977 <[email protected]>
Signed-off-by: sven1977 <[email protected]>
…lus_float16' into dreamer_v3_06_2_make_200M_work_plus_float16
Signed-off-by: sven1977 <[email protected]>
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
stamp
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Yes this is critical PR to merge
@sven1977 @krfricke @kouroshHakha : could one of you confirm that CI is OK, before I merge? |
Signed-off-by: sven1977 <[email protected]>
Thanks @zhe-thoughts ! I'll let you know. Just waiting for tests now to all pass. |
…mer_v3_06_2_make_200M_work_plus_float16
Signed-off-by: sven1977 <[email protected]>
Signed-off-by: sven1977 <[email protected]>
Signed-off-by: sven1977 <[email protected]>
Signed-off-by: sven1977 <[email protected]>
Signed-off-by: sven1977 <[email protected]>
Signed-off-by: sven1977 <[email protected]>
Signed-off-by: sven1977 <[email protected]>
Hey @zhe-thoughts , this would be good to merge. All tests are passing now. Thank you so much! |
…ay-project#38461) Signed-off-by: sven1977 <[email protected]> Signed-off-by: e428265 <[email protected]>
…ay-project#38461) Signed-off-by: sven1977 <[email protected]>
…ay-project#38461) Signed-off-by: sven1977 <[email protected]> Signed-off-by: Jim Thompson <[email protected]>
…ay-project#38461) Signed-off-by: sven1977 <[email protected]> Signed-off-by: Victor <[email protected]>
DreamerV3: Make 200M (XL model) work; mixed float16 option
Why are these changes needed?
Related issue number
Checks
git commit -s
) in this PR.scripts/format.sh
to lint the changes in this PR.method in Tune, I've added it in
doc/source/tune/api/
under thecorresponding
.rst
file.