-
Notifications
You must be signed in to change notification settings - Fork 5.6k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[air/output] Use Tune reporter if Tuner/tune.run was used #36154
[air/output] Use Tune reporter if Tuner/tune.run was used #36154
Conversation
Signed-off-by: Kai Fricke <[email protected]>
This change makes sense! One question: |
@scottsun94 RLlib can only run through the Tune entrypoints. Trainer is from Ray Train. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Makes sense! Are the failing tests unrelated?
Yes, they're due to a breakage that was fixed in #36157. I'll merge master to reflect this in the CI. |
Signed-off-by: Kai Fricke <[email protected]>
This makes sense. But what's the story of output for rllib (training an RLlib Algorithm outside of Tune)? Rllib needs to build its own output reporter? |
Yes, if rllib is not using tune/air for execution, they have to implement their own reporting logic. They can potentially still use the same output reporter, but they have to call the hooks themselves. The logic from this PR would then still use the train (single run) reporter, so it should be unaffected by the changes in this PR. |
It makes sense. Thanks! |
…t#36154) Currently we use the number of samples to detect if we should use the Train output reporter or the tune output reporter. However, this is confusing when developing on a small scale, e.g. a tuning run with only 1 trial, before going to multiple trials. Instead, we should use the entrypoint to decide which reporter we use. The Trainer will use the Train reporter, and Tuner, tune.run, and tune.run_experiments will use the tune reporter. Signed-off-by: Kai Fricke <[email protected]>
…t#36154) Currently we use the number of samples to detect if we should use the Train output reporter or the tune output reporter. However, this is confusing when developing on a small scale, e.g. a tuning run with only 1 trial, before going to multiple trials. Instead, we should use the entrypoint to decide which reporter we use. The Trainer will use the Train reporter, and Tuner, tune.run, and tune.run_experiments will use the tune reporter. Signed-off-by: Kai Fricke <[email protected]> Signed-off-by: e428265 <[email protected]>
Why are these changes needed?
Currently we use the number of samples to detect if we should use the Train output reporter or the tune output reporter.
However, this is confusing when developing on a small scale, e.g. a tuning run with only 1 trial, before going to multiple trials. Instead, we should use the entrypoint to decide which reporter we use. The
Trainer
will use the Train reporter, and Tuner, tune.run, and tune.run_experiments will use the tune reporter.Related issue number
Checks
git commit -s
) in this PR.scripts/format.sh
to lint the changes in this PR.method in Tune, I've added it in
doc/source/tune/api/
under thecorresponding
.rst
file.