how does the privileged observation work? #33
Comments
Hi, the privileged observations feature is implemented, but we did not use it in the paper. These privileged observations are not used in a teacher-student distillation. Hopefully, this clarifies the distinction.
Thanks for your great contribution!
I notice that you use the privileged observations as critic observations for asymmetric training in PPO, but you haven't mentioned this in the paper.
Could you please explain this part more clearly?
Also, I notice that in other works by your team, privileged observations are used for distillation, where the student policy learns to reconstruct them. Are these two kinds of privileged observations the same? If so, how does it work?
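For readers unfamiliar with the term, here is a minimal sketch of what "privileged observations as critic observations" means in asymmetric actor-critic training. It is not the repository's code: the class names, dimensions, and the single linear layer standing in for each network are all assumptions made for illustration. The point is only the data flow: the actor sees the regular observations, while the critic is fed the privileged observations when they exist and falls back to the regular ones otherwise.

```python
import random


class Linear:
    """Tiny dense layer y = W x + b, a stand-in for a real MLP (illustrative only)."""

    def __init__(self, in_dim, out_dim, rng):
        self.in_dim = in_dim
        self.w = [[rng.uniform(-0.1, 0.1) for _ in range(in_dim)]
                  for _ in range(out_dim)]
        self.b = [0.0] * out_dim

    def __call__(self, x):
        assert len(x) == self.in_dim, "input size mismatch"
        return [sum(wi * xi for wi, xi in zip(row, x)) + bi
                for row, bi in zip(self.w, self.b)]


class AsymmetricActorCritic:
    """Hypothetical actor-critic whose critic may use a different input than the actor."""

    def __init__(self, num_obs, num_privileged_obs, num_actions, seed=0):
        rng = random.Random(seed)
        # The actor only ever sees the regular observations.
        self.actor = Linear(num_obs, num_actions, rng)
        # The critic is sized for privileged observations if they are configured.
        critic_in = num_privileged_obs if num_privileged_obs else num_obs
        self.critic = Linear(critic_in, 1, rng)

    def act(self, obs):
        return self.actor(obs)

    def evaluate(self, obs, privileged_obs=None):
        # Fall back to the regular observations when no privileged ones are given.
        critic_obs = privileged_obs if privileged_obs is not None else obs
        return self.critic(critic_obs)[0]


# Usage: 4 regular observations, 6 privileged ones (the regular 4 plus 2 extras).
policy = AsymmetricActorCritic(num_obs=4, num_privileged_obs=6, num_actions=2)
obs = [0.1, -0.2, 0.3, 0.0]
privileged_obs = obs + [0.5, 1.0]  # hypothetical extras, e.g. simulator-only state
action = policy.act(obs)
value = policy.evaluate(obs, privileged_obs)

# Symmetric case: no privileged observations, so the critic shares the actor's input.
symmetric = AsymmetricActorCritic(num_obs=4, num_privileged_obs=None, num_actions=2)
symmetric_value = symmetric.evaluate(obs)
```

Because the critic is only needed during training, the extra inputs never have to be available at deployment. That is different from teacher-student distillation, where a student policy is trained to act without the privileged inputs that the teacher had.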