-
Notifications
You must be signed in to change notification settings - Fork 111
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Clarification on the relationship between num_unroll_steps and infer_context_length in UniZero #248
Comments
|
The |
I have conducted a thorough comparison between the parameters specified in the paper and those in the configuration files. Unfortunately, I did not identify any significant discrepancies. I executed atari_unizero_stack4_config.py with only minor modifications: I adjusted the frequency of checkpoint saves and enabled video file recording. These changes should not materially affect the training process or results. |
Thank you for your feedback. We will verify and address the performance issues related to atari_unizero_stack4_config.py within a week. The code using stack1 in the main branch, atari_unizero_config.py, has already been confirmed to perform consistently with the curves presented in the paper. We recommend using atari_unizero_config.py for your tests and research in the meantime. Thank you for your patience. |
Thank you for your prompt and helpful reply. I truly appreciate your dedication to addressing this issue. The work you're doing with LightZero is impressive and valuable to the research community. |
For clarification: In the Tensorboard, do the collector_step and tabs with the *_step prefix represent envstep? Or do they represent *_iter? Which data sources (collector, evaluator, etc.) were used for the statistics presented in the paper? I want to ensure I'm interpreting the data correctly. Thank you for your assistance. |
Hello, in Tensorboard, tags with the |
Thank you very much for your clear and informative response. I believe this addresses all my current questions, so I'll be closing this thread. Thank you again for your time and support throughout this discussion. |
I've been studying the UniZero implementation and I have a question about two key parameters:
I noticed that these values are different, and I'm curious about the reasoning behind this design choice. Specifically:
num_unroll_steps
set higher thaninfer_context_length
?I'd greatly appreciate any insights you could provide on the rationale behind these parameter choices and their impact on the model's performance and behavior.
Thank you for your time and for creating this interesting algorithm.
The text was updated successfully, but these errors were encountered: