-
Notifications
You must be signed in to change notification settings - Fork 247
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
You can add two vGPUs for RKE2 node on Harvester even though they won't start #10947
Comments
We need to confirm that we don't have a unique key problem. I'm quite sure that it's just a label issue, where we should show the node id + profile id. |
Something to be aware of is that if you go in and edit the YAML for the ramFB fix and the node driver ever updates the node it will overwrite the fix. This will happen every time the node is redeployed, such as for the node going into error, unresponsive, or other states. Also when you change the count. For a quick fix I would suggest just disabling the add button after adding one vGPU profile. |
Setup
Describe the bug
You can add two vGPU devices for a RKE2 Harvester cluster. They won't start due to the need for the YAML ramFB fix.
Also Rancher stays in starting state and doesn’t go into an error state. The fail state will keep looping since it’s not reporting back the error
To Reproduce
Result
The VMs won't start
Expected Result
There are few possible fixes
Screenshots
Additional context
Currently this is allowed in the Harvester UI, but you have to do the YAML fix after creation, or possibly via YAML during creation.
The text was updated successfully, but these errors were encountered: