-
Notifications
You must be signed in to change notification settings - Fork 180
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Llava CUDA error: device-side assert triggered #543
Comments
There were many hanging processes so I needed to kill them and re-deploy slang again. However, now I get a different issue, again coming from the llava implementation:
Anybody ideas how to fix this? Thanks |
1 used one gpu card is ok, but two has the same problem |
I explicitly restricted the access to 1 GPU with CUDA_VISIBLE_DEVICES=0. I do have more GPUs on the node, but it should only use this device, plus I am getting this in the logs, so it means it uses one device:
|
I am trying to deploy
llava-v1.6-34b
on A100 80GB but am getting the following error:Does anybody have an idea how to fix the issue? Thanks
The text was updated successfully, but these errors were encountered: