Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

device-side assert triggered #14

Open
ubuntu-achieve opened this issue Apr 2, 2023 · 1 comment
Open

device-side assert triggered #14

ubuntu-achieve opened this issue Apr 2, 2023 · 1 comment

Comments

@ubuntu-achieve
Copy link

ubuntu-achieve commented Apr 2, 2023

Hi, When I try to use the command python demo.py --model resnet_18 --dataset voc2010_crop --label_name bird to reproduce the results of the paper, I get the error device-side assert triggered.
And this problem occurs steadily on the 13th batch of the 6th epoch. After debugging, I found that the problem occurred with the previous parameter update, which caused one of the values in mask1[0].mask_weight to be nan, which caused all subsequent values to be nan and led to the error.
image

But the strange thing is that the first few times the parameters were updated very normally, how can I solve this problem. 😭

@WindymanJOX
Copy link

感觉像是梯度消失,我在第5轮的时候出现了这个错误

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants