
Loss becomes NaN after some time of training #35

Open
DrokBing opened this issue Nov 18, 2023 · 8 comments
Labels
bug Something isn't working

Comments

@DrokBing

Screenshot from 2023-11-18 11-32-03
And the rendered image has a white background.

@guanjunwu
Collaborator

guanjunwu commented Nov 21, 2023

Wow... I also found the same problem during optimization. Initially I thought it was an error on my training machine.
Most cases happen on scenes with more background points, such as flame_salmon_1 and coffee_martini in the Neu3D dataset. I think it may be numerical overflow during training. Do you have any ideas?
I hope we can solve it together if you have time :)
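For anyone trying to narrow this down, a generic PyTorch-side guard may help locate where the NaNs first appear. This is not code from this repo; `loss`, `optimizer`, and `iteration` are placeholders for the training loop's own variables:

```python
import torch

# Report the first backward op that produces NaN/Inf (slow; enable only while debugging).
torch.autograd.set_detect_anomaly(True)

def safe_step(loss: torch.Tensor, optimizer: torch.optim.Optimizer, iteration: int) -> bool:
    """Skip the update when the loss is already NaN/Inf, so one bad batch does
    not poison the model parameters. Returns True if a step was taken."""
    if not torch.isfinite(loss):
        print(f"[iter {iteration}] non-finite loss {loss.item()}, skipping this step")
        optimizer.zero_grad(set_to_none=True)
        return False
    loss.backward()
    optimizer.step()
    optimizer.zero_grad(set_to_none=True)
    return True
```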

@guanjunwu guanjunwu added the bug Something isn't working label Nov 23, 2023
@Arisilin

I also encountered this problem when training on my own scene; the loss may become NaN after several iterations in the fine stage. Besides, there are also some cases where "RuntimeError: numel: integer multiplication overflow" happens during fine-stage training. I am not sure if it is caused by a similar reason.
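The "numel: integer multiplication overflow" error often points to a tensor whose element count has blown up, which can happen if NaN positions survive into densification. A hedged sketch of a guard, assuming a 3DGS-style interface (a `get_xyz` positions tensor and a `prune_points(mask)` method; adjust the names to this codebase):

```python
import torch

def drop_nonfinite_gaussians(gaussians) -> None:
    """Remove Gaussians whose positions have turned NaN/Inf before densification,
    so they cannot inflate the point count until tensor allocations overflow.

    Assumes a 3DGS-style interface: `gaussians.get_xyz` is an (N, 3) positions
    tensor and `gaussians.prune_points(mask)` deletes the points where mask is True.
    """
    bad = ~torch.isfinite(gaussians.get_xyz).all(dim=-1)
    if bad.any():
        print(f"pruning {int(bad.sum())} non-finite Gaussians")
        gaussians.prune_points(bad)
```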

@leo-frank

I met the same problem on a COLMAP-format dataset.


The PSNR suddenly drops to an unexpected value (4.28), while the number of points in the point cloud also decreases.

@guanjunwu
Collaborator

I guess the scene's bounding box may be so large that it causes the error during backpropagation through the Gaussian deformation field network.
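If the bounding box is the culprit, one generic mitigation (not code from this repo) is to normalize point coordinates into the scene bounds before they enter the deformation network, so distant background points do not feed huge values into the grid/MLP and its gradients:

```python
import torch

def contract_to_unit_cube(xyz: torch.Tensor,
                          aabb_min: torch.Tensor,
                          aabb_max: torch.Tensor) -> torch.Tensor:
    """Map world-space points into [-1, 1] using the scene axis-aligned bounding
    box, so very large background coordinates do not blow up the deformation
    network's inputs and gradients."""
    scale = (aabb_max - aabb_min).clamp(min=1e-6)  # guard against a degenerate box
    unit = (xyz - aabb_min) / scale                # -> [0, 1] inside the box
    return unit * 2.0 - 1.0                        # -> [-1, 1]
```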

@GotFusion

> I guess the scene's bounding box may be so large that it causes the error during backpropagation through the Gaussian deformation field network.

Is there any solution to this problem?

@guanjunwu
Collaborator

In my tests, setting no_dr=True and no_ds=True (disabling the deformation of rotation and scaling) reduces how often the problem happens.
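For reference, a sketch of how that might look as a per-scene config override, assuming the `ModelHiddenParams = dict(...)` style used by the config files under `arguments/` in this repo (the exact flag names should be checked against the code):

```python
# Hypothetical per-scene config override; the flag names follow the suggestion above.
ModelHiddenParams = dict(
    no_dr = True,   # disable the rotation deformation head
    no_ds = True,   # disable the scaling deformation head
)
```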

@zhaohaoyu376

> In my tests, setting no_dr=True and no_ds=True (disabling the deformation of rotation and scaling) reduces how often the problem happens.

However, it seems that performance might be significantly affected by this approach. Are there any other solutions?

@zhaohaoyu376

Why do I always have to restart training because the loss becomes NaN? I can't even finish a single run.
