Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

为什么在训练的第二个epoch时显示cuda out of memory #98

Open
lemonade-lm opened this issue Mar 7, 2024 · 11 comments
Open

为什么在训练的第二个epoch时显示cuda out of memory #98

lemonade-lm opened this issue Mar 7, 2024 · 11 comments

Comments

@lemonade-lm
Copy link

lemonade-lm commented Mar 7, 2024

No description provided.

@mooncake199809
Copy link

我在3DMatch数据集上训练时,训练到第二个epoch时现存不够,我并没有修改代码,并且现存是48G的,为什么会出现这种情况

Do you resolve this problem?

@jlyw1017
Copy link

same problem here.

@mooncake199809
Copy link

same problem here.

I solve this problem by changing the number of points from 30000 to 29000 for 3DMatch.

@Baros-Young
Copy link

same problem here.

I solve this problem by changing the number of points from 30000 to 29000 for 3DMatch.

Hello,how long did it take you to train the model?It took me 5.7 hours to train the model for 3 epochs on a 3090 GPU using preset parameters, which is really beyond my acceptable range...

@mooncake199809
Copy link

same problem here.

I solve this problem by changing the number of points from 30000 to 29000 for 3DMatch.

Hello,how long did it take you to train the model?It took me 5.7 hours to train the model for 3 epochs on a 3090 GPU using preset parameters, which is really beyond my acceptable range...

It takes me more than three days using a 3090GPUS

@Baros-Young
Copy link

same problem here.

I solve this problem by changing the number of points from 30000 to 29000 for 3DMatch.

Hello,how long did it take you to train the model?It took me 5.7 hours to train the model for 3 epochs on a 3090 GPU using preset parameters, which is really beyond my acceptable range...

It takes me more than three days using a 3090GPUS

Thanks for your reply,it helps!

@Baros-Young
Copy link

same problem here.

I solve this problem by changing the number of points from 30000 to 29000 for 3DMatch.

Hello,how long did it take you to train the model?It took me 5.7 hours to train the model for 3 epochs on a 3090 GPU using preset parameters, which is really beyond my acceptable range...

It takes me more than three days using a 3090GPUS

tp
I get wrong rre and rte result using pretrained weight,so what could be the reason?

@mooncake199809
Copy link

same problem here.

I solve this problem by changing the number of points from 30000 to 29000 for 3DMatch.

Hello,how long did it take you to train the model?It took me 5.7 hours to train the model for 3 epochs on a 3090 GPU using preset parameters, which is really beyond my acceptable range...

It takes me more than three days using a 3090GPUS

tp I get wrong rre and rte result using pretrained weight,so what could be the reason?

You should run both of these two codes:
CUDA_VISIBLE_DEVICES=0 python test.py --snapshot=../../weights/geotransformer-3dmatch.pth.tar --benchmark=3DMatch
CUDA_VISIBLE_DEVICES=0 python eval.py --benchmark=3DMatch --method=lgr
It seems that you only run the first code

@mooncake199809
Copy link

same problem here.

I solve this problem by changing the number of points from 30000 to 29000 for 3DMatch.

Hello,how long did it take you to train the model?It took me 5.7 hours to train the model for 3 epochs on a 3090 GPU using preset parameters, which is really beyond my acceptable range...

It takes me more than three days using a 3090GPUS

tp I get wrong rre and rte result using pretrained weight,so what could be the reason?

Hellow, could you please tell me the testing result in the 3DMatch dataset? I found that the testing results is different from results reported in the paper in terms of IR metric.

@wuzelei123
Copy link

你是在linux上跑的吗

@mooncake199809
Copy link

你是在linux上跑的吗

yes

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

5 participants