Single card(RTX 3090) training results #65

Open
sjhfdl opened this issue Jun 8, 2023 · 14 comments
sjhfdl commented Jun 8, 2023

Thanks for the great work and sharing the code!

I trained with a single RTX 3090 graphics card using the maptr_tiny_r50_24e.py configuration file. The results after training are shown in the figure below, and they are not ideal: there is a big gap from the results in the paper. Have you tried single-card training, and is there anything that should be paid attention to?

[Figure: evaluation results after training]


adasfag commented Jun 20, 2023

I have also encountered the same problem.


adasfag commented Jun 20, 2023

We trained it on two A100 GPUs, and the mAP is about 0.35 at epoch 24.


sjhfdl commented Jun 20, 2023

> We trained it on two A100 GPUs, and the mAP is about 0.35 at epoch 24.

Hello, I solved this problem. The paper used 8 GPUs for training, while I trained on a single card, so I reduced the initial learning rate lr and weight_decay by a factor of 8, to lr=0.75e-4 and weight_decay=0.00125, and enlarged warmup_iters in lr_config by a factor of 8, to 4000.
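For reference, the recipe above can be written as config overrides. This is a minimal sketch, assuming the stock 8-GPU defaults of maptr_tiny_r50_24e.py (lr=6e-4, weight_decay=0.01, warmup_iters=500) and the mmcv-style `optimizer`/`lr_config` dicts; verify the field names and defaults against your local config:

```python
# Sketch of single-GPU overrides for maptr_tiny_r50_24e.py, following the
# recipe above. Assumes the 8-GPU defaults lr=6e-4, weight_decay=0.01,
# warmup_iters=500; field names follow the mmcv config convention.
n_gpus_paper = 8
n_gpus_local = 1
scale = n_gpus_local / n_gpus_paper  # 1/8

optimizer = dict(
    type='AdamW',
    lr=6e-4 * scale,            # 0.75e-4
    weight_decay=0.01 * scale,  # 0.00125
)

lr_config = dict(
    policy='CosineAnnealing',
    warmup='linear',
    warmup_iters=int(500 / scale),  # 4000: warmup stretched by 8x
    warmup_ratio=1.0 / 3,
    min_lr_ratio=1e-3,
)
```

Note that scaling weight_decay with the GPU count goes beyond the standard linear scaling rule (which covers only the learning rate), but it is what the recipe above reports.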


adasfag commented Jun 20, 2023

Thanks


adasfag commented Jun 23, 2023

Hello. I trained the model again following your advice, but the mAP is about 45.7 at epoch 24. Could you provide your single-GPU result? Thank you very much.


sjhfdl commented Jun 23, 2023

> Hello. I trained the model again following your advice, but the mAP is about 45.7 at epoch 24. Could you provide your single-GPU result? Thank you very much.

First of all, I would like to apologize. Due to the limited compute of my graphics card, after adjusting the learning rate I only trained the author's code for two epochs; the accuracy at the second epoch had already reached 0.15, so I did not continue the training. I then went on to verify my method, and the training accuracy was similar to the results given by the author. My reasoning is that multi-card results will be slightly lower than single-card ones, so I assumed the author's method would also run on my own machine and produce results similar to those in the paper.


adasfag commented Jun 23, 2023

> Hello. I trained the model again following your advice, but the mAP is about 45.7 at epoch 24. Could you provide your single-GPU result? Thank you very much.

> First of all, I would like to apologize. Due to the limited compute of my graphics card, after adjusting the learning rate I only trained the author's code for two epochs; the accuracy at the second epoch had already reached 0.15, so I did not continue the training. I then went on to verify my method, and the training accuracy was similar to the results given by the author. My reasoning is that multi-card results will be slightly lower than single-card ones, so I assumed the author's method would also run on my own machine and produce results similar to those in the paper.

It seems that the single-card result is lower than the multi-card one; maybe it needs a more suitable lr. This confuses me.


sjhfdl commented Jun 23, 2023

> It seems that the single-card result is lower than the multi-card one; maybe it needs a more suitable lr. This confuses me.

Yes, you need a good learning rate configuration; you may need to try a few values, perhaps because our graphics card models are different.
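When transferring the recipe to a different card, the linear scaling rule can at least generate a few candidate learning rates to sweep over. A hypothetical helper sketch (`lr_candidates` is my name, not part of MapTR; the base values assume the 8-GPU defaults discussed above):

```python
def lr_candidates(base_lr=6e-4, base_gpus=8, gpus=1,
                  factors=(0.5, 1.0, 2.0)):
    """Return candidate learning rates around the linearly scaled value.

    The linear scaling rule shrinks the learning rate in proportion to
    the drop in effective batch size (here: the GPU count, assuming the
    per-GPU batch size is unchanged); the extra factors give a small
    manual sweep around that point.
    """
    scaled = base_lr * gpus / base_gpus
    return [scaled * f for f in factors]

# Candidates for a single card against the paper's 8-GPU baseline:
candidates = lr_candidates()  # [3.75e-05, 7.5e-05, 0.00015]
```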


sjhfdl commented Jun 23, 2023

> Hello. I trained the model again following your advice, but the mAP is about 45.7 at epoch 24. Could you provide your single-GPU result? Thank you very much.

Could you show me the test results of the training?


adasfag commented Jun 23, 2023

0.214-0.5, 0.498-1.0, 0.659-1.0


lrx02 commented Jul 1, 2023

I have run into this problem as well.

@dynamic721

Hello guys, how much time did you spend training with a single card?

@VanHelen

Hello, have you solved this problem? I trained with a single RTX 4090 using the maptr_tiny_r50_24e.py configuration file, but it always stops training at epoch 10, with the following problem:
[screenshot of the error]
May I ask what parameters are required to complete 24 epochs of training?

@VanHelen

> Hello guys, how much time did you spend training with a single card?

Hello, have you successfully trained the model on a single GPU? May I ask what parameters need to be modified? My training currently fails at epoch 10.
[screenshot of the error]
