About training under different versions of PyTorch and CUDA #14

Closed
shangbuhuan13 opened this issue Feb 8, 2022 · 3 comments

Comments

@shangbuhuan13

Thanks for your great work!
I am now training the code under PyTorch 1.10 and CUDA 11.0, because I don't have a GPU that supports the environment listed in the README. However, I got a much lower result on AP40 moderate: 13.69, compared to 16.23 from the provided checkpoint.
Do you have any ideas about why the performance deteriorates so sharply under a different environment?
Thanks very much.

@SuperMHP
Owner

I do not know. Please note that the model has a certain range of run-to-run jitter, but 13.69 is very low; that should not happen most of the time. We are also developing the model on a higher torch version.
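
Run-to-run jitter of this kind is usually reduced by pinning random seeds before training. A minimal PyTorch sketch, not taken from the GUPNet codebase (the seed value is arbitrary):

```python
import random

import numpy as np
import torch


def set_seed(seed: int = 444) -> None:
    """Pin the common RNGs so repeated runs start from the same state."""
    random.seed(seed)
    np.random.seed(seed)
    torch.manual_seed(seed)
    torch.cuda.manual_seed_all(seed)
    # Trade some speed for determinism in cuDNN convolutions.
    torch.backends.cudnn.deterministic = True
    torch.backends.cudnn.benchmark = False


set_seed()
```

Even with seeds pinned, some CUDA kernels remain nondeterministic, so a small spread across runs is still expected.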

@shangbuhuan13
Author

Thanks for the reply.
We can now reproduce the results.
But a weird thing is that training on 3 GPUs outperforms training on a single GPU.
We first trained the network with 1 A100 and got 13.69.
Then we trained it with 3 A100s and got 15.70.
All other conditions were kept the same.

All in all, we got the correct results.
Thanks again for your work.
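
One common explanation for a gap like this is the effective batch size: if the per-GPU batch size is fixed, training on 3 GPUs triples the effective batch, which interacts with the learning rate. A minimal sketch of the usual linear scaling rule, with hypothetical hyperparameter values that are not taken from the GUPNet config:

```python
import torch

# Hypothetical single-GPU hyperparameters; not the actual GUPNet config values.
per_gpu_batch = 8
base_lr = 1.25e-3

num_gpus = max(torch.cuda.device_count(), 1)

# With a fixed per-GPU batch, the effective batch grows with the GPU count.
effective_batch = per_gpu_batch * num_gpus

# Linear scaling rule (Goyal et al., "Accurate, Large Minibatch SGD"):
# scale the learning rate by the same factor as the effective batch.
scaled_lr = base_lr * num_gpus

print(f"GPUs: {num_gpus}  effective batch: {effective_batch}  lr: {scaled_lr:.2e}")
```

Whether the repo's launch scripts already apply such scaling is not stated in this thread; checking that would be the first step in explaining the 1-GPU vs 3-GPU difference.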

@Senwang98

@SuperMHP
I used 3 × V100 to train GUPNet, but got 15 mAP.
I used PyTorch 1.7 and CUDA 11 to train the model; I think 15 mAP is a little abnormal.
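
When comparing numbers across environments like this, it helps to report the versions the process actually sees rather than the install command. A small sketch using only standard PyTorch attributes:

```python
import torch

# Report the versions actually in use, which avoids ambiguity when
# comparing results across machines.
print("torch:", torch.__version__)
print("CUDA (torch build):", torch.version.cuda)
print("cuDNN:", torch.backends.cudnn.version())
print("GPU:", torch.cuda.get_device_name(0) if torch.cuda.is_available() else "none")
```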

