Converge trend of the model #2

hwfan · 2022-01-02T08:02:23Z

Hi authors,

Thanks for your open-source implementation, I read your instruction and tried to reproduce the final detection performance. However I realized the converge speed of the model is too low: it takes almost 2 days to reach 150 epoches on two nodes with 8 gpus on each node. Have you tried any way to accelerate the procedure? Will scaling up the learning rate at the start of the training be helpful?

cjw2021 · 2022-01-02T08:37:39Z

Thanks for your interest in our work.
We have tried 2x learning rate (the same setting as deformable DETR), but it didn't work.
The heavy burden of predicting verb class may cause slow convergence.

cjw2021 closed this as completed Feb 9, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Converge trend of the model #2

Converge trend of the model #2

hwfan commented Jan 2, 2022 •

edited

cjw2021 commented Jan 2, 2022

Converge trend of the model #2

Converge trend of the model #2

Comments

hwfan commented Jan 2, 2022 • edited

cjw2021 commented Jan 2, 2022

hwfan commented Jan 2, 2022 •

edited