Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Converge trend of the model #2

Closed
hwfan opened this issue Jan 2, 2022 · 1 comment
Closed

Converge trend of the model #2

hwfan opened this issue Jan 2, 2022 · 1 comment

Comments

@hwfan
Copy link

hwfan commented Jan 2, 2022

Hi authors,

Thanks for your open-source implementation, I read your instruction and tried to reproduce the final detection performance. However I realized the converge speed of the model is too low: it takes almost 2 days to reach 150 epoches on two nodes with 8 gpus on each node. Have you tried any way to accelerate the procedure? Will scaling up the learning rate at the start of the training be helpful?

@cjw2021
Copy link
Owner

cjw2021 commented Jan 2, 2022

Thanks for your interest in our work.
We have tried 2x learning rate (the same setting as deformable DETR), but it didn't work.
The heavy burden of predicting verb class may cause slow convergence.

@cjw2021 cjw2021 closed this as completed Feb 9, 2022
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants