New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Is EMA used in this work? #5
Comments
Hi @JacobYuan7, Thank you for your interest in this work. Similar to MDETR, MDef-DETR also uses EMA during training and for evaluating the weights are loaded from the original model. You can try using the ema model for testing by loading the weights from |
As I understand it, MDETR uses 'model_ema' to evaluate the model, which is shown in: BTW, the training of the language model follows MDETR, right? With a warmup schedule and then decrease linearly back to zero for the rest of the training. |
Hi, my apologies for the delayed reply. Yes, your understanding is correct. MDETR is using
Yes, this is the case. Further, we are planning to release the training scripts by the end of this month. Stay tuned! |
Sure, I will! Thx so much for your kind response. |
Hello author, thanks for your great work. I raise a question about the usage of Exponential Moving Average (EMA) in this paper, hoping you can provide me with some clues. It seems that this paper does not detail in this part. As far as I know, MDETR uses it and evaluate use the EMA model. So I wonder is it used in this work? If it is actually used, why should we evaluate by the EMA model rather than the original one?
The text was updated successfully, but these errors were encountered: