Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

How to train on coco #1

Open
Retiina opened this issue Mar 11, 2021 · 7 comments
Open

How to train on coco #1

Retiina opened this issue Mar 11, 2021 · 7 comments

Comments

@Retiina
Copy link

Retiina commented Mar 11, 2021

Sir, thanks to share the code of your great work.
May I ask how to train on COCO dataset. I can only find the tutorial of VOC. Can you give a tutorial to prepare coco training?
Thanks in advance!

@Bohao-Lee
Copy link
Owner

Because of limited time, we only train our model on COCO dataset with MPSR. Hope this can help you.

@Retiina
Copy link
Author

Retiina commented Mar 11, 2021

Sir, thanks for you quick reply. Do you mean you apply the method on top of MPSR, but actually I am puzzled how you apply CME to two stage detectors.
MPSR is based on FPN which only take one input, but CME needs support branch and query branch.
So you equip FPN with an extra support branch and train it with the meta learning pipeline right? Could you tell me the baseline performance of that.

Also I am puzzled about the
Screen Shot 2021-03-11 at 6 47 46 PM
Argmax L_mrg is equivalent to increase intra and decrease inter, but this will lead the margin decrease? Why you say it will increase margin? Could you explain to me. Thanks in advance!

@Bohao-Lee
Copy link
Owner

For the first question, we use CME to imporve MPSR's classfication, so we use it in refinement branch, not another branch.
For the second question, we view margin as inter-class distance as margin. Hope this paper can help u. (https://arxiv.org/pdf/2005.13826.pdf)

@shuangw98
Copy link

For the first question, we use CME to imporve MPSR's classfication, so we use it in refinement branch, not another branch.
For the second question, we view margin as inter-class distance as margin. Hope this paper can help u. (https://arxiv.org/pdf/2005.13826.pdf)

Sorry, I still don't understand. CME is meta-learning based method, however, MPSR is fine-tuning method without support branch, so how to implement Feature Disturbance while there is no support mask?

@Bohao-Lee
Copy link
Owner

For the first question, we use CME to imporve MPSR's classfication, so we use it in refinement branch, not another branch.
For the second question, we view margin as inter-class distance as margin. Hope this paper can help u. (https://arxiv.org/pdf/2005.13826.pdf)

Sorry, I still don't understand. CME is meta-learning based method, however, MPSR is fine-tuning method without support branch, so how to implement Feature Disturbance while there is no support mask?

Sorry, we use Feature Disturbance to disturb feature. Because mask reflects object position, we use it. You can disturb corresponding object in image or corresponding feature. Because of limited time, we don't more experiment in it. We'll try to do some research in it.

@Wei-i
Copy link

Wei-i commented Dec 11, 2021

@Bohao-Lee 你好 针对Retina的第二个问题, 我也有疑惑,我想问下为什么margin loss的公式不是 argmin Lmrg? 因为您的论文底下也要提到要 增大类间距离D_inter 减少同类之间距离D_intra 包括你的loss 公式也是 argmin L = Ldet + Lmrg, 这不是和argmax L_mrg矛盾吗? 希望您能赐教。

@Bohao-Lee
Copy link
Owner

@Bohao-Lee 你好 针对Retina的第二个问题, 我也有疑惑,我想问下为什么margin loss的公式不是 argmin Lmrg? 因为您的论文底下也要提到要 增大类间距离D_inter 减少同类之间距离D_intra 包括你的loss 公式也是 argmin L = Ldet + Lmrg, 这不是和argmax L_mrg矛盾吗? 希望您能赐教。

十分感谢您对我们工作的关注,论文中由于疏忽导致公式错误,这里应该修改为arg min Lmrg,具体实现可以参考代码,对您造成的困惑还请您谅解,十分感谢您的指正。 @Wei-i

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

4 participants