GCNet vs. Non-Local module #2
Comments
Hi, thanks for your simple and powerful KD method. I have one question: why use GCNet rather than the commonly used Non-local module? Have you tested these two ops, since GCNet is a little better than Non-local? (Maybe some experiments comparing the two modules could be added to your paper, since this is a question reviewers will intuitively ask.)

@yzd-v
Both GCNet and Non-local can be used to add global knowledge for distillation, and we compare them in the ablation study. The lighter GCNet is easier to train for distillation.

@yzd-v
You may read the two papers carefully. In Table 6, we only use global distillation without focal distillation, while the result in FKD is achieved with all losses. However, Faster R-CNN can achieve 42.0 mAP with focal and global distillation together.
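For context on why GCNet is lighter than the Non-local block: Non-local computes a pairwise (HW × HW) attention map, while GCNet collapses this into a single query-independent attention vector of length HW that is shared by every position. Below is a minimal PyTorch sketch of a GCNet-style global context block; the class name `GlobalContextBlock` and the `reduction` parameter are illustrative assumptions, not code from this repository.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F


class GlobalContextBlock(nn.Module):
    """GCNet-style global context block (simplified sketch).

    Unlike the Non-local block, which builds an (HW x HW) pairwise attention
    map, this block predicts one HW-length attention vector shared by all
    positions, so the cost is roughly O(HW * C) instead of O(HW^2 * C).
    """

    def __init__(self, channels: int, reduction: int = 16):
        super().__init__()
        # 1x1 conv producing one attention logit per spatial position
        self.context_mask = nn.Conv2d(channels, 1, kernel_size=1)
        # bottleneck transform applied to the pooled global context vector
        self.transform = nn.Sequential(
            nn.Conv2d(channels, channels // reduction, kernel_size=1),
            nn.LayerNorm([channels // reduction, 1, 1]),
            nn.ReLU(inplace=True),
            nn.Conv2d(channels // reduction, channels, kernel_size=1),
        )

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        n, c, h, w = x.shape
        # Global context modeling: softmax over all H*W positions
        mask = self.context_mask(x).view(n, 1, h * w)
        mask = F.softmax(mask, dim=2)                    # (N, 1, HW)
        feat = x.view(n, c, h * w)                       # (N, C, HW)
        context = torch.bmm(feat, mask.transpose(1, 2))  # (N, C, 1)
        context = context.view(n, c, 1, 1)
        # Transform the context and broadcast-add it back to every position
        return x + self.transform(context)


if __name__ == "__main__":
    block = GlobalContextBlock(channels=256)
    out = block(torch.randn(2, 256, 32, 32))
    print(out.shape)  # torch.Size([2, 256, 32, 32])
```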