
Visualization code of Figure 1 in paper. #5

Closed
MaureenZOU opened this issue Sep 13, 2021 · 6 comments

Comments

@MaureenZOU

Hi Author,

First, thanks for your great work improving the convergence speed of DETR by such a large margin. While reading the paper, I got a bit confused about how exactly you draw the attention maps in Figure 1.

Given an object query q (1 × d) and memory features m (d × (hw)), I use the following equation to draw the attention maps:

Similarity(q, m) = Softmax(proj(q) · proj(m)), of shape 1 × (hw), where proj is the trained linear projection in the cross-attention module.
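The computation described above can be sketched as follows. This is only an illustration with random stand-in weights, not the trained projections; the dimensions (d = 256, a 20 × 30 feature map) and the 1/√d scaling are assumptions:

```python
import numpy as np

def softmax(x, axis=-1):
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

rng = np.random.default_rng(0)
d, h, w = 256, 20, 30
q = rng.standard_normal((1, d))        # object query, 1 x d
m = rng.standard_normal((d, h * w))    # flattened memory features, d x (hw)

# random stand-ins for the trained q/k projections of the cross-attention module
W_q = rng.standard_normal((d, d)) / np.sqrt(d)
W_k = rng.standard_normal((d, d)) / np.sqrt(d)

attn = softmax((q @ W_q) @ (W_k @ m) / np.sqrt(d))  # 1 x (hw), rows sum to 1
attn_map = attn.reshape(h, w)                        # reshape for plotting
```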

The attention maps I get are quite similar to the ones shown in the DETR paper:

A random object query: [screenshot]

A random object query on head A: [screenshot]

A random object query on head B: [screenshot]

A random object query on head C: [screenshot]

Could you please give some information on how the attention maps in Figure 1 were generated? Thanks!


SISTMrL commented Sep 23, 2021

Hello, have you managed to generate the attention maps like Fig. 1? @MaureenZOU

@MaureenZOU (Author)

The problem was solved by the explanation in Section 3.4, paragraph "Comparison to DETR": instead of measuring the similarity against memory + positional encoding, the authors measure the similarity against the positional encoding alone.
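Under that reading, the map comes from the positional terms only. A minimal sketch with random stand-in encodings (shapes, values, and the 1/√d scaling are illustrative assumptions, not the authors' code):

```python
import numpy as np

def softmax(x, axis=-1):
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

rng = np.random.default_rng(1)
d, h, w = 256, 20, 30
p_q = rng.standard_normal((1, d))      # query positional embedding
p_k = rng.standard_normal((d, h * w))  # 2D positional encodings of the memory

# similarity between positional encodings only -- no content/memory term
attn_map = softmax(p_q @ p_k / np.sqrt(d)).reshape(h, w)
```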


GWwangshuo commented Sep 28, 2021


@MaureenZOU
Could you please kindly provide the source code for visualizing the attention maps? That would be greatly helpful. Thanks a lot.

@DeppMeng (Collaborator)

Hi, @GWwangshuo @MaureenZOU @SISTMrL,

Thank you for your attention, and sorry for the late reply. We have not released the visualization code yet because it is not easy to write a neat and clean version of it. Once we finish rewriting this part of the code, we will make a release (there is no definite schedule yet; the authors are busy with upcoming deadlines).

Here is a brief guide:

  1. Run the validation process and record the content attention weights, position attention weights, and predictions.
  2. Filter out predictions with low classification scores, as well as too-small objects.
  3. Plot the original image.
  4. Plot the content/position attention map on top of it.
  5. Plot the prediction box on top of it.
  6. Arrange the plots in the order you would like (e.g., by attention head).
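Step 2 of the guide above, for example, might be sketched like this. The thresholds, the (cx, cy, w, h) normalized box format, and the function name are assumptions for illustration, not the authors' settings:

```python
import numpy as np

def filter_predictions(scores, boxes, score_thr=0.5, min_area=0.01):
    """Keep predictions with a high classification score and a non-tiny box.

    boxes: (N, 4) array of (cx, cy, w, h) in normalized [0, 1] coordinates.
    """
    areas = boxes[:, 2] * boxes[:, 3]
    keep = (scores >= score_thr) & (areas >= min_area)
    return scores[keep], boxes[keep]

scores = np.array([0.9, 0.3, 0.8])
boxes = np.array([[0.5, 0.5, 0.2, 0.2],    # confident, large enough -> kept
                  [0.1, 0.1, 0.3, 0.3],    # low score -> dropped
                  [0.7, 0.7, 0.05, 0.05]]) # tiny box -> dropped
kept_scores, kept_boxes = filter_predictions(scores, boxes)
```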


wulele2 commented Apr 11, 2022


Hello, when I tried to visualize DETR, I first read the self-attention of the last decoder layer to get cq: [100, 1, 256]. In addition, I read pq from the trained model: [100, 256], and then get pk from the feature map: [1, 256, h, w]. I then compute ((cq + pq)^T · pk).softmax(-1).view(h, w), but I found the result is inconsistent with the paper. I really hope to get your reply.
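For reference, the computation described above can be sketched with random stand-ins for the learned tensors (shapes follow the comment; the 1/√d scaling is an assumption, and this is not a verified reproduction of DETR's visualization):

```python
import numpy as np

def softmax(x, axis=-1):
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

rng = np.random.default_rng(2)
n, d, h, w = 100, 256, 20, 30
c_q = rng.standard_normal((n, d))      # decoder content queries cq (squeezed)
p_q = rng.standard_normal((n, d))      # learned query embeddings pq
p_k = rng.standard_normal((d, h * w))  # positional encodings pk, flattened

i = 0  # visualize a single query
attn_map = softmax((c_q[i] + p_q[i]) @ p_k / np.sqrt(d)).reshape(h, w)
```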

@Flyooofly


Hello, have you looked into how to visualize the attention weights of Deformable-DETR? I have not been able to get correct results using the plotting code provided by DETR.
