Thanks for your excellent work!
I have a question about the provided model. In the provided Conditional DETR model "conditional detr resnet50", transformer.decoder.layer.cross_attn.out_proj.weight/bias have dimensions 256x256 and 256, respectively. But since the input of this cross-attention is the concatenation of two 256-d queries, it seems they should be 512x512 and 512. It really confuses me. Looking forward to your help, thanks!
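For reference, the observation can be reproduced by reading the shapes straight from the checkpoint's state dict. This is a minimal sketch: the file name is hypothetical, and it assumes a DETR-style checkpoint that nests the weights under a "model" key.

    import torch

    # Hypothetical checkpoint file name -- substitute the released weights.
    ckpt = torch.load("conditional_detr_r50.pth", map_location="cpu")
    state_dict = ckpt.get("model", ckpt)  # DETR-style checkpoints nest weights under "model"

    # Print the shape of every cross-attention output projection.
    for name, tensor in state_dict.items():
        if "cross_attn.out_proj" in name:
            print(name, tuple(tensor.shape))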
Sorry to disturb you again. My question is about the out_proj of cross_attn, i.e.

    self.cross_attn = nn.MultiheadAttention(d_model * 2, nhead, dropout=dropout, vdim=d_model)

rather than the out_proj you mentioned. In the source code of nn.MultiheadAttention, out_proj is set as

    self.out_proj = NonDynamicallyQuantizableLinear(embed_dim, embed_dim, bias=bias, **factory_kwargs)

and in your code embed_dim is set to d_model * 2, i.e. 512, so I would expect out_proj to be 512-d rather than 256-d, yet the provided model's weights are all 256-d.
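To make the dimension argument concrete, here is a minimal sketch against stock PyTorch (not the repo's own attention code): with embed_dim = 512, nn.MultiheadAttention always builds out_proj as Linear(embed_dim, embed_dim), even when vdim differs.

    import torch.nn as nn

    # Stock PyTorch sizes out_proj by embed_dim, not by vdim:
    # out_proj = NonDynamicallyQuantizableLinear(embed_dim, embed_dim, ...)
    mha = nn.MultiheadAttention(embed_dim=512, num_heads=8, vdim=256)
    print(mha.out_proj.weight.shape)  # torch.Size([512, 512])
    print(mha.out_proj.bias.shape)    # torch.Size([512])

So 256x256 out_proj weights would only load into an attention implementation whose output projection is sized by vdim (d_model) instead of embed_dim, for example a modified copy of MultiheadAttention; that is an assumption here, not something confirmed in this thread.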