the specific role of the decoder in transformer structure #50

ANdong-star · 2021-11-10T14:14:26Z

Hi!
You said that "In the encoder-decoder attention module, the target query can attend to all positions on the template and the search region features, thus learning robust representations for the final bounding box prediction." in your paper. How to understand that? It's really abstract for me.
Thanks for your reply!

MasterBin-IIAU · 2021-11-30T01:57:41Z

@ANdong-star Hi, this process is quite similar to that in the DETR decoder. In DETR, 100 object queries interact with the image features output by the encoder. In STARK, one target query interacts with the joint template-search features to extract the target information. Finally the box prediction head integrate the output of the encoder and the decoder to predict the final box results.

ANdong-star · 2021-12-02T08:03:46Z

@ANdong-star Hi, this process is quite similar to that in the DETR decoder. In DETR, 100 object queries interact with the image features output by the encoder. In STARK, one target query interacts with the joint template-search features to extract the target information. Finally the box prediction head integrate the output of the encoder and the decoder to predict the final box results.

got it! thanks!

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

the specific role of the decoder in transformer structure #50

the specific role of the decoder in transformer structure #50

ANdong-star commented Nov 10, 2021

MasterBin-IIAU commented Nov 30, 2021

ANdong-star commented Dec 2, 2021

the specific role of the decoder in transformer structure #50

the specific role of the decoder in transformer structure #50

Comments

ANdong-star commented Nov 10, 2021

MasterBin-IIAU commented Nov 30, 2021

ANdong-star commented Dec 2, 2021