This repository has been archived by the owner on Oct 31, 2023. It is now read-only.

The input/output feature dimensions of Transformer Encoder and Causal Transformer Decoder? #41

Open
yxgz opened this issue May 10, 2022 · 1 comment



yxgz commented May 10, 2022

Hi, thanks for your great project!
I am wondering about the input/output feature dimensions of the Transformer Encoder. Section 4.1 of the paper says its input and output feature dimensions are both 768-D — is that right? However, Section 4.4 says the input feature dimension of the Causal Transformer Decoder is 2048-D. What is the output feature dimension of the Causal Transformer Decoder? And is there a dimension conversion (768-D -> 2048-D) before the Causal Transformer Decoder?

@rohitgirdhar
Contributor

Hi, thanks for your interest and apologies for the delay.
You are correct: there is a linear layer that does this mapping --

```python
self.mapper_to_inter = nn.Linear(backbone_dim,
```
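A minimal sketch of the mapping being discussed, based only on the dimensions mentioned in this thread (the variable names `backbone_dim` and `inter_dim` and the standalone module below are illustrative assumptions, not the repo's actual code):

```python
import torch
import torch.nn as nn

# Assumed dimensions from the discussion above:
# the encoder outputs 768-D features (Sec. 4.1) and the
# Causal Transformer Decoder expects 2048-D inputs (Sec. 4.4).
backbone_dim = 768
inter_dim = 2048

# Linear projection from encoder features to decoder input space,
# analogous to the `mapper_to_inter` layer quoted above.
mapper_to_inter = nn.Linear(backbone_dim, inter_dim)

# Example: a batch of 4 clips, each with 10 time steps of 768-D features.
feats = torch.randn(4, 10, backbone_dim)
projected = mapper_to_inter(feats)
print(projected.shape)  # torch.Size([4, 10, 2048])
```

The projection is applied per time step, so only the last (feature) dimension changes; batch and temporal dimensions pass through unchanged.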
