There seem to be some bugs in this repo. For instance, the parameter d_model of nn.TransformerEncoderLayer in line 30 of transformer.py is the number of expected features in the input, which must be divisible by num_heads. But in this repo, d_model is set to NUM_BOX_FEATURES (109) while num_heads is 4 (109 % 4 = 1, not 0).
As described in the paper, when num_features is not divisible by num_heads, a fully-connected layer is applied to project num_features into d_model. If you look at the hyperparameter setting for the commit you are referring to, you will find nhead is set to 1, so no fully-connected layer is required. The latest main branch shows an example of adding the fully-connected layer, so you can now vary the value of nhead.
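A minimal sketch of the fix being described: project the raw 109-dimensional box features to a width divisible by nhead before the encoder. The class name, the choice of d_model = 112, and the layer count here are hypothetical, not taken from the repo.

```python
import torch
import torch.nn as nn

NUM_BOX_FEATURES = 109  # raw feature size from the repo
D_MODEL = 112           # hypothetical width, chosen so that 112 % 4 == 0
NHEAD = 4

class BoxEncoder(nn.Module):
    def __init__(self, num_features=NUM_BOX_FEATURES, d_model=D_MODEL, nhead=NHEAD):
        super().__init__()
        # Fully-connected projection: maps num_features -> d_model so the
        # divisibility constraint of nn.TransformerEncoderLayer is satisfied.
        self.proj = nn.Linear(num_features, d_model)
        layer = nn.TransformerEncoderLayer(d_model=d_model, nhead=nhead,
                                           batch_first=True)
        self.encoder = nn.TransformerEncoder(layer, num_layers=2)

    def forward(self, x):
        # x: (batch, num_boxes, num_features)
        return self.encoder(self.proj(x))

x = torch.randn(8, 16, NUM_BOX_FEATURES)
out = BoxEncoder()(x)
print(out.shape)  # torch.Size([8, 16, 112])
```

With nhead = 1 the projection is unnecessary, since any d_model is divisible by 1, which matches the hyperparameter setting in the older commit.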