
Hi, some questions about the model! #35 (Closed)

roger-cv opened this issue Dec 2, 2021 · 7 comments

roger-cv commented Dec 2, 2021

Hi, thanks for this nice work. Recently I have been trying to adapt this excellent model to my own task. Why is the "average" operation required here? The shape of the tensor changes from 64×64 to 8×8 after the "average" operation, but according to the description "fusion at (B, 64, 64, 64)" the tensor should stay at 64×64.
[Screenshot attached: QQ截图20211202152824, showing the code with the average operation]

ap229997 (Collaborator) commented Dec 2, 2021

The fusion can also be done at 64×64 resolution, but that would be too computationally expensive since a transformer is used (attention has quadratic complexity in the number of tokens), so I reduce the size to 8×8 at each resolution of the intermediate feature maps.
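
For concreteness, here is a minimal sketch of that pooling step in PyTorch (the tensor names and the exact pooling call are my own illustration, not necessarily the repo's code):

```python
import torch
import torch.nn.functional as F

B, C = 2, 64
img_feat = torch.randn(B, C, 64, 64)  # intermediate feature map at 64x64

# Average-pool to 8x8 before fusion: 64*64 = 4096 tokens -> 8*8 = 64 tokens.
# Attention cost grows quadratically with token count, so this cuts the
# attention cost by roughly (4096 / 64)^2 = 4096x.
pooled = F.adaptive_avg_pool2d(img_feat, output_size=8)  # (B, C, 8, 8)

# Flatten the spatial grid into a token sequence for the transformer.
tokens = pooled.flatten(2).permute(0, 2, 1)              # (B, 64, C)
```

If the fused features are needed back at the original resolution, they can be interpolated up again after the transformer.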

roger-cv (Author) commented Dec 6, 2021

Thanks for your quick reply. So, if I understand correctly, the input feature map to the transformer at each layer is downsampled to 8×8?

ap229997 (Collaborator) commented Dec 6, 2021

That's correct. There are now several transformer variants that address the quadratic complexity of attention (e.g. Linformer), so it may be possible to use the transformer without downsampling.
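
For reference, a minimal single-head sketch of the Linformer idea (my own simplified code, not the official implementation): keys and values are projected from n tokens down to a fixed k, so the attention map is n×k instead of n×n and the cost is linear in n.

```python
import torch
import torch.nn as nn

class LinformerAttention(nn.Module):
    """Single-head Linformer-style attention (simplified sketch).

    Keys/values are compressed along the sequence axis from n tokens to a
    fixed k tokens, so the attention map is (n x k) instead of (n x n).
    """
    def __init__(self, dim, seq_len, k=64):
        super().__init__()
        self.scale = dim ** -0.5
        self.to_q = nn.Linear(dim, dim, bias=False)
        self.to_k = nn.Linear(dim, dim, bias=False)
        self.to_v = nn.Linear(dim, dim, bias=False)
        # learned sequence-length projections: n -> k
        self.proj_k = nn.Parameter(torch.randn(seq_len, k) * seq_len ** -0.5)
        self.proj_v = nn.Parameter(torch.randn(seq_len, k) * seq_len ** -0.5)

    def forward(self, x):  # x: (B, n, dim)
        q, k, v = self.to_q(x), self.to_k(x), self.to_v(x)
        k = torch.einsum('bnd,nk->bkd', k, self.proj_k)  # (B, k, dim)
        v = torch.einsum('bnd,nk->bkd', v, self.proj_v)  # (B, k, dim)
        attn = (q @ k.transpose(-2, -1) * self.scale).softmax(dim=-1)  # (B, n, k)
        return attn @ v  # (B, n, dim)

# e.g. a full 64x64 feature map flattened to 4096 tokens, no downsampling:
layer = LinformerAttention(dim=64, seq_len=4096, k=256)
out = layer(torch.randn(2, 4096, 64))  # (2, 4096, 64)
```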

roger-cv (Author) commented Dec 8, 2021


OK. Another interesting question: could this transformer-based fusion be replaced with other transformer architectures, such as Swin or PVT? I notice the current transformer is based on GPT, which was designed for NLP.

ap229997 (Collaborator) commented Dec 8, 2021

I agree, architecture design can be improved quite a bit.

roger-cv (Author) commented Dec 9, 2021

OK, nice work. Thanks for your reply.

Kin-Zhang commented

> I agree, architecture design can be improved quite a bit.

But it may require more resources to train...
