
Unmentioned but critical LayerNorm #3

Open
gathierry opened this issue Mar 18, 2022 · 5 comments

Comments

@gathierry
Owner

To achieve results comparable to the original paper, LayerNorm is applied to the feature before the NF. This is never mentioned in the paper and the usage is very tricky (but it is the only way that works for me); see the sketch after this list:

  • resnet18 and wide-resnet-50: use a trainable LayerNorm
  • CaiT and DeiT: use the final norm from the pre-trained model and fix its affine parameters
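A minimal sketch of the two usages, assuming a (B, C, H, W) feature map for the ResNets and a (B, N, C) token feature for CaiT/DeiT; all shapes and the `final_norm` stand-in are illustrative, not the repo's actual code:

```python
import torch
import torch.nn as nn

# Case 1 (resnet18 / wide-resnet-50): a trainable LayerNorm over the
# full (C, H, W) feature shape, learned together with the NF.
B, C, H, W = 4, 256, 16, 16           # illustrative shapes
feature = torch.randn(B, C, H, W)
norm = nn.LayerNorm([C, H, W], elementwise_affine=True)  # trainable affine
feature = norm(feature)

# Case 2 (CaiT / DeiT): reuse the pre-trained model's final norm and
# freeze its affine parameters before feeding tokens to the NF.
tokens = torch.randn(B, 196, 384)     # (B, N, C), illustrative
final_norm = nn.LayerNorm(384)        # stands in for the model's own `norm`
for p in final_norm.parameters():
    p.requires_grad = False           # "fix its affine parameters"
tokens = final_norm(tokens)
```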
@cytotoxicity8

I measured the performance of the models without the LayerNorm parts. For both resnet18 and wide-resnet50, AUROC was quite similar, sometimes even better than the original ones. DeiT also showed comparable performance (lower by about 0.03~0.05). However, with CaiT the loss was extremely high and AUROC was 0.5! I can't understand why these models behave so differently depending on Layer Normalization.

@cytotoxicity8

cytotoxicity8 commented May 28, 2022

[image: loss curves; the red one is w/o elementwise-affine]
I am experimenting with ways to improve FastFlow; discussion is always welcome.

@AncientRemember

Use x = x.flatten(2).transpose(1, 2) to reshape the feature map from BCHW to (B, N, C); that way the LayerNorm does not depend on the input size.
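A small sketch of that reshaping, with illustrative shapes (the variable names are not from the repo):

```python
import torch
import torch.nn as nn

# Reshape BCHW -> (B, N, C) so LayerNorm only needs the channel count C
# and no longer depends on H and W.
B, C, H, W = 4, 256, 16, 16
x = torch.randn(B, C, H, W)
x = x.flatten(2).transpose(1, 2)   # (B, C, H*W) -> (B, H*W, C)
x = nn.LayerNorm(C)(x)             # normalizes over C only
# Undo the reshape if the NF expects a spatial feature map again:
x = x.transpose(1, 2).reshape(B, C, H, W)
```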

@AncientRemember

AncientRemember commented Sep 23, 2022

Maybe using BN after the conv2d will work.
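A hedged sketch of that idea: BatchNorm2d's statistics are per channel, so it is input-size independent by construction (the layers here are placeholders, not FastFlow's actual layers):

```python
import torch
import torch.nn as nn

# BatchNorm2d after a conv as an alternative to LayerNorm on BCHW features.
conv = nn.Conv2d(256, 256, kernel_size=3, padding=1)
bn = nn.BatchNorm2d(256)           # per-channel statistics, any H/W
x = torch.randn(4, 256, 16, 16)
x = bn(conv(x))
```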

@gathierry
Owner Author

gathierry commented Sep 23, 2022

Well, after learning more about transformers, I realize that adding LayerNorm to intermediate output feature maps is very common, for example when using transformers as the backbone in semantic segmentation (https://github.com/SwinTransformer/Swin-Transformer-Semantic-Segmentation/blob/87e6f90577435c94f3e92c7db1d36edc234d91f6/mmseg/models/backbones/swin_transformer.py#L620). So I guess that's why the paper never mentioned it.
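For reference, a sketch of the pattern the linked Swin backbone follows: one LayerNorm per stage, applied to each intermediate output before it is handed to the downstream head (channel sizes are illustrative, not Swin's actual configuration):

```python
import torch
import torch.nn as nn

# One LayerNorm per backbone stage, applied to each intermediate
# output feature map over its channel dimension.
stage_channels = [96, 192, 384, 768]
norms = nn.ModuleList(nn.LayerNorm(c) for c in stage_channels)

def norm_stage_output(i, x):
    """Normalize the (B, C, H, W) output of stage i over its channels."""
    B, C, H, W = x.shape
    x = x.flatten(2).transpose(1, 2)          # (B, H*W, C)
    x = norms[i](x)
    return x.transpose(1, 2).reshape(B, C, H, W)
```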

And for resnet, maybe LayerNorm is not necessary, as pointed out by @cytotoxicity8.
