
About BN parameters #12

Closed
itongworld opened this issue Nov 13, 2019 · 3 comments

Comments

@itongworld

In your code I see that you fix the parameters of Batch Normalization after 1 epoch (10000 episodes). But when I remove this constraint (i.e., call model.train() before training and model.eval() before val/test), the performance drops sharply.

Have you observed the same degradation without fixing the BN parameters? And why does it happen?

@WenbinLee
Owner

Yes, we observed the same degradation without fixing the BN parameters. That is why we fix the parameters of Batch Normalization after 10000 episodes. We do not know the true reason for this phenomenon yet; we suspect BN affects the generalization performance of the model. Actually, research on BN has been very active recently. Thanks.
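For anyone who wants to reproduce this setting, a minimal PyTorch sketch of "fixing" the BN layers after the first epoch might look like the following. This is an illustration, not the repo's exact code: it puts only the BatchNorm layers into eval mode (so the running statistics stop updating) and freezes their affine parameters, while the rest of the model keeps training.

```python
import torch.nn as nn

def fix_bn(module):
    # Freeze a BatchNorm layer: eval mode stops the running-mean/var
    # updates, and requires_grad=False freezes gamma and beta.
    if isinstance(module, (nn.BatchNorm1d, nn.BatchNorm2d, nn.BatchNorm3d)):
        module.eval()
        for p in module.parameters():
            p.requires_grad = False

# Toy model just for demonstration.
model = nn.Sequential(nn.Conv2d(3, 8, 3), nn.BatchNorm2d(8), nn.ReLU())
model.train()        # normal training mode during the first epoch
model.apply(fix_bn)  # after ~10000 episodes, fix only the BN layers
```

Note that `model.train()` on later epochs would flip the BN layers back to training mode, so `model.apply(fix_bn)` has to be re-applied after every `model.train()` call once the freezing point is reached.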

@itongworld
Author

Thanks a lot for your insightful analysis. And do you think other metric-based few-shot classification models, e.g., ProtoNet, could also benefit from this fixed-parameter setting?

@WenbinLee
Owner

You are welcome. Yes, we reimplemented ProtoNet based on the same framework and found that this setting yields about a 1% performance improvement.
