no_weight_decay_on_bn removes weight decay on FC #31

nagisa-eevee · 2022-03-08T02:20:41Z

I notice that if no_weight_decay_on_bn is set to True, weight decay will only apply to conv.weight. It seems that weight decay on fc layers are also removed at the same time. Is there any reason to do so?

The text was updated successfully, but these errors were encountered:

hysts · 2022-03-10T22:23:25Z

Hi, @nagisa-eevee

I think I added the option to try what's written in section 4.2 of this paper, and if I remember correctly, in my experiments, the accuracy was higher when no weight decay was applied to the fc layer. So that's the reason.
But the accuracy drop may be because I used the same parameter for weight decay on conv and fc layers, and tuning weight decay parameter on fc layer could lead to better accuracy. So in that sense, the current implementation that forces the weight decay on the fc layer to be 0 is not very good.

hysts closed this as completed Mar 15, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

no_weight_decay_on_bn removes weight decay on FC #31

no_weight_decay_on_bn removes weight decay on FC #31

nagisa-eevee commented Mar 8, 2022

hysts commented Mar 10, 2022

no_weight_decay_on_bn removes weight decay on FC #31

no_weight_decay_on_bn removes weight decay on FC #31

Comments

nagisa-eevee commented Mar 8, 2022

hysts commented Mar 10, 2022