Batch norm parameters are not updating #9

sfarkya · 2020-06-17T16:42:16Z

Hello,

I tried to replicate your results for ResNet50+LSTM on UCF101. The reported performance in the default setting is 80.20, and I got 80.30. However, I think there is a bug in training. The batch norm statistics (moving mean and variance) are not updated during training; the model keeps using the moving mean and variance from the loaded model (ImageNet weights for ResNet). Since these are not learnable parameters, you need to explicitly collect their update ops and put a control dependency on them. I think this will improve performance, though I haven't run this change.
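To illustrate the failure mode described above, here is a minimal, framework-independent sketch in plain Python. It is not the repository's code: the class `BatchNormStats`, the function `train`, the flag `run_update_ops`, and the momentum value are all hypothetical names chosen for illustration. It only shows that moving statistics stay at their loaded values unless the update step is run explicitly alongside the training step.

```python
# Hypothetical sketch: batch-norm moving statistics are not learned by
# gradient descent, so they change only if their update step is executed.

class BatchNormStats:
    """Holds the moving mean/variance of one batch-norm layer (illustrative)."""

    def __init__(self, moving_mean, moving_var, momentum=0.9):
        self.moving_mean = moving_mean
        self.moving_var = moving_var
        self.momentum = momentum  # assumed value, not taken from the repo

    def update(self, batch):
        # Exponential moving average of the per-batch mean and variance.
        n = len(batch)
        mean = sum(batch) / n
        var = sum((x - mean) ** 2 for x in batch) / n
        m = self.momentum
        self.moving_mean = m * self.moving_mean + (1 - m) * mean
        self.moving_var = m * self.moving_var + (1 - m) * var


def train(stats, batches, run_update_ops):
    for batch in batches:
        # A gradient step on the learnable weights would happen here.
        # The statistics update is a separate step that must be requested.
        if run_update_ops:
            stats.update(batch)
    return stats


batches = [[4.0, 6.0], [5.0, 7.0], [3.0, 5.0]]

# Statistics "loaded from a pretrained model": mean 0.0, variance 1.0.
frozen = train(BatchNormStats(0.0, 1.0), batches, run_update_ops=False)
updated = train(BatchNormStats(0.0, 1.0), batches, run_update_ops=True)

print(frozen.moving_mean, updated.moving_mean)
```

Assuming the training code uses TensorFlow 1.x graph mode (which the mention of control dependencies suggests), the usual way to request these updates is to build the train op inside `tf.control_dependencies(tf.get_collection(tf.GraphKeys.UPDATE_OPS))`, so that each training step also runs the moving-average updates.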

If this is true, it might improve performance for all the models and combinations.
