The BN running mean&var with torch.utils.checkpoint.checkpoint #73

ljn114514 · 2020-10-04T09:01:35Z

How do you deal with the bn running mean/variance? Because the BatchNorm would be calculated twice (once during the forward pass and once during recomputation in the backward pass), and the running mean&var would updated twice.

gpleiss · 2020-10-05T13:31:29Z

This is a good point. Ideally, PyTorch's batch norm layers should be smart enough to update the running mean/var appropriately with the checkpointing operation.

If this is not the case, then you should raise an issue with PyTorch, since the checkpointing/batch norm layers are part of their library, not this library.

ljn114514 · 2020-10-06T06:38:51Z

Ok, thanks for your reply

ljn114514 closed this as completed Oct 6, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

The BN running mean&var with torch.utils.checkpoint.checkpoint #73

The BN running mean&var with torch.utils.checkpoint.checkpoint #73

ljn114514 commented Oct 4, 2020

gpleiss commented Oct 5, 2020

ljn114514 commented Oct 6, 2020

The BN running mean&var with torch.utils.checkpoint.checkpoint #73

The BN running mean&var with torch.utils.checkpoint.checkpoint #73

Comments

ljn114514 commented Oct 4, 2020

gpleiss commented Oct 5, 2020

ljn114514 commented Oct 6, 2020