Discrepancy in the parameters used for batch normalisation between pytorch and keras #74

zaccharieramzi · 2019-12-06T16:18:01Z

The parameters used for batch normalization are not specified in the original paper.

I don't know how batch norm works in Matlab (I tried reading the code, but I find it very difficult to follow), so I looked at the keras implementation instead. The parameters used there seemed very odd (and the batch normalisation is tangled up in messy code), so I looked at the pytorch implementation and saw that its parameters are different.
In keras the momentum is 0.0 or 0.1 depending on where you look, while in pytorch the momentum is 0.95.

Note that this differs from the original unofficial keras implementation.
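
When comparing these values, it may help to keep in mind that, as far as the two libraries' documentation describes it, keras and pytorch define batch norm momentum in opposite ways: in keras the momentum weights the old running statistic, while in pytorch it weights the new batch statistic. Below is a minimal pure-Python sketch of the two update rules. The function names are hypothetical and only for illustration; this is not code from either implementation.

```python
def keras_style_update(running, batch_stat, momentum):
    """Keras convention: momentum weights the OLD running statistic."""
    return momentum * running + (1.0 - momentum) * batch_stat


def torch_style_update(running, batch_stat, momentum):
    """PyTorch convention: momentum weights the NEW batch statistic."""
    return (1.0 - momentum) * running + momentum * batch_stat


def keras_to_torch_momentum(keras_momentum):
    """Momentum giving the same running-statistic update in PyTorch."""
    return 1.0 - keras_momentum


def torch_to_keras_momentum(torch_momentum):
    """Momentum giving the same running-statistic update in Keras."""
    return 1.0 - torch_momentum


# Values quoted in this issue, converted to the other convention.
for m in (0.0, 0.1):
    print("keras momentum", m, "-> pytorch momentum", keras_to_torch_momentum(m))
print("pytorch momentum", 0.95, "-> keras momentum", torch_to_keras_momentum(0.95))
```

Under these conventions a keras momentum of 0.0 corresponds to a pytorch momentum of 1.0, and a pytorch momentum of 0.95 corresponds to a keras momentum of about 0.05, so the values quoted above still do not match after conversion.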
