This repository was archived by the owner on Jul 1, 2023. It is now read-only.

Conversation

@jekbradbury
Contributor

The correct way to extend the initialization scheme introduced in Glorot and Bengio for dense layers to convolutional layers is to multiply the fanIn and fanOut sizes by the receptive field size (the product of the kernel's spatial dimensions). Keras, PyTorch, Lasagne, etc. all implement this correction, though without mentioning it in the relevant docstrings. As discussed for a related initialization in He et al., the correction is needed because the responses produced by a convolutional layer are equivalent to those produced by a pointwise dense layer over a feature space that has been expanded by a factor of the receptive field size.

Fixes the convergence discrepancy seen on CIFAR convnets.
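For concreteness, here is a minimal sketch of the corrected fan computation in Swift. `glorotUniformBound` is a hypothetical helper written for illustration, not the code added in this PR, and it assumes a kernel shape laid out as `[spatial..., inChannels, outChannels]`:

```swift
/// Glorot/Xavier uniform bound for a convolution kernel of shape
/// [spatial..., inChannels, outChannels]. Hypothetical helper for
/// illustration only.
func glorotUniformBound(kernelShape: [Int]) -> Double {
    precondition(kernelShape.count >= 2, "need at least in/out channel dims")
    // Receptive field size: product of the spatial kernel dimensions.
    // For a dense layer (no spatial dims) this is 1, recovering the
    // original Glorot and Bengio formula.
    let receptiveField = kernelShape.dropLast(2).reduce(1, *)
    let fanIn = receptiveField * kernelShape[kernelShape.count - 2]
    let fanOut = receptiveField * kernelShape[kernelShape.count - 1]
    // Weights are then sampled from Uniform(-bound, bound) with
    // bound = sqrt(6 / (fanIn + fanOut)).
    return (6.0 / Double(fanIn + fanOut)).squareRoot()
}

// Example: a 3x3 conv with 16 input and 32 output channels.
// fanIn = 9 * 16 = 144, fanOut = 9 * 32 = 288,
// bound = sqrt(6 / 432) ≈ 0.118.
print(glorotUniformBound(kernelShape: [3, 3, 16, 32]))
```

Without the receptive-field factor, the same 3x3 kernel would be initialized as if fanIn were 16 and fanOut were 32, giving a bound roughly three times too large.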

@jekbradbury jekbradbury requested review from rxwei and saeta February 22, 2019 08:27
Co-Authored-By: jekbradbury <jekbradbury@gmail.com>
@jekbradbury jekbradbury merged commit 22c923a into tensorflow:master Feb 22, 2019