Skip to content
This repository has been archived by the owner on Nov 22, 2022. It is now read-only.

created weight_norm option for deep cnn representation #809

Closed

Conversation

shreydesai
Copy link
Contributor

Differential Revision: D16403533

@facebook-github-bot facebook-github-bot added the CLA Signed Do not delete this pull request or issue due to inactivity. label Jul 20, 2019
shreydesai added a commit to shreydesai/pytext that referenced this pull request Jul 25, 2019
Summary:
Pull Request resolved: facebookresearch#809

Weight norm decouples a weight matrix into its magnitude and direction and optimizes them separately -- this is very helpful in speeding up convergence and getting better performance overall. This diff adds an option to use weight norm as a normalization strategy on conv1d layers.

Differential Revision: D16403533

fbshipit-source-id: b2b737b3dbfa43ed4f2893295df3afdfe6b59df0
shreydesai added a commit to shreydesai/pytext that referenced this pull request Jul 27, 2019
Summary:
Pull Request resolved: facebookresearch#809

Weight norm decouples a weight matrix into its magnitude and direction and optimizes them separately -- this is very helpful in speeding up convergence and getting better performance overall. This diff adds an option to use weight norm as a normalization strategy on conv1d layers.

Differential Revision: D16403533

fbshipit-source-id: 4c6d7546408b4d18c80b2348bcf8299943292ee2
shreydesai added a commit to shreydesai/pytext that referenced this pull request Jul 27, 2019
Summary:
Pull Request resolved: facebookresearch#809

Weight norm decouples a weight matrix into its magnitude and direction and optimizes them separately -- this is very helpful in speeding up convergence and getting better performance overall. This diff adds an option to use weight norm as a normalization strategy on conv1d layers.

Differential Revision: D16403533

fbshipit-source-id: 52ae2d815d6c2f7f41eb320dec8fdc451c1e2f0f
Summary:
Pull Request resolved: facebookresearch#809

Weight norm decouples a weight matrix into its magnitude and direction and optimizes them separately -- this is very helpful in speeding up convergence and getting better performance overall. This diff adds an option to use weight norm as a normalization strategy on conv1d layers.

Reviewed By: geof90

Differential Revision: D16403533

fbshipit-source-id: 4ef3141c59d3c02b95cde33da1dba0b1ebc5ba28
@facebook-github-bot
Copy link
Contributor

This pull request has been merged in 50ef702.

Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
CLA Signed Do not delete this pull request or issue due to inactivity. Merged
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

2 participants