
Conversation

@seanpmorgan (Member) commented Apr 30, 2019

Closes #216

Looks like assigning the variables using operations during init created an issue with the gradients. Setting their values inside of tf.function, as was done before, appears to fix it. I'm not thrilled about the lack of test coverage, but I'm also not excited to build a regression test into our unit tests.

https://colab.research.google.com/drive/1Md7SnyEC5bUfkME1Akth4KKM24ac26NK

After this bugfix is in, I'll work on publishing a 0.3 release.
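
For reference, roughly the kind of check the notebook runs (a sketch included here for context, not the actual notebook contents; the wrapped layer and shapes are arbitrary):

```python
import tensorflow as tf
import tensorflow_addons as tfa

# Wrap a layer in WeightNormalization and check that gradients come back.
model = tf.keras.Sequential(
    [tfa.layers.WeightNormalization(tf.keras.layers.Dense(2))])
x = tf.random.normal([4, 3])
with tf.GradientTape() as tape:
    loss = tf.reduce_sum(model(x) ** 2)
grads = tape.gradient(loss, model.trainable_variables)
print(all(g is not None for g in grads))  # True once the fix is in
```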

@facaiy (Member) commented Apr 30, 2019

Thanks for pinging me, Sean; I'll look into it tomorrow. By the way, is it related to #220?

@facaiy requested a review from @qlzh727 on April 30, 2019 03:39
@WindQAQ (Member) commented Apr 30, 2019

> Thanks for pinging me, Sean; I'll look into it tomorrow. By the way, is it related to #220?

No, it's just a typo in the example :-)

@WindQAQ added the layers label on Apr 30, 2019
@seanpmorgan (Member, Author)
> Thanks for pinging me, Sean; I'll look into it tomorrow. By the way, is it related to #220?

No problem, the request was automatic from CODEOWNERS. @qlzh727 might want to review since this code was recently worked on.

Changes:

  1. data_init also calls init_norm, because init_norm was previously called in build.
  2. init_norm now calls _compute_weights, since _compute_weights previously ran right after it in build.
  3. The variable assigns were replaced, since they were causing broken gradients (see the sketch after this list).
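
To illustrate point 3, here is a simplified sketch of the idea (hypothetical variable names, not the exact TFA code): the kernel is built as a tensor from g and v with ordinary ops, so the gradient path stays intact, whereas writing the result back with .assign() would cut it.

```python
import tensorflow as tf

v = tf.Variable(tf.random.normal([3, 2]))  # direction
g = tf.Variable(tf.ones([2]))              # magnitude

with tf.GradientTape() as tape:
    # Build the kernel as a tensor: kernel = g * v / ||v||.
    kernel = tf.nn.l2_normalize(v, axis=0) * g
    loss = tf.reduce_sum(kernel)
grads = tape.gradient(loss, [v, g])
print([grad is not None for grad in grads])  # [True, True] -- gradients flow
```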

@qlzh727 (Member) commented Apr 30, 2019

I am OK with point 3, which changes the .assign() to =, but I am still a bit confused about 1 and 2.

a. Both data_init and init_norm are trying to initialize the value of g, and I don't see the reason to make init_norm call _compute_weights(), which will update self.layer.kernel.
b. From my understanding, init_norm and data_init are trying to do the same thing, which is initializing the g value. In data_init's case, does it also require init_norm()?
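
For context on the two paths being discussed, a conceptual sketch of what each is meant to do, following the weight-normalization paper (Salimans & Kingma, 2016); this is a simplification, and the real TFA methods differ in their details:

```python
import tensorflow as tf

def init_norm_sketch(v):
    # Norm-based init: start g at the per-output-unit norm of v, so the
    # reparameterized kernel g * v / ||v|| initially equals v itself.
    return tf.norm(tf.reshape(v, [-1, v.shape[-1]]), axis=0)

def data_init_sketch(pre_activations):
    # Data-dependent init: choose a scale for g and a bias so that the
    # pre-activations on an initial batch are zero-mean, unit-variance.
    mean, variance = tf.nn.moments(pre_activations, axes=[0])
    scale = 1.0 / tf.sqrt(variance + 1e-10)
    return scale, -mean * scale
```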

@seanpmorgan (Member, Author) commented Apr 30, 2019

> I am OK with point 3, which changes the .assign() to =, but I am still a bit confused about 1 and 2.
>
> a. Both data_init and init_norm are trying to initialize the value of g, and I don't see the reason to make init_norm call _compute_weights(), which will update self.layer.kernel.
> b. From my understanding, init_norm and data_init are trying to do the same thing, which is initializing the g value. In data_init's case, does it also require init_norm()?

@qlzh727 Ah, sorry, that's a bit of laziness on my part. During troubleshooting I just made everything match the same steps as before to help debug. I've removed changes 1 and 2 and confirmed the expected results:
https://colab.research.google.com/drive/1Md7SnyEC5bUfkME1Akth4KKM24ac26NK

@seanpmorgan merged commit 377edb7 into tensorflow:master on Apr 30, 2019
@seanpmorgan deleted the fix-weightnorm branch on April 30, 2019 15:54
@facaiy (Member) commented May 1, 2019

Is it a bug? I thought we were encouraged to use assign rather than =, if I'm not wrong. cc @alextp

@alextp commented May 1, 2019

Using assign sets the value of a tf.Variable; using = sets a Python variable to refer to that tensor. Assigning to tf.Variables is not differentiable, but building a tensor from operations is.

Doing self.bias = ... inside a tf.function seems very dangerous, because you've now kept a reference to a tensor defined inside a function, which you're not allowed to use outside the function.

I recommend this code be changed to just say bias = instead of self.bias =. Then I believe it is correct.
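
A tiny standalone demonstration of the distinction (a sketch, not code from this PR):

```python
import tensorflow as tf

v = tf.Variable(1.0)
g = tf.Variable(3.0)

# Building a tensor from a variable with ops is differentiable.
with tf.GradientTape() as tape:
    w = g * 2.0          # w is a tensor computed from g
    loss = w * w
print(tape.gradient(loss, g))  # tf.Tensor(24.0, ...) -- gradient flows

# Writing the result into a variable with assign is not differentiable.
with tf.GradientTape() as tape:
    v.assign(g * 2.0)    # the assign op cuts the gradient path
    loss = v * v
print(tape.gradient(loss, g))  # None -- no gradient back to g
```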
