
Refactor gradient setter in gradient_check #5699

Merged
2 commits merged into chainer:master from refactor-gradient-setter on Dec 10, 2018

Conversation

@niboshi (Member) commented Nov 23, 2018

Merge after #5698.

Allow y_grad=None for any target function, not only loss functions.
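For reference, a minimal usage sketch of gradient_check.check_backward (not part of the PR; the function choices, shapes, and tolerances below are hypothetical):

```python
import numpy as np
import chainer.functions as F
from chainer import gradient_check

x = np.random.uniform(-1, 1, (3, 4)).astype(np.float32)

# Explicit output gradient: its shape and dtype must match the output.
gy = np.random.uniform(-1, 1, (3, 4)).astype(np.float32)
gradient_check.check_backward(F.tanh, x, gy, atol=1e-4, rtol=1e-4)

# y_grad=None: previously supported only for loss (scalar-output) functions
# such as F.sum; this PR relaxes that restriction.
gradient_check.check_backward(F.sum, x, None, atol=1e-4, rtol=1e-4)
```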

@niboshi niboshi force-pushed the refactor-gradient-setter branch 3 times, most recently from 8a02560 to 39f7b20 Compare November 23, 2018 18:51
@toslunar toslunar self-requested a review November 25, 2018 01:37
@niboshi (Member Author) commented Nov 26, 2018

Rebased.
PTAL

@@ -71,6 +100,7 @@ def numerical_grad(

"""
assert eps > 0
assert isinstance(inputs, (tuple, list))
Member:
Use one space after ,.

Member Author:
Fixed.

'Output gradients: {}'.format(
', '.join(str(y.shape) for y in outputs),
', '.join(str(None if gy is None else gy.shape)
for gy in grad_outputs)))
Member:
Why are the shapes printed in this error message?

Member Author:
I thought it would be helpful for those who read this message to spot the cause.

Member:
I see your intention. Could you make the error message clearer about the fact that the printed values are shapes?

Member Author:
Fixed the messages.

# Keep output arrays to save computation in numerical gradients
y0_data = tuple([y.array for y in ys])

# If y_grad is not given, generate the all-1 gradients.
Member:
The current behavior is compatible with Variable.backward. Do you want to change it to be compatible with chainer.grad?

@niboshi (Member Author), Nov 26, 2018:
Are you referring to the Variable.backward behavior where it fills grad with 1 only for outputs with shape ()? Actually, it's not clear to me why Variable.backward behaves that way.
Also, considering the purpose of gradient_check, I don't think its users would expect the same behavior.

Member:
random is a better default

Member Author:
That's also fine with me, but wouldn't that break compatibility?
In that case, ()-shaped grads should also be initialized randomly, for consistency.

Member Author:
After discussion, I reverted the behavior change. It can be handled separately from this PR.
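For illustration only, a minimal sketch (not the PR's code) of what generating "all-1 gradients" for y_grad=None means, assuming the outputs are NumPy arrays:

```python
import numpy as np

def _ones_like_outputs(outputs):
    # One all-ones array per output, matching each output's shape and dtype.
    return tuple(np.ones_like(y) for y in outputs)
```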


'Output gradients: {}\n'.format(
', '.join(str(y.shape) for y in outputs),
', '.join(str(None if gy is None else gy.shape)
for gy in grad_outputs)))
Member:
I'm wondering if dtype can be checked in this function, too.

Member Author:
I thought I had encountered an error when doing this, but now I can't find it.
Fixed so that dtypes are compared as well.
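An illustrative sketch (not the PR's exact code) of validating that each output gradient matches its output in both shape and dtype, producing an error message like the one shown below:

```python
def _check_grad_outputs(outputs, grad_outputs):
    # outputs and grad_outputs are sequences of arrays; a gradient may be None.
    mismatch = any(
        gy is not None and (y.shape != gy.shape or y.dtype != gy.dtype)
        for y, gy in zip(outputs, grad_outputs))
    if mismatch:
        raise ValueError(
            'Shapes and/or dtypes of outputs and output gradients do not '
            'match.\n'
            'Output shapes and dtypes         : {}\n'
            'Output gradient shapes and dtypes: {}'.format(
                ', '.join(
                    '{}:{}'.format(y.shape, y.dtype) for y in outputs),
                ', '.join(
                    'None' if gy is None
                    else '{}:{}'.format(gy.shape, gy.dtype)
                    for gy in grad_outputs)))
```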

@toslunar toslunar added the st:needs-discussion State indicating that discussions are needed before proceeding. label Dec 3, 2018
@niboshi niboshi force-pushed the refactor-gradient-setter branch 3 times, most recently from 0f5e22d to d075ae4 Compare December 7, 2018 05:11
@niboshi (Member Author) commented Dec 7, 2018

PTAL.

Currently the error message looks like this:

E   ValueError: Shapes and/or dtypes of outputs and output gradients do not match.            
E   Output shapes and dtypes         : (2, 2, 3):float16, (2, 3, 3):float16, (2, 2, 3):float16                     
E   Output gradient shapes and dtypes: (12,):float16, (18,):float16, (12,):float16

(I wonder if there is any canonical way to present shapes and dtypes🤔)

@toslunar (Member) left a comment:
LGTM

# If no input has a gradient, we don't need to compare with numeric
# gradient.
if len(self.x_data) + len(self.params) == self.no_grads.count(True):
return
Member:
I agree with deleting the early return.

  • I'd like detect_nondifferentiable=True to detect a random function regardless of the number of inputs.
  • I observed that the early return gave no significant speed-up for the tests under tests/chainer_tests/(functions|links)_tests.

@toslunar toslunar removed the st:needs-discussion State indicating that discussions are needed before proceeding. label Dec 7, 2018
@toslunar (Member) commented Dec 7, 2018

Jenkins, test this please.

@chainer-ci (Member) commented:
Jenkins CI test (for commit 20a53f2, target branch master) failed with status FAILURE.
(For contributors, please wait until the reviewer confirms the details of the error.)

@toslunar (Member) commented Dec 7, 2018

The Jenkins failure (TestDeconvolutionND_param_19.test_forward_consistency_cudnn) seems unrelated to the PR.

@toslunar toslunar added this to the v6.0.0b2 milestone Dec 10, 2018
@toslunar toslunar merged commit 87f497b into chainer:master Dec 10, 2018
@niboshi niboshi deleted the refactor-gradient-setter branch December 10, 2018 12:42
@kmaehashi kmaehashi added the cat:code-fix Code refactoring that does not change the behavior. label Jan 24, 2019