
Gradients calculated by reverse mode and forward mode are not equal for API tf.keras.layers.Softmax #56863

Open
eeDigitalSeal opened this issue Jul 22, 2022 · 4 comments
Labels: comp:ops (OPs related issues), stat:awaiting tensorflower (Status - Awaiting response from tensorflower), TF 2.9 (Issues found in the TF 2.9 release or RCs), type:bug (Bug)

Comments

@eeDigitalSeal commented Jul 22, 2022

Issue Type

Bug

Source

binary

Tensorflow Version

tf 2.9

Custom Code

Yes

OS Platform and Distribution

Linux Ubuntu 20.04

Mobile device

No response

Python version

3.9

Bazel version

No response

GCC/Compiler version

No response

CUDA/cuDNN version

No response

GPU model and memory

No response

Current Behaviour?

The Jacobian matrix calculated in reverse mode is not equal to the one calculated in forward mode using tf.autodiff.ForwardAccumulator.

Standalone code to reproduce the issue

import numpy as np
import tensorflow as tf

input0 = tf.constant([[[[[[0.56601346]]],
   [[[0.35367298]]],
   [[[0.5609504 ]]]],
  [[[[0.43669665]]],
   [[[0.82903767]]],
   [[[0.57900476]]]]],
 [[[[[0.13786423]]],
   [[[0.17976725]]],
   [[[0.35320067]]]],
  [[[[0.34501767]]],
   [[[0.82709   ]]],
   [[[0.8386754 ]]]]],
 [[[[[0.8424399 ]]],
   [[[0.12519908]]],
   [[[0.41379738]]]],
  [[[[0.88551676]]],
   [[[0.26824057]]],
   [[[0.06636572]]]]]], shape=(3, 2, 3, 1, 1, 1), dtype=tf.float32)
input1 = tf.constant(
[[[[[[0.88215363],
     [0.56112957],
     [0.47048628]],
    [[0.23962319],
     [0.24418604],
     [0.68752027]],
    [[0.8854921 ],
     [0.8750253 ],
     [0.43920374]],
    [[0.6869767 ],
     [0.9971782 ],
     [0.21735716]]],
   [[[0.7472261 ],
     [0.7923174 ],
     [0.99001765]],
    [[0.23535097],
     [0.47414947],
     [0.53421795]],
    [[0.46127486],
     [0.31279147],
     [0.41679263]],
    [[0.40748405],
     [0.8575851 ],
     [0.62180364]]],
   [[[0.09478486],
     [0.8094814 ],
     [0.7278038 ]],
    [[0.36277568],
     [0.14143586],
     [0.6791742 ]],
    [[0.48797262],
     [0.34706163],
     [0.211653  ]],
    [[0.49032676],
     [0.37094796],
     [0.7821864 ]]]]]], shape=(1, 1, 3, 4, 3, 1), dtype=tf.float32)

softmax = tf.keras.layers.Softmax(axis=0)

# input1 is passed as the layer's second positional argument, i.e. as the mask:
# softmax(*input) == softmax(input0, mask=input1).
input = [input0, input1]

# Reverse mode: full Jacobian of the output w.r.t. input1 via GradientTape.
with tf.GradientTape() as g:
    g.watch(input1)
    res_backward = softmax(*input)
grad = g.jacobian(res_backward, input1)

# Forward mode: one JVP per element of input1, using one-hot tangents.
grad_fwd_arr = []

for i in range(tf.size(input1)):
    tangents = tf.reshape(tf.one_hot(i, tf.size(input1)), shape=input1.shape)
    with tf.autodiff.ForwardAccumulator(input1, tangents) as acc:
        res_forward = softmax(*input)
        jvp = acc.jvp(res_forward)
        grad_fwd_arr.append(jvp)

grad_fwd = tf.reshape(tf.convert_to_tensor(grad_fwd_arr), shape=grad.shape)

np.testing.assert_allclose(grad, grad_fwd)

Relevant log output

AssertionError: 
Not equal to tolerance rtol=1e-07, atol=0

Mismatched elements: 215 / 7776 (2.76%)
Max absolute difference: 32.
Max relative difference: 1.
 x: array([[[[[[[[[[[[-29.802322],
                 [  0.      ],
                 [  0.      ]],...
 y: array([[[[[[[[[[[[  0.],
                 [  0.],
                 [  0.]],...
@google-ml-butler google-ml-butler bot added the type:bug Bug label Jul 22, 2022
@tilakrayal tilakrayal added TF 2.9 Issues found in the TF 2.9 release (or RCs) comp:ops OPs related issues labels Jul 25, 2022
@tilakrayal (Contributor) commented

@sachinprasadhs,
I was able to reproduce the issue on TensorFlow v2.8, v2.9, and nightly. Kindly find the gist of it here.

@sachinprasadhs sachinprasadhs added the stat:awaiting tensorflower Status - Awaiting response from tensorflower label Jul 25, 2022
@cantonios (Contributor) commented Aug 4, 2022

@tilakrayal what are you attempting to accomplish here? You have an input tensor input0 and a mask tensor input1, which has a different shape from input0 and does not consist of only 0s and 1s. Then you're trying to find the gradient w.r.t. the mask?

Oops, tagged the wrong person, sorry. @eeDigitalSeal
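(For context, the Keras Softmax layer applies a mask roughly as a large negative additive bias before the softmax. The sketch below is a paraphrase of that idea, not the exact Keras source, and the helper name masked_softmax is illustrative.)

import tensorflow as tf

def masked_softmax(inputs, mask, axis=-1):
    # The mask is expected to be boolean (or 0/1) and broadcastable to
    # inputs. Any other float values still broadcast and are silently
    # folded into the logits, which is why gradients w.r.t. such a mask
    # look arbitrary rather than triggering an error.
    if mask is not None:
        adder = (1.0 - tf.cast(mask, inputs.dtype)) * -1e9
        inputs = inputs + adder
    return tf.nn.softmax(inputs, axis=axis)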

@eeDigitalSeal (Author) commented

@cantonios Thanks for your reply. But if input1 is an inappropriate input, shouldn't the API reject it (e.g., by throwing an exception) when computing gradients for input1?

@cantonios (Contributor) commented

@eeDigitalSeal you're combining Keras and raw TF here. Keras is a separate project, over at https://github.com/keras-team/keras. It's the Keras Softmax API that is being used incorrectly.

But my question stands: what are you attempting to do (other than perhaps generate cases where forward & backward gradients differ)? We don't always validate all inputs (e.g. checking a mask is all zeros or ones) because doing so can be as expensive as running the op - so if you give it bad inputs, you're going to get unreliable outputs.
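(A caller who wants that validation can assert it cheaply on their own side. A minimal sketch, assuming float masks are meant to contain only 0s and 1s; the helper name check_mask is illustrative:)

import tensorflow as tf

def check_mask(inputs, mask):
    # Masks should be boolean; a float mask should contain only 0s and 1s.
    if mask.dtype != tf.bool:
        is_binary = tf.reduce_all(
            tf.logical_or(tf.equal(mask, 0.0), tf.equal(mask, 1.0)))
        tf.debugging.assert_equal(
            is_binary, True, message="mask must contain only 0s and 1s")
    # Shapes must be broadcast-compatible (raises ValueError otherwise).
    tf.broadcast_static_shape(inputs.shape, mask.shape)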

Also, the forward and backward gradients are computed in different ways, so they will not always be exactly the same - for example, if you give it invalid inputs, or if you encounter NaNs in some computation.
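(For what it's worth, with a valid boolean mask the two modes do agree up to float32 precision. A minimal sketch; the shapes and tolerances here are chosen purely for illustration:)

import numpy as np
import tensorflow as tf

x = tf.constant([[1.0, 2.0, 3.0]])
mask = tf.constant([[True, True, False]])  # boolean, same shape as x

softmax = tf.keras.layers.Softmax(axis=-1)

# Reverse mode: full Jacobian via GradientTape.
with tf.GradientTape() as g:
    g.watch(x)
    y = softmax(x, mask)
jac_reverse = g.jacobian(y, x)  # shape: y.shape + x.shape

# Forward mode: one JVP per input element, using one-hot tangents.
n = int(tf.size(x))
jvps = []
for i in range(n):
    tangent = tf.reshape(tf.one_hot(i, n), x.shape)
    with tf.autodiff.ForwardAccumulator(x, tangent) as acc:
        y = softmax(x, mask)
    jvps.append(acc.jvp(y))

# Flattened, the reverse-mode Jacobian is indexed [output, input], while the
# stacked JVPs are indexed [input, output], so transpose before comparing.
m = int(tf.size(y))
jac_rev_flat = tf.reshape(jac_reverse, (m, n))
jac_fwd_flat = tf.transpose(tf.reshape(tf.stack(jvps), (n, m)))
np.testing.assert_allclose(jac_rev_flat.numpy(), jac_fwd_flat.numpy(),
                           rtol=1e-5, atol=1e-7)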
