
layer.output raises AttributeError because inbound nodes lost after call to activation function #34834

Closed
chahld opened this issue Dec 4, 2019 · 15 comments
Labels: comp:keras, stale, stat:awaiting response, TF 2.2, type:bug


chahld commented Dec 4, 2019

System information

  • custom code
  • OS Platform and Distribution (e.g., Linux Ubuntu 16.04):
    ubuntu 18.04
  • TensorFlow installed from (source or binary):
    pip install tensorflow-gpu
  • TensorFlow version (use command below):

v2.0.0-rc2-26-g64c3d38 2.0.0
and
v2.0.0-beta0-16-g1d91213fe7 2.0.0-beta1

  • Python version:
    3.6.9

Describe the current behavior
Calling a Keras layer on the output of a bare activation function does not set up the layer's inbound nodes properly, so reading that layer's layer.output property raises an AttributeError.

Describe the expected behavior
layer.output should return the output tensor

Code to reproduce the issue


import tensorflow as tf



class MyModel(tf.keras.Model):
    def __init__(self, **kwargs):
        super().__init__(**kwargs)

        self.dense0 = tf.keras.layers.Dense(10, name='dense0', input_shape=(5, 5, 1))
        self.dense1 = tf.keras.layers.Dense(10, name='dense1')
        self.dense2 = tf.keras.layers.Dense(10, name='dense2')

    def call(self, x):
        x = self.dense0(x)
        # if you use this line it works
        x = tf.keras.layers.ReLU()(x)
        x = self.dense1(x)

        print('correct:', self.dense1.inbound_nodes)

        # if you use this line it doesn't work
        relu = tf.keras.activations.get('relu')
        x = relu(x)
        x = self.dense2(x)

        print('incorrect:', self.dense2.inbound_nodes)
        return x


def main():
    my_model = MyModel()

    inp = tf.keras.Input(shape=(5, 5, 1))
    out = my_model(inp)

    my_model.compile(optimizer='adam',
                     loss='sparse_categorical_crossentropy')

    for l in my_model.layers:
        try:
            print(l.output)
        except AttributeError:
            print('EXCEPTION: {}.output raises attribute error'.format(l.name))


if __name__=='__main__':
    main()

Other info / logs

UPDATED: the code above now includes input_shape, which does not solve the problem.


oanush commented Dec 6, 2019

@chahld,
When defining a network in Keras, the first layer added needs input_shape specified; please check the link for more information.
Also refer to this example. Thanks!

chahld (Author) commented Dec 8, 2019

@oanush,

I am not using the Sequential model. I discovered the problem while subclassing a layer in which I needed to call tf.stop_gradient. [Note: subclassing a layer does not require the input_shape parameter. In my example I'm calling the model on a tf.keras.Input, which provides the appropriate input shape. Just to prove this to myself I've amended the code in my original post; it still fails.]

TL;DR: the inbound-nodes problem happens whenever you subclass (either Model or Layer) and use plain functions instead of Layer classes for some of the steps in the call method. You can work around this by wrapping any TensorFlow function inside a tf.keras.layers.Lambda.

Detailed comments:

  • maybe tf.keras.activations.get should return the ReLU layer, not the relu function.

  • so long as you use only Keras Layers in the model, the inbound nodes are updated correctly.

  • If you use the functional interface outside of a Keras Layer, the error does not happen, so something about the context inside the Layer.call method is not working properly (see code below).

    The fact that the node structure is updated correctly in the functional interface makes me think the Layer.call problem is actually a bug.

  • Workaround: my original problem happened because I needed to call tf.stop_gradient. There is no corresponding Keras layer for this, but you can get around it by wrapping it inside a Lambda layer (a fuller sketch follows after this list):

   StopGradient = tf.keras.layers.Lambda(tf.stop_gradient)
  • or by defining a subclassed layer (tf.keras.layers.Layer spelled out here so the snippet is self-contained):
class StopGradient(tf.keras.layers.Layer):
    def call(self, x):
        return tf.stop_gradient(x)
  • I have no idea why this fixes the problem: inside this layer's call method the nodes are probably not being assigned correctly either, but somehow the layer that calls this function recovers. Maybe node construction works properly when the function call is the last or only step in a layer? I don't know.
  • The Sequential object is helpful because it throws an error if you try to add a function to the sequence instead of a Layer:
    my_model.add(tf.keras.activations.get('relu'))

    TypeError: The added layer must be an instance of class Layer. Found: <function relu at 0x7fee0be176a8>
  • it would be nice if the Layer subclassing system raised such a useful exception instead of silently building a bad node structure.
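
To make the Lambda workaround concrete, here is a minimal sketch of my own (MyFixedModel is a hypothetical name, and the shapes match the repro above) in which every non-Layer step is wrapped, so layer.output works for all layers:

import tensorflow as tf

class MyFixedModel(tf.keras.Model):
    def __init__(self, **kwargs):
        super().__init__(**kwargs)
        self.dense0 = tf.keras.layers.Dense(10, name='dense0')
        # Wrapping the bare functions in Lambda layers keeps the node
        # bookkeeping intact: relu and stop_gradient are now invoked
        # through proper Layer objects.
        self.relu = tf.keras.layers.Lambda(tf.keras.activations.relu)
        self.stop_grad = tf.keras.layers.Lambda(tf.stop_gradient)
        self.dense1 = tf.keras.layers.Dense(10, name='dense1')

    def call(self, x):
        x = self.dense0(x)
        x = self.relu(x)        # a Layer call, so inbound nodes are recorded
        x = self.stop_grad(x)   # same trick for tf.stop_gradient
        return self.dense1(x)

With this version, the loop over my_model.layers in my original main() prints every layer's output without raising.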

Addendum:

See my original post for code to reproduce the problem.

Here is an example using the functional interface that works:

import tensorflow as tf

def main():
    inputs = tf.keras.Input(shape=(5, 5, 1))
    x = inputs
    x = tf.keras.layers.Dense(10, name='dense0')(x)
    x = tf.keras.layers.ReLU()(x)
    x = tf.stop_gradient(x)
    x = tf.keras.layers.Dense(10, name='dense1')(x)
    x = tf.keras.activations.relu(x)
    x = tf.keras.layers.Dense(10, name='dense2')(x)
    outputs = x
    my_model = tf.keras.Model(inputs, outputs)

    # inp = tf.keras.Input(shape=(5, 5, 1))

    my_model.compile(optimizer='adam',
                     loss='sparse_categorical_crossentropy')


    for l in my_model.layers:

        print(l.output)  # does not fail


if __name__=='__main__':
    main()


oanush commented Dec 9, 2019

Issue replicated with the given code; kindly find the gist on Colab. Thanks!

jvishnuvardhan (Contributor) commented:

@chahld I am not sure why you are trying to use relu = tf.keras.activations.get('relu'). I tried defining relu inside a method and using it instead; please check the gist here. Thanks!

Please close the issue if it was resolved for you. Thanks!

chahld (Author) commented Dec 11, 2019

@jvishnuvardhan

Yes I know that the ReLU layer works (I showed that in my code to reproduce the problem).

There are two reasons why I don't think this should be closed.

First: tf.keras.activations.get('relu') returns the function, not the Layer class. If only the Layer class is supported, then get should return the Layer. I was using get so that I could choose the activation function from a configuration file. The get method does not accept "ReLU"; accepting it would be another way to fix this part of the problem.
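
For the pick-an-activation-by-name use case, a minimal sketch of a layer-based alternative is tf.keras.layers.Activation, which accepts the same string names but wraps the function in a proper Layer (activation_name here is a hypothetical config value):

import tensorflow as tf

activation_name = 'relu'  # hypothetical: read from a configuration file

inputs = tf.keras.Input(shape=(5, 5, 1))
x = tf.keras.layers.Dense(10, name='dense0')(inputs)
# Activation is a real Layer, so the node structure stays intact.
x = tf.keras.layers.Activation(activation_name, name='act0')(x)
model = tf.keras.Model(inputs, x)
print(model.get_layer('act0').output)  # has inbound nodes; no AttributeError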

Second: as I said in my second post, the problem seems to occur whenever you use any TensorFlow function in a Layer or Model subclass. It is not limited to relu, and for some TensorFlow functions there is no Keras layer available, e.g. tf.stop_gradient().

I know there is a workaround (I explained it in my second post), but this problem ought to be fixed, not just worked around. The error message is very obscure, and this was difficult to debug and reproduce. At the least, the documentation and tutorials on the subclassing approach should state that TensorFlow functions in the pipeline cause issues, or an exception should be raised on this invalid use.


zzj0402 commented Apr 1, 2020

import tensorflow as tf
from tensorflow import keras
from tensorflow.keras.layers import Dense, Flatten, Conv2D
from tensorflow.keras import Model, backend as K

mnist = keras.datasets.mnist

(x_train, y_train), (x_test, y_test) = mnist.load_data()
x_train, x_test = x_train / 255.0, x_test / 255.0

x_train = x_train[..., tf.newaxis]
x_test = x_test[..., tf.newaxis]

train_ds = tf.data.Dataset.from_tensor_slices(
    (x_train, y_train)).shuffle(10000).batch(32)
test_ds = tf.data.Dataset.from_tensor_slices((x_test, y_test)).batch(32)


class MyModel(Model):
    def __init__(self):
        super(MyModel, self).__init__()
        self.conv1 = Conv2D(32, 3, activation='relu', input_shape=(28, 28, 1))
        self.flatten = Flatten()
        self.d1 = Dense(128, activation='relu')
        self.d2 = Dense(10, activation='softmax')

    def call(self, x):
        x = self.conv1(x)
        x = self.flatten(x)
        x = self.d1(x)
        return self.d2(x)


# def get_layer_output(self):
#     output = graph.get_tensor_by_name('output:0')


model = MyModel()

loss_object = tf.keras.losses.SparseCategoricalCrossentropy()

optimizer = tf.keras.optimizers.Adam()

train_loss = tf.keras.metrics.Mean(name='train_loss')
train_accuracy = tf.keras.metrics.SparseCategoricalAccuracy(
    name='train_accuracy')

test_loss = tf.keras.metrics.Mean(name='test_loss')
test_accuracy = tf.keras.metrics.SparseCategoricalAccuracy(
    name='test_accuracy')


@tf.function
def train_step(images, labels):
    with tf.GradientTape() as tape:
        predictions = model(images)
        loss = loss_object(labels, predictions)
    gradients = tape.gradient(loss, model.trainable_variables)
    optimizer.apply_gradients(zip(gradients, model.trainable_variables))

    train_loss(loss)
    train_accuracy(labels, predictions)


@tf.function
def test_step(images, labels):
    predictions = model(images)
    t_loss = loss_object(labels, predictions)

    test_loss(t_loss)
    test_accuracy(labels, predictions)


EPOCHS = 1

for epoch in range(EPOCHS):
    train_loss.reset_states()
    train_accuracy.reset_states()
    test_loss.reset_states()
    test_accuracy.reset_states()

    for images, labels in train_ds:
        train_step(images, labels)

    for test_images, test_labels in test_ds:
        test_step(test_images, test_labels)

    template = 'Epoch {}, Loss: {}, Accuracy: {}, Test Loss: {}, Test Accuracy: {}'
    print(template.format(epoch+1,
                          train_loss.result(),
                          train_accuracy.result()*100,
                          test_loss.result(),
                          test_accuracy.result()*100))


def get_all_outputs(model, input_data, learning_phase=1):
    outputs = [layer.output for layer in model.layers[1:]]  # exclude Input
    layers_fn = K.function([model.input, K.learning_phase()], outputs)
    return layers_fn([input_data, learning_phase])


outputs = get_all_outputs(model, "input_data", 1)
print(outputs)

The above sample code reproduces the error:

Traceback (most recent call last):
  File "/Volumes/SAMSUNG/ai/get-layer-output.py", line 102, in <module>
    outputs = get_all_outputs(model, "input_data", 1)
  File "/Volumes/SAMSUNG/ai/get-layer-output.py", line 97, in get_all_outputs
    outputs = [layer.output for layer in model.layers[1:]]  # exclude Input
  File "/Volumes/SAMSUNG/ai/get-layer-output.py", line 97, in <listcomp>
    outputs = [layer.output for layer in model.layers[1:]]  # exclude Input
  File "/Users/xoxoxo/Library/Python/3.6/lib/python/site-packages/tensorflow_core/python/keras/engine/base_layer.py", line 1553, in output
    raise AttributeError('Layer ' + self.name + ' has no inbound nodes.')
AttributeError: Layer flatten has no inbound nodes.

I am using Python 3.6.6 on Mac with TF 2.1
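
An untested idea for a way around this (a sketch under the assumption that tracing the model once over a symbolic Input populates the inbound nodes; behavior may vary across TF versions):

# Trace the layers over a symbolic Input so each internal layer records
# an inbound node; model.call bypasses Model.__call__, so the trace runs
# through the individual layers rather than the model as a whole.
sym_in = tf.keras.Input(shape=(28, 28, 1))
sym_out = model.call(sym_in)

# layer.output is now readable, and a functional extractor can be built.
extractor = tf.keras.Model(sym_in, [layer.output for layer in model.layers])
intermediate = extractor(x_test[:1])  # per-layer activations for one image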

amahendrakar (Contributor) commented:

Was able to reproduce the issue with TF v2.2 and TF-nightly. Please find the attached gist. Thanks!

etienne-v commented:

Hi @zzj0402, were you able to solve this issue with regard to the inbound nodes? I am trying to do the exact same thing and am running into similar problems. How did you get around this when using custom models in TF2?

sushreebarsa (Contributor) commented:

Was able to reproduce the issue in TF v2.5; please find the gist here. Thanks!


Mouradost commented Jun 12, 2021

@chahld @zzj0402 I solved this issue with a simple trick:

import tensorflow as tf

class MyModel(tf.keras.Model):
    def __init__(self, **kwargs):
        super().__init__(**kwargs)

        self.dense0 = tf.keras.layers.Dense(10, name='dense0')
        self.dense1 = tf.keras.layers.Dense(10, name='dense1')
        self.dense2 = tf.keras.layers.Dense(10, name='dense2')

    def build(self, input_shape):
        self.dense2(self.dense1(self.dense0(tf.keras.layers.Input(shape=input_shape[1:], name="input_x"))))

    def call(self, x):
        x = self.dense0(x)
        # if you use this line it works
        x = tf.keras.layers.ReLU()(x)
        x = self.dense1(x)

        print('correct:', self.dense1.inbound_nodes)

        # if you use this line it doesn't work
        relu = tf.keras.activations.get('relu')
        x = relu(x)
        x = self.dense2(x)

        print('incorrect:', self.dense2.inbound_nodes)
        return x


def main():
    my_model = MyModel()

    inp = tf.keras.Input(shape=(5, 5, 1))
    out = my_model(inp)
    my_model.summary()

    my_model.compile(optimizer='adam',
                     loss='sparse_categorical_crossentropy')

    for l in my_model.layers:
        try:
            print(l.output)
        except AttributeError:
            print('EXCEPTION: {}.output raises attribute error'.format(l.name))


if __name__=='__main__':
    main()

and this is the output:

correct: [<tensorflow.python.keras.engine.node.Node object at 0x7fc4e55bfb90>]
incorrect: [<tensorflow.python.keras.engine.node.Node object at 0x7fc4e4ea27d0>]
Model: "my_model_11"
_________________________________________________________________
Layer (type)                 Output Shape              Param #   
=================================================================
dense0 (Dense)               (None, 5, 5, 10)          20        
_________________________________________________________________
dense1 (Dense)               (None, 5, 5, 10)          110       
_________________________________________________________________
dense2 (Dense)               (None, 5, 5, 10)          110       
=================================================================
Total params: 240
Trainable params: 240
Non-trainable params: 0
_________________________________________________________________
KerasTensor(type_spec=TensorSpec(shape=(None, 5, 5, 10), dtype=tf.float32, name=None), name='dense0/BiasAdd:0', description="created by layer 'dense0'")
KerasTensor(type_spec=TensorSpec(shape=(None, 5, 5, 10), dtype=tf.float32, name=None), name='dense1/BiasAdd:0', description="created by layer 'dense1'")
KerasTensor(type_spec=TensorSpec(shape=(None, 5, 5, 10), dtype=tf.float32, name=None), name='dense2/BiasAdd:0', description="created by layer 'dense2'")

Basically, I added a build method to your model and explicitly computed the output of each layer on a symbolic Input; calling each layer on a symbolic tensor is what records the inbound nodes that layer.output reads. You can now see the shapes of each layer in the summary.
Note that if you don't pass tf.keras.Input(shape=(5, 5, 1)) to the model and instead pass a tensor directly, for example tf.random.uniform([5, 5, 1]), you need to get rid of the slice on input_shape, as follows:

    def build(self, input_shape):
        self.dense2(self.dense1(self.dense0(tf.keras.layers.Input(shape=input_shape, name="input_x"))))

    my_model(tf.random.uniform([5, 5, 1]))

I hope that helps. Good luck!
This is the reproduced version in TF v2.5, but it should work on all TF 2.x versions; please find the gist here. Thanks!

kumariko commented:

@chahld please check #34834 (comment) and let us know whether your issue is resolved.

google-ml-butler (bot) commented:

This issue has been automatically marked as stale because it has no recent activity. It will be closed if no further activity occurs. Thank you.

google-ml-butler (bot) commented:

Closing as stale. Please reopen if you'd like to work on this further.



mehini commented Sep 15, 2023

This issue still persists in TF 2.10 when building custom models that inherit from tf.keras.Model.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
comp:keras Keras related issues stale This label marks the issue/pr stale - to be closed automatically if no activity stat:awaiting response Status - Awaiting response from author TF 2.2 Issues related to TF 2.2 type:bug Bug
Projects
None yet
Development

No branches or pull requests