dtype of RNN cell's state is changed to tf.float32 during reset_states #649

TillHa · 2021-08-13T08:56:39Z

Have I written custom code (as opposed to using a stock example script provided in Keras): yes
OS Platform and Distribution: both win10 and CentOS Linux
TensorFlow installed from: pip
TensorFlow version: 2.6.0
Python version: 3.8
Exact command to reproduce: tf.keras.layers.RNN(cell)

Describe the problem.

I have implemented a recurrent cell that is wrapped in a tf.keras.layers.RNN. The cell has a state whose dtype is tf.complex64 rather than tf.float32. However, each time layer.reset_states() is invoked, the dtype of the state is changed to tf.float32. As a result, a ValueError is raised during the initial symbolic call. See the attached stack trace.

Describe the current behavior.
The program crashes during construction of the RNN layer. See the attached stack trace.

Contributing.

I assume the cause of this issue is lines 933 and 934 of the reset_states function of class RNN in keras/layers/recurrent.py:

      flat_states_variables = tf.nest.map_structure(
          backend.variable, flat_init_state_values)

Here, the initial state values are stored in flat_init_state_values, and backend.variable is called on each of them. However, no dtype argument is passed to backend.variable, so it defaults to tf.float32 for all states.
I would recommend the following patch, which solves the issue for me:

      flat_states_variables = tf.nest.map_structure(
          lambda var: backend.variable(var, var.dtype), flat_init_state_values)
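To illustrate the mechanism without depending on a particular TensorFlow version, here is a minimal pure-Python sketch. It is not Keras code: State and make_variable are hypothetical stand-ins for a tensor and for backend.variable, and the float32 fallback is assumed from the behaviour described above.

```python
from dataclasses import dataclass


@dataclass
class State:
    """Toy stand-in for a tensor: a payload plus a dtype label."""
    values: tuple
    dtype: str


def make_variable(state, dtype=None):
    """Hypothetical analogue of backend.variable with a float32 fallback."""
    if dtype is None:
        dtype = "float32"  # assumed fallback, as described in this report
    return State(state.values, dtype)


# One complex state and one float state, as a cell might provide them.
init_states = [State((0j,) * 5, "complex64"), State((0.0,) * 5, "float32")]

# Reported behaviour: the dtype is not forwarded, so every state ends up float32.
unpatched = [make_variable(s) for s in init_states]

# Proposed patch: forward each state's own dtype.
patched = [make_variable(s, s.dtype) for s in init_states]

print([s.dtype for s in unpatched])  # ['float32', 'float32']
print([s.dtype for s in patched])    # ['complex64', 'float32']
```

The sketch only tracks dtype labels. In the real layer the mismatch surfaces later, when the float32 state variable meets the complex64 state produced by the cell.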

I also tried running the example after replacing the files affected by the latest commit regarding mixed precision. Unfortunately, that did not solve the issue for me.

Standalone code to reproduce the issue.

Currently, the example fails at the construction of the RNN layer.

import tensorflow as tf

class RecurrentCell(tf.keras.layers.Layer):
    def __init__(self, state_size):
        super(RecurrentCell, self).__init__()
        self.state_size = state_size

    def build(self, input_shape):
        super(RecurrentCell, self).build(input_shape)

    def get_initial_state(self, inputs=None, batch_size=None, dtype=None):
        # explicit initialization with tf.complex64
        return tf.zeros((self.state_size, ), dtype=tf.complex64)

    @tf.function
    def call(self, inputs, states):
        # toy example
        x = inputs
        xfd = tf.signal.rfft(x)[..., :self.state_size]
        yfd = tf.multiply(xfd, states)
        return tf.signal.irfft(yfd), states


recCell = RecurrentCell(state_size=5)

inp = tf.keras.Input(shape=(None, 8),
                     batch_size=32)
out = tf.keras.layers.RNN(recCell,  # crashes
                          return_sequences=True,
                          stateful=True,
                          return_state=False)(inp)
model = tf.keras.Model(inputs=[inp], outputs=[out])

y = model.predict(tf.random.normal((32, 16, 8)))

Source code / Logs
Stacktrace:
stacktrace.txt


The backend.variable was assuming float32 when dtype is not provided. The RNN init state should pass the init state dtype to backend.variable. See https://github.com/keras-team/keras/issues/15164 for more details. PiperOrigin-RevId: 550600164

The backend.variable was assuming float32 when dtype is not provided. The RNN init state should pass the init state dtype to backend.variable. See https://github.com/keras-team/keras/issues/15164 for more details. PiperOrigin-RevId: 550619673

tilakrayal · 2023-09-22T09:46:01Z

@TillHa,
I executed the code above on tf-nightly (2.15.0-dev20230922) without any issue or error. Kindly find the gist of it here. Thank you!

github-actions · 2023-10-09T01:48:10Z

This issue is stale because it has been open for 14 days with no activity. It will be closed if no further activity occurs. Thank you.

github-actions · 2023-10-23T01:48:22Z

This issue was closed because it has been inactive for 28 days. Please reopen if you'd like to work on this further.

TillHa mentioned this issue Aug 13, 2021

dtype of RNN cell's state is changed to tf.float32 during reset_states tensorflow/tensorflow#51449

Closed

ymodak added the keras-team-review-pending label Aug 17, 2021

mattdangerw assigned qlzh727 Aug 19, 2021

mattdangerw removed the keras-team-review-pending label Aug 19, 2021

tilakrayal added the stat:awaiting keras-eng label Jul 20, 2023

copybara-service bot mentioned this issue Jul 24, 2023

Address the issue for init state dtype in RNN. keras-team/keras#18309

Closed

tilakrayal added stat:awaiting response from contributor and removed stat:awaiting keras-eng labels Sep 22, 2023

tilakrayal assigned tilakrayal and unassigned qlzh727 Sep 22, 2023

fchollet transferred this issue from keras-team/keras Sep 22, 2023

github-actions bot added the stale label Oct 9, 2023

github-actions bot closed this as completed Oct 23, 2023
