rnn with initial_state model can't be loaded with load_model #32

xing-w · 2023-08-31T20:39:46Z

Issue type

Bug

Have you reproduced the bug with TensorFlow Nightly?

Yes

Source

source

TensorFlow version

2.13.0

Custom code

Yes

OS platform and distribution

No response

Mobile device

No response

Python version

No response

Bazel version

No response

GCC/compiler version

No response

CUDA/cuDNN version

No response

GPU model and memory

No response

Current behavior?

The model is a simple RNN built from an LSTMCell.
I want to initialize its states with initial_state_h and initial_state_c.

import numpy as np
import tensorflow as tf

batch_size = 16
inputs = tf.keras.layers.Input(shape=(20, 5), batch_size=batch_size)
units = 8
lstm_cell_fw = tf.keras.layers.LSTMCell(units)

# Constant tensors used as the initial hidden (h) and cell (c) states.
initial_state_h = tf.random.normal(shape=(batch_size, units), mean=0., stddev=10., dtype=tf.dtypes.float32)
initial_state_c = tf.random.normal(shape=(batch_size, units), mean=0., stddev=10., dtype=tf.dtypes.float32)
lstm_layer_fw = tf.keras.layers.RNN(lstm_cell_fw, stateful=True, return_state=True, return_sequences=False)
outputs, states_h_fw, states_c_fw = lstm_layer_fw(inputs, initial_state=[initial_state_h, initial_state_c])

lstm_dense1 = tf.keras.layers.Dense(16, activation='relu')
lstm_dense2 = tf.keras.layers.Dense(2, activation='softmax')
out = lstm_dense2(lstm_dense1(outputs))

model = tf.keras.models.Model(inputs, out)

After compiling and training, the model is saved with model.save('my_model_test.keras').

model.compile(optimizer='adam', loss='categorical_crossentropy',metrics=['accuracy'])
model.summary()

xTrain = np.random.rand(96,20,5)
yTrain = np.random.rand(96,2)

for i in range(10):
  model.fit(xTrain, yTrain,batch_size=batch_size)

model.save('my_model_test.keras')

But when I try to load it with load_model = tf.keras.models.load_model('my_model_test.keras'), it raises the following error:

13 frames
/usr/local/lib/python3.10/dist-packages/keras/src/backend.py in int_shape(x)
   1530     """
   1531     try:
-> 1532         shape = x.shape
   1533         if not isinstance(shape, tuple):
   1534             shape = tuple(shape.as_list())

AttributeError: 'float' object has no attribute 'shape'

I also tried saving in other formats (.h5, .json, etc.). All give the same error.

However, if I omit initial_state, i.e. outputs, states_h_fw, states_c_fw = lstm_layer_fw(inputs), everything works and load_model has no problem.

Standalone code to reproduce the issue

https://colab.research.google.com/gist/sushreebarsa/df202f7ea6ad3c85bdf4184cc8e1c9a1/rnn_save_model.ipynb

Relevant log output

---------------------------------------------------------------------------
AttributeError                            Traceback (most recent call last)
<ipython-input-7-8e0130abf25e> in <cell line: 1>()
----> 1 load_model = tf.keras.models.load_model('my_model_test.keras')

13 frames
/usr/local/lib/python3.10/dist-packages/keras/src/backend.py in int_shape(x)
   1530     """
   1531     try:
-> 1532         shape = x.shape
   1533         if not isinstance(shape, tuple):
   1534             shape = tuple(shape.as_list())

AttributeError: 'float' object has no attribute 'shape'


tilakrayal · 2023-09-05T13:50:07Z

@xing-w,
Please take a look at this issue, where a similar feature has been proposed and which is still open. Please also follow that issue for updates on the problem. Thank you!

xing-w · 2023-09-05T15:20:51Z

I think I am facing the same issue with load_model(). For now, I'm using save_weights() and load_weights() to work around this problem. I hope it will be fixed soon.
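
For readers hitting the same error, the weights-only workaround mentioned above might look roughly like the sketch below. It is not taken from the original report: the helper build_model() and the file name my_model_test.weights.h5 are hypothetical, and the sketch assumes the architecture can be rebuilt in code before the weights are loaded into it.

```python
import os
import tempfile

import numpy as np
import tensorflow as tf

BATCH_SIZE = 16
UNITS = 8


def build_model():
    """Hypothetical helper: rebuilds the architecture from the report in code."""
    inputs = tf.keras.layers.Input(shape=(20, 5), batch_size=BATCH_SIZE)
    # Constant initial states, as in the report (not stored as model weights).
    initial_state_h = tf.random.normal(shape=(BATCH_SIZE, UNITS), stddev=10.0)
    initial_state_c = tf.random.normal(shape=(BATCH_SIZE, UNITS), stddev=10.0)
    rnn = tf.keras.layers.RNN(
        tf.keras.layers.LSTMCell(UNITS), stateful=True, return_state=True
    )
    outputs, _, _ = rnn(inputs, initial_state=[initial_state_h, initial_state_c])
    hidden = tf.keras.layers.Dense(16, activation="relu")(outputs)
    out = tf.keras.layers.Dense(2, activation="softmax")(hidden)
    return tf.keras.models.Model(inputs, out)


model = build_model()

# Save only the weights; the architecture is not serialized.
weights_path = os.path.join(tempfile.mkdtemp(), "my_model_test.weights.h5")
model.save_weights(weights_path)

# Rebuild the same architecture, then load the saved weights into it.
restored = build_model()
restored.load_weights(weights_path)

weights_match = all(
    np.array_equal(a, b)
    for a, b in zip(model.get_weights(), restored.get_weights())
)
print("weights match:", weights_match)
```

Note that the initial-state tensors are constants created when the model is built, not weights, so each call to build_model() draws new random values. If the exact initial states matter, they would have to be saved and restored separately (for example with np.save), which this sketch does not do.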

tilakrayal · 2024-01-11T17:43:30Z

@xing-w,
I ran the code above on tf-nightly with Keras 3.0 and it executed without any error. Please find the gist here. Thank you!

github-actions · 2024-01-26T01:48:05Z

This issue is stale because it has been open for 14 days with no activity. It will be closed if no further activity occurs. Thank you.

github-actions · 2024-02-10T01:46:21Z

This issue was closed because it has been inactive for 28 days. Please reopen if you'd like to work on this further.

google-ml-butler bot added the type:bug label Aug 31, 2023

google-ml-butler bot assigned tilakrayal Aug 31, 2023

tilakrayal added the stat:awaiting response from contributor label Sep 5, 2023

google-ml-butler bot removed the stat:awaiting response from contributor label Sep 5, 2023

sachinprasadhs transferred this issue from keras-team/keras Sep 22, 2023

tilakrayal added the stat:awaiting response from contributor label Jan 11, 2024

github-actions bot added the stale label Jan 26, 2024

github-actions bot closed this as completed Feb 10, 2024
