Keras load LSTM/GRU model with constant mask/initial_state will raise error #390

RunnerZhong · 2022-10-21T02:16:26Z

Issue Type
Bug

Source
binary

Tensorflow Version
TF 2.6.3

Custom Code
Yes

OS Platform and Distribution
RedHat 7

Mobile device
No response

Python version
3.8

Bazel version
No response

GCC/Compiler version
No response

CUDA/cuDNN version
No response

GPU model and memory
No response

Current Behaviour?

Traceback (most recent call last):
  File "/home/runner/work/sample_code/example.py", line 28, in
    dd = tf.keras.models.load_model("./lstm.h5", compile=False, options=load_options)
  File "/home/runner/anaconda3/envs/latest_env/lib/python3.8/site-packages/keras/utils/traceback_utils.py", line 67, in error_handler
    raise e.with_traceback(filtered_tb) from None
  File "/home/runner/anaconda3/envs/latest_env/lib/python3.8/site-packages/keras/backend.py", line 1470, in int_shape
    shape = x.shape
AttributeError: 'float' object has no attribute 'shape'
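
One plausible reading of this error, not confirmed anywhere in this thread, is that the constant mask and initial state are written into the saved model config as plain nested lists, so the loader ends up asking a Python float for its shape. The sketch below only illustrates that general effect with NumPy and `json`; it does not call Keras, and the `"initial_state"` key is a hypothetical stand-in for whatever the real config uses.

```python
import json

import numpy as np

# Stand-in for the constant initial state from the reproduction (shape [2, 6]).
state = np.random.RandomState(0).uniform(-1.0, 1.0, size=(2, 6)).astype(np.float32)

# Assumption: the saved config stores the constant as a plain nested list.
config_text = json.dumps({"initial_state": state.tolist()})
restored = json.loads(config_text)["initial_state"]

# After the round trip the leaves are bare Python floats with no shape.
first = restored[0][0]
print(type(first).__name__)     # float
print(hasattr(first, "shape"))  # False

# Converting back to an array restores the shape information.
recovered = np.asarray(restored, dtype=np.float32)
print(recovered.shape)          # (2, 6)
```

If this reading is right, the failure would be independent of whether the model was compiled or trained before saving.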

Standalone code to reproduce the issue

import numpy as np
import tensorflow as tf
import tensorflow.keras.backend as K

input_t = tf.keras.Input(shape=(28, 10), batch_size=2, dtype="float32")

rdm_value = np.ones([2, 28]).astype(np.float32)
rdm_value[:, 20:] = 0
mask_value = K.constant(np.array(rdm_value), dtype='bool')

m_state = tf.keras.initializers.GlorotUniform()(shape=[2, 6], dtype='float32')
c_state = tf.keras.initializers.GlorotUniform()(shape=[2, 6], dtype='float32')
init_state_value = [m_state, c_state]

lstm = tf.keras.layers.LSTM(6,
                            return_sequences=True,
                            return_state=True,
                            bias_initializer='random_uniform',
                            time_major=False)(inputs=input_t, mask=mask_value,
                                              training=False,
                                              initial_state=init_state_value)
keras_model = tf.keras.Model([input_t], lstm)
keras_model.save('./lstm.h5')

load_options = tf.saved_model.LoadOptions(allow_partial_checkpoint=True)
dd = tf.keras.models.load_model("./lstm.h5", compile=False, options=load_options)
print(dd.inputs)

sushreebarsa · 2022-10-21T11:49:48Z

@RunnerZhong Could you have a look at the gist here and confirm the issue?
Thank you!

RunnerZhong · 2022-10-22T14:53:01Z

Yes, this is the issue I ran into.

sachinprasadhs · 2022-11-02T18:33:04Z

Could you try compiling the model and training it with some data, to check whether you see the same behavior as when saving and loading a non-compiled model?
As the warning suggests, the compiled metrics have not been built yet; they will stay empty until you train or evaluate the model.
Additionally, if you want to go ahead with the non-compiled model, you could try saving the weights and loading them back, like below.

keras_model.save_weights('./lstm.h5')
keras_model.load_weights('./lstm.h5')
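
A slightly fuller sketch of this weights-only route, under some assumptions that are not from the thread: the architecture is rebuilt in code by a hypothetical `build_model()` helper, the constant mask and initial state are recreated from a fixed NumPy seed (they are not weights, so `save_weights` does not store them), and the file is named `lstm.weights.h5` because newer Keras releases expect that suffix.

```python
import numpy as np
import tensorflow as tf


def build_model():
    """Rebuild the architecture from the reproduction with deterministic constants."""
    input_t = tf.keras.Input(shape=(28, 10), batch_size=2, dtype="float32")

    # Constant mask: the first 20 timesteps are valid, the rest are masked out.
    mask_np = np.ones([2, 28], dtype=bool)
    mask_np[:, 20:] = False
    mask_value = tf.constant(mask_np)

    # Constants are not weights, so they must be recreated identically on rebuild.
    # A fixed NumPy seed is an assumption of this sketch, not part of the original code.
    rng = np.random.RandomState(0)
    h_state = tf.constant(rng.uniform(-0.5, 0.5, size=(2, 6)).astype(np.float32))
    c_state = tf.constant(rng.uniform(-0.5, 0.5, size=(2, 6)).astype(np.float32))

    outputs = tf.keras.layers.LSTM(6, return_sequences=True, return_state=True)(
        input_t, mask=mask_value, training=False, initial_state=[h_state, c_state]
    )
    return tf.keras.Model([input_t], outputs)


# Save only the weights of the first model.
original = build_model()
original.save_weights("./lstm.weights.h5")

# Rebuild the same graph in code, then load the weights into it.
restored = build_model()
restored.load_weights("./lstm.weights.h5")
```

This sidesteps `load_model` entirely, at the cost of having to keep the model-building code alongside the weights file.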

google-ml-butler · 2022-11-09T19:28:38Z

This issue has been automatically marked as stale because it has no recent activity. It will be closed if no further activity occurs. Thank you.

RunnerZhong · 2022-11-10T03:43:25Z

I added compiling and training before saving the model, but the issue still reproduces.

RunnerZhong · 2022-11-10T03:44:39Z

RunnerZhong · 2022-11-10T03:46:04Z

You can try it too. Thanks for your reply.

sachinprasadhs · 2022-11-16T22:57:01Z

Could you please provide the reproducible code that you used to compile the model, along with sample input? Thanks!

RunnerZhong · 2022-11-17T02:39:45Z

import numpy as np
import tensorflow as tf
import tensorflow.keras.backend as K

input_t = tf.keras.Input(shape=(28, 10), batch_size=2, dtype="float32")

rdm_value = np.ones([2, 28]).astype(np.float32)
rdm_value[:, 20:] = 0
mask_value = K.constant(np.array(rdm_value), dtype='bool')

m_state = tf.keras.initializers.GlorotUniform()(shape=[2, 6], dtype='float32')
c_state = tf.keras.initializers.GlorotUniform()(shape=[2, 6], dtype='float32')
init_state_value = [m_state, c_state]

lstm = tf.keras.layers.LSTM(6,
                            return_sequences=True,
                            return_state=True,
                            bias_initializer='random_uniform',
                            time_major=False)(inputs=input_t, mask=mask_value,
                                              training=True,
                                              initial_state=init_state_value)
keras_model = tf.keras.Model([input_t], lstm)
keras_model.compile(optimizer="adam", loss="mean_squared_error")
test_input = np.random.random(input_t.shape)
test_target = [np.random.random((2, 28, 6)), np.random.random((2, 6)), np.random.random((2, 6))]
keras_model.fit(test_input, test_target)
keras_model.save('./lstm.h5')

load_options = tf.saved_model.LoadOptions(allow_partial_checkpoint=True)
dd = tf.keras.models.load_model("./lstm.h5", compile=True, options=load_options)
print(dd.inputs)

RunnerZhong · 2022-11-17T02:41:27Z

You can reproduce the issue with the sample above. Thanks.

sachinprasadhs · 2023-02-09T00:11:44Z

I was able to reproduce the behavior using tf-nightly (2.13). Please find the gist here. Thanks!

grasskin · 2023-02-09T18:24:13Z

@nkovela1

google-ml-butler bot added the type:bug label Oct 21, 2022

google-ml-butler bot assigned sushreebarsa Oct 21, 2022

sushreebarsa added the stat:awaiting response from contributor label Oct 21, 2022

google-ml-butler bot removed the stat:awaiting response from contributor label Oct 22, 2022

sushreebarsa added the keras-team-review-pending label Oct 30, 2022

sachinprasadhs removed the keras-team-review-pending label Nov 2, 2022

sachinprasadhs assigned sachinprasadhs and unassigned sushreebarsa Nov 2, 2022

sachinprasadhs added the stat:awaiting response from contributor label Nov 2, 2022

google-ml-butler bot removed the stat:awaiting response from contributor label Nov 10, 2022

sachinprasadhs added the stat:awaiting response from contributor label Nov 16, 2022

google-ml-butler bot removed the stat:awaiting response from contributor label Nov 17, 2022

sachinprasadhs added the keras-team-review-pending label Feb 9, 2023

gowthamkpr assigned nkovela1 Feb 9, 2023

grasskin removed the keras-team-review-pending label Feb 9, 2023

tilakrayal mentioned this issue Sep 5, 2023

rnn with initial_state model can't be loaded with load_model #32

Closed

fchollet transferred this issue from keras-team/keras Sep 22, 2023
