
How to convert an image captioning model to a TensorFlow Lite model? ValueError: Python inputs incompatible with input_signature #42319

Closed · DavidInWuhanChina opened this issue Aug 13, 2020 · 9 comments
Labels: comp:lite (TF Lite related issues) · stale · stat:awaiting response · TF 2.3 · type:bug

DavidInWuhanChina commented Aug 13, 2020

System information

  • OS Platform and Distribution: CentOS Linux release 7.7.1908
  • TensorFlow version: 2.3.0

I am following this example: https://www.tensorflow.org/tutorials/text/image_captioning?hl=en

It works as expected and saves checkpoints, and I now want to convert the model to a TF Lite model.

Here is the link to the full conversion code: https://colab.research.google.com/drive/1GJkGcwWvDAWMooTsECzuSRUSPbirADhb?usp=sharing

Here is the link to the full training code:
https://colab.research.google.com/drive/1X2d9WW1EMEzN8Rgva3rtjevP0T_jFccj?usp=sharing

I am also following issue #32999.

Here is what I am running to save and then convert the inference graph:

@tf.function
def evaluate(image):
    hidden = decoder.reset_states(batch_size=1)

    # Extract image features and flatten the spatial grid.
    temp_input = tf.expand_dims(load_image(image)[0], 0)
    img_tensor_val = image_features_extract_model(temp_input)
    img_tensor_val = tf.reshape(
        img_tensor_val, (img_tensor_val.shape[0], -1, img_tensor_val.shape[3]))

    features = encoder(img_tensor_val)

    # First decoder input is the <start> token; built from a Python int,
    # so this tensor is created as int32.
    dec_input = tf.expand_dims([tokenizer.word_index['<start>']], 0)
    result = []

    for i in range(max_length):
        predictions, hidden, attention_weights = decoder(dec_input, features, hidden)

        # tf.random.categorical returns int64 samples.
        predicted_id = tf.random.categorical(predictions, 1)[0][0]
        print(predicted_id, predicted_id.dtype)

        result.append(predicted_id)

        # The <end>-token check is disabled because tokenizer.index_word
        # cannot be indexed with a tensor inside a tf.function:
        # if tokenizer.index_word[predicted_id] == '<end>':
        #     return result

        dec_input = tf.expand_dims([predicted_id], 0)

    return result

export_dir = "./"
tflite_enc_input = ''
ckpt.f = evaluate
to_save = evaluate.get_concrete_function('')

converter = tf.lite.TFLiteConverter.from_concrete_functions([to_save])
tflite_model = converter.convert()

but I get this error:

ValueError: in user code:

    convert2savedmodel.py:310 evaluate  *
        predictions, hidden, attention_weights = decoder(dec_input, features, hidden)
    /share/nishome/19930072_0/miniconda3/envs/tf2.3/lib/python3.7/site-packages/tensorflow/python/keras/engine/base_layer.py:985 __call__  **
        outputs = call_fn(inputs, *args, **kwargs)
    /share/nishome/19930072_0/miniconda3/envs/tf2.3/lib/python3.7/site-packages/tensorflow/python/eager/def_function.py:780 __call__
        result = self._call(*args, **kwds)
    /share/nishome/19930072_0/miniconda3/envs/tf2.3/lib/python3.7/site-packages/tensorflow/python/eager/def_function.py:840 _call
        return self._stateless_fn(*args, **kwds)
    /share/nishome/19930072_0/miniconda3/envs/tf2.3/lib/python3.7/site-packages/tensorflow/python/eager/function.py:2828 __call__
        graph_function, args, kwargs = self._maybe_define_function(args, kwargs)
    /share/nishome/19930072_0/miniconda3/envs/tf2.3/lib/python3.7/site-packages/tensorflow/python/eager/function.py:3171 _maybe_define_function
        *args, **kwargs)
    /share/nishome/19930072_0/miniconda3/envs/tf2.3/lib/python3.7/site-packages/tensorflow/python/eager/function.py:2622 canonicalize_function_inputs
        self._flat_input_signature)
    /share/nishome/19930072_0/miniconda3/envs/tf2.3/lib/python3.7/site-packages/tensorflow/python/eager/function.py:2713 _convert_inputs_to_signature
        format_error_message(inputs, input_signature))

    ValueError: Python inputs incompatible with input_signature:
      inputs: (
        Tensor("ExpandDims_1:0", shape=(1, 1), dtype=int32),
        Tensor("cnn__encoder/StatefulPartitionedCall:0", shape=(1, 64, 256), dtype=float32),
        Tensor("zeros:0", shape=(1, 512), dtype=float32))
      input_signature: (
        TensorSpec(shape=(1, 1), dtype=tf.int64, name=None),
        TensorSpec(shape=(1, 64, 256), dtype=tf.float32, name=None),
        TensorSpec(shape=(1, 512), dtype=tf.float32, name=None))
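
For reference, the first decoder input is built from a Python int, so tf.expand_dims([tokenizer.word_index['<start>']], 0) produces an int32 tensor, while the decoder's input_signature declares int64. A minimal sketch of one way to make the initial input match the int64 signature (a sketch, not a confirmed fix):

# Build the first decoder input as int64 so it matches the decoder's
# declared int64 input_signature; tf.random.categorical also returns
# int64, so later loop iterations stay consistent.
dec_input = tf.expand_dims(
    tf.constant([tokenizer.word_index['<start>']], dtype=tf.int64), 0)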

Encoder Model:

class CNN_Encoder(tf.keras.Model):
    def __init__(self, embedding_dim):
        super(CNN_Encoder, self).__init__()
        # shape after fc == (batch_size, 64, embedding_dim)
        self.fc = tf.keras.layers.Dense(embedding_dim)

    @tf.function(input_signature=[tf.TensorSpec(shape=(1, 64, features_shape), dtype=tf.dtypes.float32)])
    def call(self, x):
        x = self.fc(x)
        x = tf.nn.relu(x)
        return x

Decoder model:

class RNN_Decoder(tf.keras.Model):
    def __init__(self, embedding_dim, units, vocab_size):
        super(RNN_Decoder, self).__init__()
        self.units = units

        self.embedding = tf.keras.layers.Embedding(vocab_size, embedding_dim)
        self.gru = tf.keras.layers.GRU(self.units,
                                       return_sequences=True,
                                       return_state=True,
                                       recurrent_initializer='glorot_uniform',
                                       unroll=True)
        self.fc1 = tf.keras.layers.Dense(self.units)
        self.fc2 = tf.keras.layers.Dense(vocab_size)

        self.attention = BahdanauAttention(self.units)

    @tf.function(input_signature=[tf.TensorSpec(shape=[1, 1], dtype=tf.int64),
                                  tf.TensorSpec(shape=[1, 64, 256], dtype=tf.float32),
                                  tf.TensorSpec(shape=[1, 512], dtype=tf.float32)])
    def call(self, x, features, hidden):
        context_vector, attention_weights = self.attention(features, hidden)

        # x shape after passing through embedding == (batch_size, 1, embedding_dim)
        x = self.embedding(x)

        # x shape after concatenation == (batch_size, 1, embedding_dim + hidden_size)
        x = tf.concat([tf.expand_dims(context_vector, 1), x], axis=-1)

        output, state = self.gru(x)

        # shape == (batch_size, max_length, hidden_size)
        x = self.fc1(output)

        # x shape == (batch_size, max_length, hidden_size)
        x = tf.reshape(x, (-1, x.shape[2]))

        # output shape == (batch_size * max_length, vocab)
        x = self.fc2(x)

        return x, state, attention_weights

    def reset_states(self, batch_size):
        return tf.zeros((batch_size, self.units))
@DavidInWuhanChina DavidInWuhanChina added the comp:lite TF Lite related issues label Aug 13, 2020
@Saduf2019 Saduf2019 added TF 2.3 Issues related to TF 2.3 type:bug Bug labels Aug 17, 2020
Saduf2019 (Contributor)

@DavidInWuhanChina
I ran into another error while trying to execute the shared code; please find the gist here. Please share all the dependencies so that we can replicate the error you faced.

@Saduf2019 Saduf2019 added the stat:awaiting response Status - Awaiting response from author label Aug 17, 2020
DavidInWuhanChina (Author)

> @DavidInWuhanChina
> I ran into another error while trying to execute the shared code; please find the gist here. Please share all the dependencies so that we can replicate the error you faced.

The COCO 2017 dataset is so big (13 GB) that I can't upload it to Colab. Please tell me what to do next.

@Saduf2019 Saduf2019 removed the stat:awaiting response Status - Awaiting response from author label Aug 17, 2020
@jvishnuvardhan jvishnuvardhan added the stat:awaiting tensorflower Status - Awaiting response from tensorflower label Aug 18, 2020
karimnosseir (Contributor)

From the error message, it looks like your tf.function expects the first input as int64 while the actual input is int32. Can you try changing the tf.function to expect int32 instead?

Thanks

DavidInWuhanChina (Author)

> From the error message, it looks like your tf.function expects the first input as int64 while the actual input is int32. Can you try changing the tf.function to expect int32 instead?
>
> Thanks

I just changed the tf.function to int32 as below:

@tf.function(input_signature=[tf.TensorSpec(shape=[1, 1], dtype=tf.int32),
                              tf.TensorSpec(shape=[1, 64, 256], dtype=tf.float32),
                              tf.TensorSpec(shape=[1, 512], dtype=tf.float32)])

but another error came:

ValueError: Python inputs incompatible with input_signature:
  inputs: (
    Tensor("ExpandDims_2:0", shape=(1, 1), dtype=int64),
    Tensor("cnn__encoder/StatefulPartitionedCall:0", shape=(1, 64, 256), dtype=float32),
    Tensor("rnn__decoder/StatefulPartitionedCall:1", shape=(1, 512), dtype=float32))
  input_signature: (
    TensorSpec(shape=(1, 1), dtype=tf.int32, name=None),
    TensorSpec(shape=(1, 64, 256), dtype=tf.float32, name=None),
    TensorSpec(shape=(1, 512), dtype=tf.float32, name=None))

Why do the dtypes of the inputs change from int32 to int64?
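
For context, tf.random.categorical always returns int64 samples, so dec_input is int32 only on the first loop iteration (where it is built from a Python int) and becomes int64 once it is rebuilt from predicted_id. A minimal sketch of one way to keep the whole loop at int32 to match the int32 signature (an illustration, not a confirmed fix):

# tf.random.categorical yields int64; casting the sample back to int32
# keeps the rebuilt dec_input consistent with an int32 input_signature.
predicted_id = tf.cast(tf.random.categorical(predictions, 1)[0][0], tf.int32)
dec_input = tf.expand_dims([predicted_id], 0)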

@tensorflowbutler tensorflowbutler removed the stat:awaiting tensorflower Status - Awaiting response from tensorflower label Aug 20, 2020
smilingday commented Nov 19, 2020

@karimnosseir to follow up on the comments again. cc @abattery

@sushreebarsa sushreebarsa self-assigned this Nov 30, 2022
sushreebarsa (Contributor)

@DavidInWuhanChina Could you please refer to this link for avoiding the side effects? Please try to wrap only the TF model in the function exported to TF Lite with the latest TF version (2.11), instead of everything including preprocessing and postprocessing. Thank you!
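
As a rough illustration of that suggestion (assuming the tutorial's encoder and decoder objects), the idea is to convert only the model graphs and keep the Python sampling loop outside the converted functions. A hypothetical sketch:

# Hypothetical sketch: convert each model graph separately, not the loop.
# encoder.call and decoder.call already declare input_signature, so their
# concrete functions can be taken without example arguments.
enc_fn = encoder.call.get_concrete_function()
dec_fn = decoder.call.get_concrete_function()

enc_tflite = tf.lite.TFLiteConverter.from_concrete_functions([enc_fn]).convert()
dec_tflite = tf.lite.TFLiteConverter.from_concrete_functions([dec_fn]).convert()

The <start>-token loop, the tf.random.categorical sampling, and the tokenizer lookups would then run in plain Python around the two TFLite interpreters.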

@sushreebarsa sushreebarsa added the stat:awaiting response Status - Awaiting response from author label Nov 30, 2022
google-ml-butler (bot)

This issue has been automatically marked as stale because it has no recent activity. It will be closed if no further activity occurs. Thank you.

@google-ml-butler google-ml-butler bot added the stale This label marks the issue/pr stale - to be closed automatically if no activity label Dec 7, 2022
google-ml-butler (bot)

Closing as stale. Please reopen if you'd like to work on this further.

