Saving TFVisionEncoderDecoderModel as SavedModel: The following keyword arguments are not supported by this model: ['attention_mask', 'token_type_ids']. #22731

Closed
DevinTDHa opened this issue Apr 12, 2023 · 9 comments · Fixed by #22743

@DevinTDHa

System Info

  • transformers version: 4.27.4
  • Platform: Linux-6.2.6-76060206-generic-x86_64-with-debian-bookworm-sid
  • Python version: 3.7.16
  • Huggingface_hub version: 0.13.4
  • PyTorch version (GPU?): 1.13.1 (False)
  • Tensorflow version (GPU?): 2.11.0 (False)
  • Flax version (CPU?/GPU?/TPU?): not installed (NA)
  • Jax version: not installed
  • JaxLib version: not installed
  • Using GPU in script?: False
  • Using distributed or parallel set-up in script?: False

Who can help?

@gante Could be related to #16400?

Information

  • The official example scripts
  • My own modified scripts

Tasks

  • An officially supported task in the examples folder (such as GLUE/SQuAD, ...)
  • My own task or dataset (give details below)

Reproduction

Hello,

I am trying to save a TFVisionEncoderDecoderModel in the SavedModel format. Specifically, I am using the nlpconnect/vit-gpt2-image-captioning pretrained model. The model initializes fine from the PyTorch checkpoint, but when I try to save it as a SavedModel, it fails with the following error:

ValueError: The following keyword arguments are not supported by this model: ['attention_mask', 'token_type_ids'].

Link to Google Colab Reproduction:
https://colab.research.google.com/drive/1N2TVejxiBT5S7bRJ2LSmJ8IIR45folGA#scrollTo=aIL92KqPDDjf
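
For reference, a minimal sketch of what triggers the error (my own reconstruction; the notebook may differ in details):

from transformers import TFVisionEncoderDecoderModel

MODEL_NAME = "nlpconnect/vit-gpt2-image-captioning"
model = TFVisionEncoderDecoderModel.from_pretrained(MODEL_NAME, from_pt=True)

# Exporting the SavedModel goes through the default serving signature and fails with:
# ValueError: The following keyword arguments are not supported by this model:
# ['attention_mask', 'token_type_ids'].
model.save_pretrained(f"exports/{MODEL_NAME}", saved_model=True)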

Thanks for your time!

Expected behavior

The model should be saved as a SavedModel without problems, just as other pretrained models are.

@amyeroberts
Collaborator

cc @ydshieh

@ydshieh ydshieh self-assigned this Apr 13, 2023
@ydshieh
Collaborator

ydshieh commented Apr 13, 2023

Hi @DevinTDHa, just a quick update: instead of input_ids in the signature, we have to use decoder_input_ids, as the text inputs go to the decoder.

                "pixel_values": tf.TensorSpec((None, None, None, None), tf.float32, name="pixel_values"),
                "decoder_input_ids": tf.TensorSpec((None, None), tf.int32, name="decoder_input_ids"),

This change fixes the error you mentioned, but saving still fails due to other problems; I am still looking into how to fix them.

@ydshieh
Collaborator

ydshieh commented Apr 13, 2023

Two extra steps are needed to make saving work:

  • First, after model = TFVisionEncoderDecoderModel.from_pretrained(MODEL_NAME, from_pt=True) in your code, add
    model.config.torch_dtype = None
  • Then, in the file src/transformers/models/vision_encoder_decoder/modeling_tf_vision_encoder_decoder.py, for the class TFVisionEncoderDecoderModel, change the method from
        def serving_output(self, output):
            pkv = tf.tuple(output.past_key_values)[1] if self.config.use_cache else None
            ...
    to
        def serving_output(self, output):
            pkv = tf.tuple(output.past_key_values)[1] if self.config.decoder.use_cache else None
            ...

You can make these changes in your own fork if you want to proceed quickly.
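
Putting the two steps together, a rough sketch of the user-code side (assuming the serving_output edit above is applied in a local fork; model name and export path as in the original report):

from transformers import TFVisionEncoderDecoderModel

MODEL_NAME = "nlpconnect/vit-gpt2-image-captioning"
EXPORT_PATH = f"exports/{MODEL_NAME}"

model = TFVisionEncoderDecoderModel.from_pretrained(MODEL_NAME, from_pt=True)
# Step 1: clear the torch dtype picked up from the PyTorch checkpoint.
model.config.torch_dtype = None
# With the serving_output edit (step 2) and the decoder_input_ids signature from
# the earlier comment in place, the export should go through.
model.save_pretrained(EXPORT_PATH, saved_model=True)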

I will discuss the fix in our codebase with the team.

@DevinTDHa
Author

Thanks a lot, especially for the suggested edits!

@ydshieh
Collaborator

ydshieh commented Apr 13, 2023

@DevinTDHa

In fact, what I did that works is to add the following block to the class TFVisionEncoderDecoderModel in the file src/transformers/models/vision_encoder_decoder/modeling_tf_vision_encoder_decoder.py:

@tf.function(
    input_signature=[
        {
            "pixel_values": tf.TensorSpec((None, None, None, None), tf.float32, name="pixel_values"),
            "decoder_input_ids": tf.TensorSpec((None, None), tf.int32, name="decoder_input_ids"),
        }
    ]
)
def serving(self, inputs):
    """
    Method used for serving the model.

    Args:
        inputs (`Dict[str, tf.Tensor]`):
            The input of the saved model as a dictionary of tensors.
    """
    output = self.call(inputs)

    return self.serving_output(output)
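
With that method added (and the two fixes from the earlier comment in place), saving works and the exported signature can be exercised on the reloaded SavedModel, roughly like this (my own sketch with dummy inputs; paths illustrative):

import tensorflow as tf

model.save_pretrained("exports/vit-gpt2-image-captioning", saved_model=True)

# save_pretrained(..., saved_model=True) writes the SavedModel under saved_model/1.
loaded = tf.saved_model.load("exports/vit-gpt2-image-captioning/saved_model/1")
serving_fn = loaded.signatures["serving_default"]
outputs = serving_fn(
    pixel_values=tf.zeros((1, 3, 224, 224), tf.float32),  # dummy image batch (channels first)
    decoder_input_ids=tf.constant([[50256]], tf.int32),   # dummy GPT-2 BOS token id
)
print(outputs.keys())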

I am not sure why the approach in your notebook (i.e. specifying serving_fn explicitly) doesn't work.

@ydshieh
Collaborator

ydshieh commented Apr 13, 2023

The fixes have been merged into the main branch. The only thing left to do manually is to add the correct input_signature in the proper place, as shown in the comment above. I believe this cannot be done in the transformers codebase itself, but you can still do it in your own fork.

I will discuss with our TF experts why specifying signatures the way you did does not work, but I am going to close this issue. If you still have any related questions, don't hesitate to leave a comment 🤗

@ydshieh
Collaborator

ydshieh commented Apr 14, 2023

Hi @Rocketknight1 Since you are a TF saving expert 🔥, could you take a look at the code snippet below and see why it doesn't work when we specify signatures manually, please? (It works if I add a serving method to TFVisionEncoderDecoderModel directly.)

(You have to pull the main branch first to incorporate the 2 fixes.)

Thank you in advanceeeeeeee ~

import tensorflow as tf
from transformers import TFVisionEncoderDecoderModel

# load a fine-tuned image captioning model and corresponding tokenizer and image processor
MODEL_NAME = "nlpconnect/vit-gpt2-image-captioning"
model = TFVisionEncoderDecoderModel.from_pretrained(MODEL_NAME, from_pt=True)
EXPORT_PATH = f"exports/{MODEL_NAME}"

# ========================================================================================================================
# This works

# Add this block to `TFVisionEncoderDecoderModel` in `src/transformers/models/vision_encoder_decoder/modeling_tf_vision_encoder_decoder.py`
"""
    @tf.function(
        input_signature=[
            {
                "pixel_values": tf.TensorSpec((None, None, None, None), tf.float32, name="pixel_values"),
                "decoder_input_ids": tf.TensorSpec((None, None), tf.int32, name="decoder_input_ids"),
            }
        ]
    )
    def serving(self, inputs):
        output = self.call(inputs)
        return self.serving_output(output)
"""
# model.save_pretrained(
#     EXPORT_PATH,
#     saved_model=True,
#     # signatures={"serving_default": my_serving_fn},
# )
# ========================================================================================================================
# Not working (without changing `TFVisionEncoderDecoderModel`)

@tf.function(
    input_signature=[
        {
            "pixel_values": tf.TensorSpec((None, None, None, None), tf.float32, name="pixel_values"),
            "decoder_input_ids": tf.TensorSpec((None, None), tf.int32, name="decoder_input_ids"),
        }
    ]
)
def my_serving_fn(inputs):
    output = model.call(inputs)
    return model.serving_output(output)

# This fails
model.save_pretrained(
    EXPORT_PATH,
    saved_model=True,
    signatures={"serving_default": my_serving_fn},
)
# ========================================================================================================================

@DevinTDHa
Author

@ydshieh I have a question regarding this actually:

Currently I'm trying to access the decoder (GPT-2) from the saved model, but as far as I can tell it is not possible. The default serving signature you suggested outputs the encoder (ViT) outputs only (or am I wrong in this regard?)

However, trying to create a serving function for model.generate() seems to cause the same error as saving the model with a custom signature. Would this be possible in theory (combining encoder and decoder in one serving function)?

@ydshieh
Collaborator

ydshieh commented Apr 19, 2023

> @ydshieh I have a question regarding this actually:
>
> Currently I'm trying to access the decoder (GPT-2) from the saved model, but as far as I can tell it is not possible. The default serving signature you suggested outputs the encoder (ViT) outputs only (or am I wrong in this regard?)

I believe it gives the outputs of both the encoder and the decoder. But if you find that is not the case, please open a new issue and we will be more than happy to look into it 🤗.

> However, trying to create a serving function for model.generate() seems to cause the same error as saving the model with a custom signature.

I have never created a SavedModel with generate and I am not sure it will work in most cases - @gante do you know whether this is supposed to work (in most cases)? cc @Rocketknight1 too.

> Would this be possible in theory (combining encoder and decoder in one serving function)?

See my comment in the first paragraph 😃
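
For what it's worth, one way to check which outputs end up in the export (a sketch; the path follows the EXPORT_PATH from the snippet above) is to inspect the structured outputs of the serving signature:

import tensorflow as tf

loaded = tf.saved_model.load("exports/nlpconnect/vit-gpt2-image-captioning/saved_model/1")
serving_fn = loaded.signatures["serving_default"]
# The keys should include the decoder logits alongside the encoder outputs
# if both halves are exported.
print(serving_fn.structured_outputs)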
