
model.save() does not save keras model that includes DIstillBert layer #4444

Closed
msahamed opened this issue May 18, 2020 · 21 comments · Fixed by #14361

Comments

@msahamed

🐛 Bug

Information

I am trying to build a Keras model where I use DistilBERT as a non-trainable embedding layer. The model compiles and fits well, and even the predict method works. But when I want to save it using model.save('model.h5'), it fails and shows the following error:

> ---------------------------------------------------------------------------
> NotImplementedError                       Traceback (most recent call last)
> <ipython-input-269-557c9cec7497> in <module>
> ----> 1 model.get_config()
> 
> /usr/local/lib/python3.7/site-packages/tensorflow/python/keras/engine/network.py in get_config(self)
>     966     if not self._is_graph_network:
>     967       raise NotImplementedError
> --> 968     return copy.deepcopy(get_network_config(self))
>     969 
>     970   @classmethod
> 
> /usr/local/lib/python3.7/site-packages/tensorflow/python/keras/engine/network.py in get_network_config(network, serialize_layer_fn)
>    2117           filtered_inbound_nodes.append(node_data)
>    2118 
> -> 2119     layer_config = serialize_layer_fn(layer)
>    2120     layer_config['name'] = layer.name
>    2121     layer_config['inbound_nodes'] = filtered_inbound_nodes
> 
> /usr/local/lib/python3.7/site-packages/tensorflow/python/keras/utils/generic_utils.py in serialize_keras_object(instance)
>     273         return serialize_keras_class_and_config(
>     274             name, {_LAYER_UNDEFINED_CONFIG_KEY: True})
> --> 275       raise e
>     276     serialization_config = {}
>     277     for key, item in config.items():
> 
> /usr/local/lib/python3.7/site-packages/tensorflow/python/keras/utils/generic_utils.py in serialize_keras_object(instance)
>     268     name = get_registered_name(instance.__class__)
>     269     try:
> --> 270       config = instance.get_config()
>     271     except NotImplementedError as e:
>     272       if _SKIP_FAILED_SERIALIZATION:
> 
> /usr/local/lib/python3.7/site-packages/tensorflow/python/keras/engine/network.py in get_config(self)
>     965   def get_config(self):
>     966     if not self._is_graph_network:
> --> 967       raise NotImplementedError
>     968     return copy.deepcopy(get_network_config(self))
>     969 
> 
> NotImplementedError: 

The language I am using the model in is English.

The problem arises when using my own modified scripts: (give details below)

import tensorflow as tf
from transformers import DistilBertConfig, TFDistilBertModel, DistilBertTokenizer
max_len = 8
distil_bert = 'distilbert-base-uncased'
config = DistilBertConfig(dropout=0.2, attention_dropout=0.2)
config.output_hidden_states = False
transformer_model = TFDistilBertModel.from_pretrained(distil_bert, config = config)

input_word_ids = tf.keras.layers.Input(shape=(max_len,), dtype = tf.int32, name = "input_word_ids")
distill_output =  transformer_model(input_word_ids)[0]

cls_out = tf.keras.layers.Lambda(lambda seq: seq[:, 0, :])(distill_output)
X = tf.keras.layers.BatchNormalization()(cls_out)
X = tf.keras.layers.Dense(256, activation='relu')(X)
X = tf.keras.layers.Dropout(0.2)(X)

X = tf.keras.layers.BatchNormalization()(X)
X = tf.keras.layers.Dense(128, activation='relu')(X)
X = tf.keras.layers.Dropout(0.2)(X)

X = tf.keras.layers.BatchNormalization()(X)
X = tf.keras.layers.Dense(64, activation='relu')(X)
X = tf.keras.layers.Dropout(0.2)(X)

X = tf.keras.layers.Dense(2)(X)
model = tf.keras.Model(inputs=input_word_ids, outputs=X)

for layer in model.layers[:3]:
    layer.trainable = False

The tasks I am working on is my own dataset.

To reproduce

Steps to reproduce the behavior:

  1. Run the above code
  2. You will get the error when saving the model as
model.save('model.h5')

You can get the same error if you try:

model.get_config()

An interesting observation: if you save the model without specifying ".h5", like

model.save('./model')

it saves the model as TensorFlow saved_model format and creates folders (assets (empty), variables, and some index files). But if you try to load the model, it produces different errors related to the DistillBert/Bert. It may be due to some naming inconsistency (input_ids vs. inputs, see below) inside the DistillBert model.


new_model = tf.keras.models.load_model('./model')

---------------------------------------------------------------------------
ValueError                                Traceback (most recent call last)
/usr/local/lib/python3.7/site-packages/tensorflow/python/util/nest.py in assert_same_structure(nest1, nest2, check_types, expand_composites)
    377     _pywrap_utils.AssertSameStructure(nest1, nest2, check_types,
--> 378                                       expand_composites)
    379   except (ValueError, TypeError) as e:

ValueError: The two structures don't have the same nested structure.

First structure: type=dict str={'input_ids': TensorSpec(shape=(None, 5), dtype=tf.int32, name='input_ids')}

Second structure: type=TensorSpec str=TensorSpec(shape=(None, 8), dtype=tf.int32, name='inputs')

More specifically: Substructure "type=dict str={'input_ids': TensorSpec(shape=(None, 5), dtype=tf.int32, name='input_ids')}" is a sequence, while substructure "type=TensorSpec str=TensorSpec(shape=(None, 8), dtype=tf.int32, name='inputs')" is not

During handling of the above exception, another exception occurred:

ValueError                                Traceback (most recent call last)
<ipython-input-229-b46ed71fd9ad> in <module>
----> 1 new_model = tf.keras.models.load_model(keras_model_path)

/usr/local/lib/python3.7/site-packages/tensorflow/python/keras/saving/save.py in load_model(filepath, custom_objects, compile)
    188     if isinstance(filepath, six.string_types):
    189       loader_impl.parse_saved_model(filepath)
--> 190       return saved_model_load.load(filepath, compile)
    191 
    192   raise IOError(

/usr/local/lib/python3.7/site-packages/tensorflow/python/keras/saving/saved_model/load.py in load(path, compile)
    114   # TODO(kathywu): Add saving/loading of optimizer, compiled losses and metrics.
    115   # TODO(kathywu): Add code to load from objects that contain all endpoints
--> 116   model = tf_load.load_internal(path, loader_cls=KerasObjectLoader)
    117 
    118   # pylint: disable=protected-access

/usr/local/lib/python3.7/site-packages/tensorflow/python/saved_model/load.py in load_internal(export_dir, tags, loader_cls)
    602       loader = loader_cls(object_graph_proto,
    603                           saved_model_proto,
--> 604                           export_dir)
    605       root = loader.get(0)
    606       if isinstance(loader, Loader):

/usr/local/lib/python3.7/site-packages/tensorflow/python/keras/saving/saved_model/load.py in __init__(self, *args, **kwargs)
    186     self._models_to_reconstruct = []
    187 
--> 188     super(KerasObjectLoader, self).__init__(*args, **kwargs)
    189 
    190     # Now that the node object has been fully loaded, and the checkpoint has

/usr/local/lib/python3.7/site-packages/tensorflow/python/saved_model/load.py in __init__(self, object_graph_proto, saved_model_proto, export_dir)
    121       self._concrete_functions[name] = _WrapperFunction(concrete_function)
    122 
--> 123     self._load_all()
    124     self._restore_checkpoint()
    125 

/usr/local/lib/python3.7/site-packages/tensorflow/python/keras/saving/saved_model/load.py in _load_all(self)
    213 
    214     # Finish setting up layers and models. See function docstring for more info.
--> 215     self._finalize_objects()
    216 
    217   @property

/usr/local/lib/python3.7/site-packages/tensorflow/python/keras/saving/saved_model/load.py in _finalize_objects(self)
    504         layers_revived_from_saved_model.append(node)
    505 
--> 506     _finalize_saved_model_layers(layers_revived_from_saved_model)
    507     _finalize_config_layers(layers_revived_from_config)
    508 

/usr/local/lib/python3.7/site-packages/tensorflow/python/keras/saving/saved_model/load.py in _finalize_saved_model_layers(layers)
    675       call_fn = _get_keras_attr(layer).call_and_return_conditional_losses
    676       if call_fn.input_signature is None:
--> 677         inputs = infer_inputs_from_restored_call_function(call_fn)
    678       else:
    679         inputs = call_fn.input_signature[0]

/usr/local/lib/python3.7/site-packages/tensorflow/python/keras/saving/saved_model/load.py in infer_inputs_from_restored_call_function(fn)
    919   for concrete in fn.concrete_functions[1:]:
    920     spec2 = concrete.structured_input_signature[0][0]
--> 921     spec = nest.map_structure(common_spec, spec, spec2)
    922   return spec
    923 

/usr/local/lib/python3.7/site-packages/tensorflow/python/util/nest.py in map_structure(func, *structure, **kwargs)
    609   for other in structure[1:]:
    610     assert_same_structure(structure[0], other, check_types=check_types,
--> 611                           expand_composites=expand_composites)
    612 
    613   flat_structure = [flatten(s, expand_composites) for s in structure]

/usr/local/lib/python3.7/site-packages/tensorflow/python/util/nest.py in assert_same_structure(nest1, nest2, check_types, expand_composites)
    383                   "Entire first structure:\n%s\n"
    384                   "Entire second structure:\n%s"
--> 385                   % (str(e), str1, str2))
    386 
    387 

ValueError: The two structures don't have the same nested structure.

First structure: type=dict str={'input_ids': TensorSpec(shape=(None, 5), dtype=tf.int32, name='input_ids')}

Second structure: type=TensorSpec str=TensorSpec(shape=(None, 8), dtype=tf.int32, name='inputs')

More specifically: Substructure "type=dict str={'input_ids': TensorSpec(shape=(None, 5), dtype=tf.int32, name='input_ids')}" is a sequence, while substructure "type=TensorSpec str=TensorSpec(shape=(None, 8), dtype=tf.int32, name='inputs')" is not
Entire first structure:
{'input_ids': .}
Entire second structure:
.

Expected behavior

I expect to have a normal saving and loading of the model.

Environment info

  • transformers version: 2.9.1
  • Platform:
  • Python version: 3.7.6
  • Tensorflow version (CPU): 2.2.0
  • Using GPU in script?: No
  • Using distributed or parallel set-up in script?: No
@msahamed msahamed changed the title model.save() does not work DIstillBert with Keras squential model model.save() does not save keras model that includes DIstillBert layer May 18, 2020
@sajib-kumar

Same issue

@LysandreJik
Member

Hi, we don't fully support saving/loading these models using keras' save/load methods (yet). In the meantime, please use model.from_pretrained or model.save_pretrained, which also saves the configuration file.

@msahamed
Author

Hello @LysandreJik ,
Thank you for the information.

Could you point me in the right direction and tell me a little more about the implementation procedure, so that I can do some research and possibly implement the methods? If everything goes well, I could make a pull request that might benefit others as well.

Sabber

@pdegner

pdegner commented Jul 31, 2020

I had this exact error. I got around it by saving the weights together with the code that creates the model. After training your model, run model.save_weights('path/savefile'). Note there is no .h5 on it.

When you want to reuse the model later, run your code until model.compile(). Then, model.load_weights('path/savefile').

@azayz

azayz commented Aug 10, 2020

Thanks, works perfectly

@rdisipio

Does this work now with newer versions?

@Somabhadra

I am also facing the same issue. Is there any solution?

@Vadym-Hadetskyi

The issue still occurs on TF 2.6.0, which is very disappointing.
I tried training on Colab's TPU and on GPU.

  • For the TPU case, I did not find a way to save and then load the model properly;
  • For the GPU case, model.save() throws a 'NotImplemented' error. However, saving weights and then loading them into a compiled model works:
  1. Save the weights, either with callbacks or with model.save_weights;
  2. When you need the model for inference, first create a model with the same architecture that was used for training (I packed everything into a create_model() function to ensure the architecture is the same);
  3. Compile the model;
  4. Use model.load_weights.
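The steps above can be sketched end to end; create_model() below is a stand-in for whatever function builds your real architecture (a small plain-Keras model is used here so the pattern stands on its own):

```python
import tensorflow as tf

def create_model():
    # Rebuild exactly the architecture that was used for training
    inputs = tf.keras.layers.Input(shape=(8,), dtype=tf.float32)
    x = tf.keras.layers.Dense(16, activation="relu")(inputs)
    outputs = tf.keras.layers.Dense(2)(x)
    model = tf.keras.Model(inputs=inputs, outputs=outputs)
    model.compile(optimizer="adam", loss="mse")
    return model

# Training side: save only the weights (note: no ".h5" suffix,
# so TensorFlow's checkpoint format is used)
model = create_model()
model.save_weights("ckpt/weights")

# Inference side: build the same architecture, then load the weights into it
new_model = create_model()
new_model.load_weights("ckpt/weights")
```

Because the restored model has identical architecture and weights, its predictions match the original exactly.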

@LysandreJik
Member

cc @Rocketknight1

@Zahlii
Contributor

Zahlii commented Nov 5, 2021

This still occurs, not only with distilbert but also with many other models. I don't see why this issue was closed. The described workaround is quite cumbersome and error-prone, and I don't see why this cannot be implemented inside the library, given that the configuration machinery should already be in place to allow overriding the get_config / from_config methods.

@Rocketknight1
Member

Hi, TF maintainer here! You're right, and we're going to reopen this one. We're very constrained on time right now, though - I'll try to investigate it as soon as I get the chance.

@Rocketknight1 Rocketknight1 reopened this Nov 9, 2021
@Zahlii
Contributor

Zahlii commented Nov 9, 2021

Thanks for reopening this. I think I was able to work around it by using the model.distilbert property, which itself is the base layer. Maybe it would be as simple as returning the base layer's get_config/from_config with some tweaks?

@Rocketknight1
Member

@Zahlii You are correct: the underlying issue is simply that get_config and from_config were never implemented correctly for most Transformers models! We only got away with it for this long because a lot of the standard training setups never called them. We're working on a PR right now.
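For anyone wondering what "implemented correctly" means here: Keras can only serialize a layer if get_config returns everything needed to rebuild it, and from_config inverts that. A toy illustration of the pattern (not the actual PR code):

```python
import tensorflow as tf

class ConfigBackedLayer(tf.keras.layers.Layer):
    """Toy layer whose state lives in plain constructor arguments,
    mirroring how transformers models carry their configuration."""

    def __init__(self, hidden_size=64, **kwargs):
        super().__init__(**kwargs)
        self.hidden_size = hidden_size
        self.dense = tf.keras.layers.Dense(hidden_size)

    def call(self, inputs):
        return self.dense(inputs)

    def get_config(self):
        # Merge the base Layer config (name, dtype, ...) with
        # everything needed to reconstruct this layer
        config = super().get_config()
        config.update({"hidden_size": self.hidden_size})
        return config

    @classmethod
    def from_config(cls, config):
        # Invert get_config: rebuild the layer from its config dict
        return cls(**config)
```

With both methods in place, model.get_config() no longer raises NotImplementedError for models containing such a layer, which is exactly what model.save() to HDF5 needs.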

@Rocketknight1
Member

We've attempted a patch at #14361 - if anyone has any suggestions, or wants to try it out, please let us know! You can test the PR branch with pip install git+https://github.com/huggingface/transformers.git@add_get_config

@Rocketknight1
Member

The patch has now been merged. It'll be in the next release, or if anyone else is encountering this issue before then, you can install from master with pip install git+https://github.com/huggingface/transformers.git

@skbaur

skbaur commented Dec 7, 2021

Since the patch in #14361 has been reverted, is there a timeline for a fix? (Or is there a known workaround one could use?) Thanks :)

@Rocketknight1
Member

Rocketknight1 commented Dec 7, 2021

@skbaur Although that patch was reverted, we quickly followed up with a fixed one at #14415 , so the issue should now be resolved. If you're still encountering this issue after updating to the most recent version of Transformers, please let me know!

@skbaur

skbaur commented Dec 7, 2021

> @skbaur Although that patch was reverted, we quickly followed up with a fixed one at #14415 , so the issue should now be resolved. If you're still encountering this issue after updating to the most recent version of Transformers, please let me know!

Hi @Rocketknight1 , thanks for your reply! You are right, it does work when saving in the tensorflow format (not hdf5). This does solve the issue I was facing.

What did not work for me was this (minimal example adapted from #14430 ):

import tensorflow as tf
import transformers
import sys

print(sys.version)
print(tf.__version__)
print(transformers.__version__)

bert = transformers.TFBertModel(transformers.BertConfig())
input_ids = tf.keras.layers.Input(shape=(512,), dtype=tf.int32)
model = tf.keras.Model(inputs=[input_ids], outputs=[bert(input_ids).last_hidden_state])
model.compile()

# tf.keras.models.save_model(model, "model_tf", save_format='tf')  # This works
tf.keras.models.save_model(model, "model_h5.h5", save_format='h5') # This fails

Output:

3.6.9 (default, Oct  8 2020, 12:12:24) 
[GCC 8.4.0]
2.4.4
4.12.5

and then it fails with

~/.local/lib/python3.6/site-packages/tensorflow/python/keras/engine/functional.py in get_network_config(network, serialize_layer_fn)
   1347         filtered_inbound_nodes.append(node_data)
   1348 
-> 1349     layer_config = serialize_layer_fn(layer)
   1350     layer_config['name'] = layer.name
   1351     layer_config['inbound_nodes'] = filtered_inbound_nodes

~/.local/lib/python3.6/site-packages/tensorflow/python/keras/utils/generic_utils.py in serialize_keras_object(instance)
    248         return serialize_keras_class_and_config(
    249             name, {_LAYER_UNDEFINED_CONFIG_KEY: True})
--> 250       raise e
    251     serialization_config = {}
    252     for key, item in config.items():

~/.local/lib/python3.6/site-packages/tensorflow/python/keras/utils/generic_utils.py in serialize_keras_object(instance)
    243     name = get_registered_name(instance.__class__)
    244     try:
--> 245       config = instance.get_config()
    246     except NotImplementedError as e:
    247       if _SKIP_FAILED_SERIALIZATION:

~/.local/lib/python3.6/site-packages/tensorflow/python/keras/engine/training.py in get_config(self)
   2247 
   2248   def get_config(self):
-> 2249     raise NotImplementedError
   2250 
   2251   @classmethod

NotImplementedError: 

@Rocketknight1
Member

Hi @skbaur, your code runs fine for me! Here's my outputs:

3.9.6 (default, Aug 18 2021, 19:38:01) 
[GCC 7.5.0]
2.6.0
4.13.0.dev0

Can you try, in order:

  1. Installing transformers from master with pip install git+https://github.com/huggingface/transformers.git
  2. Updating TF to version 2.6 or 2.7

and let me know if either of those fixes it for you?

@skbaur

skbaur commented Dec 7, 2021

> Hi @skbaur, your code runs fine for me! Here's my outputs:
>
> 3.9.6 (default, Aug 18 2021, 19:38:01) 
> [GCC 7.5.0]
> 2.6.0
> 4.13.0.dev0
>
> Can you try, in order:
>
>   1. Installing transformers from master with pip install git+https://github.com/huggingface/transformers.git
>   2. Updating TF to version 2.6 or 2.7
>
> and let me know if either of those fixes it for you?

Option 1 already seems to work (installing transformers from master with pip install git+https://github.com/huggingface/transformers.git, without updating TF).

The error reappears when downgrading back to transformers 4.12.5.

@Rocketknight1
Member

@skbaur It seems like one of the relevant PRs didn't make it into the release, in that case - please use the master version for now, and hopefully once 4.13 is released you can just use that instead!
