Cloning a TextVectorization Layer with Split Function Doesn't Work #716

rlcauvin · 2024-01-08T23:14:40Z

System information.
Please see the Google Colab notebook. The TensorFlow version is 2.15.0.

Describe the problem.
Cloning a TextVectorization layer with a split function results in
TypeError: Could not parse config: <function pipe_split_fn at 0x7d103d1f7640>

Describe the expected behavior.
It should successfully clone the TextVectorization layer.

Standalone code to reproduce the issue.
https://colab.research.google.com/drive/1tIS0Ynjck80kKGWv1tTQ6oCc8Nj1xnNa?usp=sharing

I originally reported the issue here.

The text was updated successfully, but these errors were encountered:

rlcauvin · 2024-01-08T23:19:12Z

Inspecting the code for TextVectorization (https://github.com/keras-team/keras/blob/master/keras/layers/preprocessing/text_vectorization.py#L491) and deserialize_keras_object (https://github.com/keras-team/keras/blob/master/keras/saving/serialization_lib.py#L392), I see that there is no way the proper logic for deserializing the split function will run. The deserialization code looks for if module_objects is not None:, but TextVectorization.from_config() doesn't pass a module_objects parameter to deserialize_keras_object, so that code block doesn't execute.

As a workaround, I extended the tf.keras.layers.TextVectorization class with:

class PatchedTextVectorization(tf.keras.layers.TextVectorization):

  @classmethod
  def from_config(cls, config):
    if not isinstance(config["standardize"], str):
      config["standardize"] = tf.keras.saving.deserialize_keras_object(config["standardize"])
    if not isinstance(config["split"], str):
      config["split"] = tf.keras.saving.deserialize_keras_object(config["split"], module_objects = [])

    return cls(**config)

Cloning an instance of PatchedTextVectorization constructed with the split function works fine. You can see I shoehorned module_objects = [] into its invocation of tf.keras.saving.deserialize_keras_object.

tilakrayal · 2024-01-11T16:19:15Z

@sachinprasadhs,
I was able to reproduce the issue on tensorflow v2.14, v2.15. Kindly find the gist of it here.

sachinprasadhs · 2024-01-11T21:38:46Z

@rlcauvin , From TensorFlow 2.16, Keras 3 will be the backend for tf.keras, I see this is working fine with Keras 3, that should fix your issue, is there any specific reason you're using tf.keras with 2.15?
You can use tensorflow 2.15 and Keras 3 as well.
install tensorflow first and then install keras 3.

github-actions · 2024-01-26T01:48:04Z

This issue is stale because it has been open for 14 days with no activity. It will be closed if no further activity occurs. Thank you.

rlcauvin · 2024-01-30T21:01:17Z

Thank you, @sachinprasadhs. Using !pip install -U keras to install Keras 3 worked for that isolated case. However, I want to use tensorflow_decision_forests. Unfortunately, import tensorflow_decision_forests as tfdf results in the error below. Please see the modified Google Colab notebook for the full code.

[/usr/local/lib/python3.10/dist-packages/tensorflow_decision_forests/keras/core.py](https://localhost:8080/#) in <module>
     75   # tf>1.12
---> 76   import keras.src.engine.data_adapter as data_adapter
     77 except ImportError:

ModuleNotFoundError: No module named 'keras.src.engine'

During handling of the above exception, another exception occurred:

ModuleNotFoundError                       Traceback (most recent call last)
3 frames
[<ipython-input-3-b4486e63aff0>](https://localhost:8080/#) in <cell line: 1>()
----> 1 import tensorflow_decision_forests as tfdf
      2 import tensorflow as tf
      3 from typing import Text

[/usr/local/lib/python3.10/dist-packages/tensorflow_decision_forests/__init__.py](https://localhost:8080/#) in <module>
     62 check_version.check_version(__version__, compatible_tf_versions)
     63 
---> 64 from tensorflow_decision_forests import keras
     65 from tensorflow_decision_forests.component import py_tree
     66 from tensorflow_decision_forests.component.builder import builder

[/usr/local/lib/python3.10/dist-packages/tensorflow_decision_forests/keras/__init__.py](https://localhost:8080/#) in <module>
     51 from typing import Callable, List
     52 
---> 53 from tensorflow_decision_forests.keras import core
     54 from tensorflow_decision_forests.keras import wrappers
     55 

[/usr/local/lib/python3.10/dist-packages/tensorflow_decision_forests/keras/core.py](https://localhost:8080/#) in <module>
     77 except ImportError:
     78   # tf<=1.12
---> 79   import keras.engine.data_adapter as data_adapter
     80 get_data_handler = data_adapter.get_data_handler
     81 

ModuleNotFoundError: No module named 'keras.engine'

nkovela1 · 2024-02-06T17:57:51Z

Fixed in this commit! Thank you, closing this issue.

rlcauvin added the type:bug label Jan 8, 2024

github-actions bot assigned tilakrayal Jan 8, 2024

rlcauvin mentioned this issue Jan 9, 2024

Cloning a TextVectorization Layer with Split Function Doesn't Work keras-team/keras#18950

Closed

tilakrayal assigned sachinprasadhs and unassigned tilakrayal Jan 11, 2024

sachinprasadhs added the keras-team-review-pending label Jan 11, 2024

sachinprasadhs added stat:awaiting response from contributor and removed keras-team-review-pending labels Jan 11, 2024

github-actions bot added the stale label Jan 26, 2024

sachinprasadhs added keras-team-review-pending and removed stat:awaiting response from contributor stale labels Jan 30, 2024

fchollet assigned nkovela1 Feb 1, 2024

fchollet removed the keras-team-review-pending label Feb 1, 2024

nkovela1 closed this as completed Feb 6, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Cloning a TextVectorization Layer with Split Function Doesn't Work #716

Cloning a TextVectorization Layer with Split Function Doesn't Work #716

rlcauvin commented Jan 8, 2024

rlcauvin commented Jan 8, 2024

tilakrayal commented Jan 11, 2024

sachinprasadhs commented Jan 11, 2024

github-actions bot commented Jan 26, 2024

rlcauvin commented Jan 30, 2024 •

edited

Loading

nkovela1 commented Feb 6, 2024

Cloning a TextVectorization Layer with Split Function Doesn't Work #716

Cloning a TextVectorization Layer with Split Function Doesn't Work #716

Comments

rlcauvin commented Jan 8, 2024

rlcauvin commented Jan 8, 2024

tilakrayal commented Jan 11, 2024

sachinprasadhs commented Jan 11, 2024

github-actions bot commented Jan 26, 2024

rlcauvin commented Jan 30, 2024 • edited Loading

nkovela1 commented Feb 6, 2024

rlcauvin commented Jan 30, 2024 •

edited

Loading