from_tensor_slices() method not working with GPT-2 model in gpt2_causal_lm.py example

Hi team,

I was doing some experiments with gpt2 causal lm, I started by running the examples that are given in the documentation
[Link to documentation.](https://github.com/Neeshamraghav012/keras-nlp/blob/master/keras_nlp/models/gpt2/gpt2_causal_lm.py)
But this example didn't worked,

```
features = [
        "I don't listen to music while coding.",
        "But I watch youtube while coding!",
]

ds = tf.data.Dataset.from_tensor_slices(features)

gpt2_lm = keras_nlp.models.GPT2CausalLM.from_preset(
        "gpt2_base_en",
)
gpt2_lm.compile(
        loss=keras.losses.SparseCategoricalCrossentropy(from_logits=True),
)
gpt2_lm.fit(ds, batch_size=2)
```
Output:
![CI](https://res.cloudinary.com/hire-easy/image/upload/v1678812193/Screenshot_2023-03-14_at_11.24.40_AM_p2ssex.png)

But when I passed the features directly it worked. Here is the working code;
```
features = [
        "I don't listen to music while coding.",
        "But I watch youtube while coding!",
]

ds = tf.data.Dataset.from_tensor_slices(features)

gpt2_lm = keras_nlp.models.GPT2CausalLM.from_preset(
        "gpt2_base_en",
)
gpt2_lm.compile(
        loss=keras.losses.SparseCategoricalCrossentropy(from_logits=True),
)
gpt2_lm.fit(features, batch_size=2)
```
It seems that there might be an issue with the from_tensor_slices() method not working properly with the GPT-2 model in this example. I wanted to report this issue and bring it to the attention of the KerasNLP community. I am very much interested in solving these kinds of bugs that are present in existing pre-trained models.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

from_tensor_slices() method not working with GPT-2 model in gpt2_causal_lm.py example #849

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

from_tensor_slices() method not working with GPT-2 model in gpt2_causal_lm.py example #849

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions