Bert training with TPU does not work on Keras Core #18422

martin-gorner · 2023-08-03T14:07:55Z

Repro notebook: https://www.kaggle.com/code/alexia/kerasnlp-starter-notebook-contradictory-dearwatson

This notebook is configured to use keras_nlp and standard tf.keras. It works perfectly.

If reconfigured to use keras_nlp with Keras Core the model stops working (failing version here):

it displays non-sensical accuracies > 1 during training
eval accuracy the same as if the mode was doing random predictions
.predict returns predictions of the wrong shape (shape=(n,) instead of (n,3)
therefore the final np.argmax(predictions, axis=1) fails.
there is nan in the predictions

The behavior does not change when pip installing form GitHub master (latest version) rather than PyPi (latest published package) for both Keras Core and KerasNLP

The config I used to make the model fail:

!pip install keras-core
import os
os.environ['KERAS_BACKEND'] = 'tensorflow'
import keras_core as keras

The text was updated successfully, but these errors were encountered:

sampathweb · 2023-08-03T23:07:21Z

Yes. I confirm the problem of accuracies > 1 in TPU env with keras-core backend. The notebook works fine in GPU env with keras-core backend.

fchollet transferred this issue from keras-team/keras-core Sep 22, 2023

sachinprasadhs added the type:Bug label Apr 23, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Bert training with TPU does not work on Keras Core #18422

Bert training with TPU does not work on Keras Core #18422

martin-gorner commented Aug 3, 2023 •

edited

sampathweb commented Aug 3, 2023 •

edited

Bert training with TPU does not work on Keras Core #18422

Bert training with TPU does not work on Keras Core #18422

Comments

martin-gorner commented Aug 3, 2023 • edited

sampathweb commented Aug 3, 2023 • edited

martin-gorner commented Aug 3, 2023 •

edited

sampathweb commented Aug 3, 2023 •

edited