Eager execution breaks fit_generator in tf.keras #18287

batzner · 2018-04-06T06:16:35Z

System information

Have I written custom code (as opposed to using a stock example script provided in TensorFlow): Yes
OS Platform and Distribution (e.g., Linux Ubuntu 16.04): macOS 10.12.6
TensorFlow installed from (source or binary): binary
TensorFlow version: 1.7.0
Python version: 3.6.3
Numpy version: 1.14.2
Bazel version (if compiling from source): N/A
GCC/Compiler version (if compiling from source): N/A
CUDA/cuDNN version: not installed
GPU model and memory: CPU only
Exact command to reproduce: Run code below

Describe the problem

tf.enable_eager_execution() leads to a RuntimeError: You must compile your model before using it. when calling Keras's model.fit_generator, even if the model has already been compiled. Calling model.fit works on the other hand.

Source code / logs

Minimum reproducible test case:

import numpy as np
import tensorflow as tf

tf.enable_eager_execution()  # It works without this line

x, y = np.random.randn(100, 10), np.random.randn(100, 4)
model = tf.keras.models.Sequential([tf.keras.layers.Dense(4, input_dim=10)])
model.compile(tf.train.RMSPropOptimizer(0.001), 'mse')

model.fit(x, y)  # Fitting without a generator works in eager mode

class Iterator:
    def __next__(self):
        return x, y

model.fit_generator(Iterator(), steps_per_epoch=10)

Log:

Epoch 1/1
100/100 [==============================] - 0s 445us/step - loss: 2.1153
Traceback (most recent call last):
  File "tmp.py", line 16, in <module>
    model.fit_generator(Iterator(), steps_per_epoch=10)
  File "/Users/kilian/.pyenv/versions/3.6.3/lib/python3.6/site-packages/tensorflow/python/keras/_impl/keras/engine/sequential.py", line 860, in fit_generator
    initial_epoch=initial_epoch)
  File "/Users/kilian/.pyenv/versions/3.6.3/lib/python3.6/site-packages/tensorflow/python/keras/_impl/keras/engine/training.py", line 1603, in fit_generator
    initial_epoch=initial_epoch)
  File "/Users/kilian/.pyenv/versions/3.6.3/lib/python3.6/site-packages/tensorflow/python/keras/_impl/keras/engine/training_generator.py", line 52, in fit_generator
    model._make_train_function()
  File "/Users/kilian/.pyenv/versions/3.6.3/lib/python3.6/site-packages/tensorflow/python/keras/_impl/keras/engine/training.py", line 578, in _make_train_function
    raise RuntimeError('You must compile your model before using it.')
RuntimeError: You must compile your model before using it.

The text was updated successfully, but these errors were encountered:

tensorflowbutler · 2018-04-06T18:41:31Z

Thank you for your post. We noticed you have not filled out the following field in the issue template. Could you update them if they are relevant in your case, or leave them as N/A? Thanks.
Bazel version

batzner · 2018-04-06T18:55:29Z

Updated

facaiy · 2018-04-07T02:37:34Z

@batzner Could you use tf.keras to make a test, instead of contrib module?

facaiy · 2018-04-07T03:18:05Z

@fchollet Sounds like a problem. I checked the keras codes and found that generator seems to have not been supported in eager mode, right? Does anyone have worked on it?

batzner · 2018-04-07T06:11:38Z

I updated the code to use tf.keras instead of contrib. The output is the same as before.

tensorflowbutler · 2018-04-22T18:31:06Z

Nagging Assignee @fchollet: It has been 15 days with no activity and this issue has an assignee. Please update the label and/or status accordingly.

tensorflowbutler · 2018-05-09T18:59:58Z

Nagging Assignee @fchollet: It has been 15 days with no activity and this issue has an assignee. Please update the label and/or status accordingly.

AakashKumarNain · 2018-05-10T17:48:01Z

I faced a similar issue. When using fit_generator() in eager mode, keras throws a NotImplemented error.

fchollet · 2018-05-10T20:29:53Z

Thanks for the bug report. I have fixed the issue and the fix will soon be available in the TF nightly release.

…ensorflow#18287 PiperOrigin-RevId: 196171525

DollarAkshay · 2019-08-11T17:40:35Z

Just noticed that having tf.enable_eager_execution() enabled drastically changed the training part for fit_generator(). Here are some examples that I ran on a pre-trained VGG19 model.

Without Eager Exectution`

The loss reduces drastically after one epoch
The training accuracy goes to 67% after epoch 1 and 93% after epoch 2
The validation accuracy jumps to 80% after one epoch

With Eager Execution

The loss is stuck at 15 and reduces by a tiny amount every epoch
The training accuracy is at 4% after epoch 1 and at 5% after epoch 2
The validation accuracy is also at 4-5%

This is quite a strong difference. I am guessing that this is not expected ?

Link to Full Notebook : here

Dataset used for Training : HackerEarth ML Challenge

tensorflowbutler added the stat:awaiting response Status - Awaiting response from author label Apr 6, 2018

tensorflowbutler assigned michaelisard Apr 6, 2018

asimshankar assigned fchollet and unassigned michaelisard Apr 7, 2018

tensorflowbutler removed the stat:awaiting response Status - Awaiting response from author label Apr 7, 2018

fchollet closed this as completed May 10, 2018

yifeif pushed a commit to yifeif/tensorflow that referenced this issue May 15, 2018

Enable Model training/eval from generator in eager execution. Fixes t…

1b67ccb

…ensorflow#18287 PiperOrigin-RevId: 196171525

ahundt mentioned this issue Jun 22, 2018

Bug Report: Keras crashes with tf.enable_eager_execution() keras-team/keras#10500

Closed

2 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Eager execution breaks fit_generator in tf.keras #18287

Eager execution breaks fit_generator in tf.keras #18287

batzner commented Apr 6, 2018 •

edited

tensorflowbutler commented Apr 6, 2018

batzner commented Apr 6, 2018

facaiy commented Apr 7, 2018 •

edited

facaiy commented Apr 7, 2018

batzner commented Apr 7, 2018

tensorflowbutler commented Apr 22, 2018

tensorflowbutler commented May 9, 2018

AakashKumarNain commented May 10, 2018 •

edited

fchollet commented May 10, 2018

DollarAkshay commented Aug 11, 2019

Eager execution breaks fit_generator in tf.keras #18287

Eager execution breaks fit_generator in tf.keras #18287

Comments

batzner commented Apr 6, 2018 • edited

System information

Describe the problem

Source code / logs

tensorflowbutler commented Apr 6, 2018

batzner commented Apr 6, 2018

facaiy commented Apr 7, 2018 • edited

facaiy commented Apr 7, 2018

batzner commented Apr 7, 2018

tensorflowbutler commented Apr 22, 2018

tensorflowbutler commented May 9, 2018

AakashKumarNain commented May 10, 2018 • edited

fchollet commented May 10, 2018

DollarAkshay commented Aug 11, 2019

batzner commented Apr 6, 2018 •

edited

facaiy commented Apr 7, 2018 •

edited

AakashKumarNain commented May 10, 2018 •

edited