Skip to content

Time out error when serving models due to update from 1.8 to 1.9 #1006

@chleibig

Description

@chleibig

First of all, thx for your work in TF serving!

We recently updated both tensorflow and tensorflow serving from 1.8.0 to 1.9.0.

Now we have one model that cannot be served anymore in the sense that requesting
predictions via the REST API results in a ""Timed out waiting for notification"" error.

The results of my debugging suggest a problem on the serving side.

It is not a problem of the SavedModel, as I can get predictions from the same saved_model via:

from tensorflow.contrib import predictor

predict_fn = predictor.from_saved_model(saved_model_dir)
predictions = predict_fn({'input': ...}
...

For the record, version combinations I have tried (always using the same code):

  • train models with TF 1.8, serve with TF 1.8: all models work
  • train models with TF 1.9, serve with TF 1.9: one model fails
  • train models with TF 1.9, serve with TF 1.8: at least one model fails because it uses (internally, user code unchanged!) TF 1.9 functionality that is not available in serving TF 1.8.

Note: We use the server installed via apt-get, but I observed the same behavior with the official
TF serving docker images.

Metadata

Metadata

Assignees

No one assigned

    Labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions