tf.contrib.data.prefetch_to_device not compatible with tf.data.Iterator.from_structure #19244

RunOrVeith · 2018-05-12T11:00:33Z

System information

Have I written custom code (as opposed to using a stock example script provided in TensorFlow): Yes
OS Platform and Distribution (e.g., Linux Ubuntu 16.04): Linux Ubuntu 16.04
TensorFlow installed from (source or binary): tensorflow-gpu binary
Bazel Version: N/A
TensorFlow version (use command below): v1.8.0-0-g93bc2e2072 1.8.0
Python version: 3.6.3
CUDA/cuDNN version: CUDA 9.0 cuDNN 7.0.3
GPU model and memory: GTX 1070 8 GB VRAM
Exact command to reproduce:

import tensorflow as tf

class MyData(object):
    def __call__(self):
        return range(100)

expected_shapes = []
expected_types = tf.int32
iterator = tf.data.Iterator.from_structure(output_types=expected_types, output_shapes=expected_shapes)
dataset = tf.data.Dataset.from_generator(MyData(), output_types=expected_types, output_shapes=expected_shapes)

prefetch_op = tf.contrib.data.prefetch_to_device(device="/gpu:0")
dataset = dataset.apply(prefetch_op)
initializer = iterator.make_initializer(dataset)

Describe the problem

This raises NotImplementedError: prefetch_to_device() must be the last transformation in a dataset pipeline.

Applying prefetch_to_device after the initializer has been created does not help either, because dataset.apply returns a new dataset instead of modifying the existing one in place.

Reading through this test case makes it clear that prefetch_to_device works when the iterator is created directly from the dataset.

It is not clear from the documentation of make_initializer that this function is a transformation of the dataset and thus counts as an additional step after prefetching.
I am not sure whether this is a bug, an oversight, or a known unimplemented case.

Proposed short term solution:

Mention in the documentation of prefetch_to_device that it is not supported in combination with make_initializer.
Mention in the documentation of make_initializer that this operation modifies the dataset
(although I don't think this is the correct choice of words; the issue is the call to dataset._as_variant_tensor in make_initializer, line 308 of iterator_ops.py).
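
To make the mechanism concrete, here is a minimal pure-Python sketch of what the stack trace suggests is happening. It does not use TensorFlow at all: ToyDataset, ToyPrefetchToDevice, ToyIterator and toy_prefetch_to_device are hypothetical stand-ins invented for illustration, and the string return values are placeholders for real tensors and ops. Only the control flow (make_initializer asks the dataset for _as_variant_tensor, which the prefetching wrapper refuses) is taken from the trace.

```python
class ToyDataset(object):
    """Hypothetical stand-in for a dataset; not a TensorFlow class."""

    def __init__(self, description):
        self.description = description

    def apply(self, transformation):
        # Returns a new object and leaves `self` untouched.
        return transformation(self)

    def _as_variant_tensor(self):
        # Placeholder string instead of a real variant tensor.
        return "variant(%s)" % self.description


class ToyPrefetchToDevice(ToyDataset):
    """Stand-in for the wrapper dataset that prefetch_to_device() produces."""

    def __init__(self, input_dataset, device):
        super(ToyPrefetchToDevice, self).__init__(
            "prefetch_to_device(%s, %s)" % (input_dataset.description, device))

    def _as_variant_tensor(self):
        # Mirrors the error message shown in the stack trace.
        raise NotImplementedError(
            "`prefetch_to_device()` must be the last transformation "
            "in a dataset pipeline.")


class ToyIterator(object):
    """Stand-in for an iterator created with from_structure."""

    def make_initializer(self, dataset):
        # Like the call site in the trace, this needs the dataset's
        # variant representation, so it acts as one more pipeline step.
        return "init(%s)" % dataset._as_variant_tensor()


def toy_prefetch_to_device(device):
    """Stand-in for prefetch_to_device(); returns a function for apply()."""
    def _apply_fn(dataset):
        return ToyPrefetchToDevice(dataset, device)
    return _apply_fn


base = ToyDataset("range(100)")
prefetched = base.apply(toy_prefetch_to_device("/gpu:0"))
iterator = ToyIterator()

# Initializing from the plain dataset works.
plain_init = iterator.make_initializer(base)

# Initializing from the prefetched dataset hits the unimplemented method.
try:
    iterator.make_initializer(prefetched)
    failed = False
    message = ""
except NotImplementedError as error:
    failed = True
    message = str(error)
```

In this toy model, apply leaves base unchanged, which is why prefetching cannot be added after the initializer has been built from base.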

Proposed longterm solution:

This is already a TODO in line 289 of prefetching_ops.py:
Implement _as_variant_tensor for _PrefetchToDeviceDataset.

Reason why this is needed:

Creating the data pipeline with from_structure and make_initializer makes it possible to switch the network's input source dynamically, e.g. between the training and test sets after an epoch, without having to rebuild the graph or fall back to feed dicts.

Source code / logs

Exact stack trace of the error:

Traceback (most recent call last):
  File "test.py", line 14, in <module>
    initializer = iterator.make_initializer(dataset)
  File "/home/veith/.pyenv/versions/3.6.3/lib/python3.6/site-packages/tensorflow/python/data/ops/iterator_ops.py", line 308, in make_initializer
    dataset._as_variant_tensor(), self._iterator_resource, name=name)  # pylint: disable=protected-access
  File "/home/veith/.pyenv/versions/3.6.3/lib/python3.6/site-packages/tensorflow/contrib/data/python/ops/prefetching_ops.py", line 291, in _as_variant_tensor
    raise NotImplementedError("`prefetch_to_device()` must be the last "
NotImplementedError: `prefetch_to_device()` must be the last transformation in a dataset pipeline.

tensorflowbutler · 2018-05-12T18:30:06Z

Thank you for your post. We noticed you have not filled out the following field in the issue template. Could you update them if they are relevant in your case, or leave them as N/A? Thanks.
Bazel version

tensorflowbutler · 2018-05-27T12:35:55Z

It has been 14 days with no activity and the awaiting response label was assigned. Is this still an issue?

RunOrVeith · 2018-05-28T15:00:30Z

Yes, this is still an issue.

facaiy · 2018-05-29T02:13:02Z

cc @mrry, who I think is the authority on this issue.

mrry · 2018-05-29T02:48:19Z

@rohan100jain is working on a solution to this.

tensorflowbutler · 2018-06-12T19:14:33Z

Nagging Assignee @rohan100jain: It has been 14 days with no activity and this issue has an assignee. Please update the label and/or status accordingly.

tensorflowbutler · 2018-06-27T18:50:00Z

Nagging Assignee @rohan100jain: It has been 29 days with no activity and this issue has an assignee. Please update the label and/or status accordingly.

tensorflowbutler · 2018-07-12T19:08:03Z

Nagging Assignee @rohan100jain: It has been 44 days with no activity and this issue has an assignee. Please update the label and/or status accordingly.

rohan100jain · 2018-07-25T14:34:55Z

Please use CopyToDevice + Prefetch instead of using prefetch_to_device directly.

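For example:

# Module path taken from the stack trace above; assumes a release that provides copy_to_device.
from tensorflow.contrib.data.python.ops import prefetching_ops
ds = ...
ds = ds.apply(prefetching_ops.copy_to_device("/gpu:0")).prefetch(1)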

This should give you a regular dataset and all the other iterator / dataset support that comes with it.

tensorflowbutler assigned rohan100jain May 12, 2018

tensorflowbutler added the stat:awaiting response Status - Awaiting response from author label May 12, 2018

tensorflowbutler removed the stat:awaiting response Status - Awaiting response from author label May 29, 2018

rohan100jain added type:support Support issues stat:awaiting response Status - Awaiting response from author labels Jul 26, 2018

rohan100jain closed this as completed Jul 26, 2018
