
Conversation

@asimshankar (Contributor):

Using FixedLengthRecordDataset also provides an opportunity to use the same input pipeline code for the TPU demos (https://github.com/tensorflow/tpu-demos/tree/42a987e/cloud_tpu/models/mnist) without having to convert the raw data to TFRecords (a sketch of the approach follows the change summary below).

- Prior to this change, the use of tf.data.Dataset essentially embedded
  the entire training/evaluation dataset into the graph as a constant,
  leading to unnecessarily humongous graphs (Fixes #3017)
- Also, use batching on the evaluation dataset to allow
  evaluation on GPUs that cannot fit the entire evaluation dataset in
  memory (Fixes #3046)
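
For readers new to the API, here is a minimal sketch of the idea, assuming the standard MNIST IDX file names, record sizes, and headers, and the TF 1.4-era tf.data API; it is an illustration, not the PR's exact code:

import tensorflow as tf

def decode_image(record):
  # Each record is one 28x28 uint8 image; the 16-byte IDX header is
  # skipped via header_bytes below.
  image = tf.decode_raw(record, tf.uint8)
  return tf.cast(tf.reshape(image, [784]), tf.float32) / 255.0

def decode_label(record):
  # Each label is a single byte after an 8-byte IDX header.
  return tf.reshape(tf.to_int32(tf.decode_raw(record, tf.uint8)), [])

images = tf.data.FixedLengthRecordDataset(
    'train-images-idx3-ubyte', record_bytes=784, header_bytes=16).map(decode_image)
labels = tf.data.FixedLengthRecordDataset(
    'train-labels-idx1-ubyte', record_bytes=1, header_bytes=8).map(decode_label)
ds = tf.data.Dataset.zip((images, labels))

Because the records are fixed-size byte strings read straight from the raw files, the same pipeline can feed CPUs, GPUs, and TPUs without a TFRecord conversion step.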
@asimshankar asimshankar requested review from mrry and nealwu January 2, 2018 22:04
@asimshankar asimshankar requested a review from k-w-w as a code owner January 2, 2018 22:04
@mrry (Contributor) left a comment:

Just a couple of documentation nits....



def maybe_download(directory, filename):
  """Download a file from the MNIST dataset, if it doesn't already exist."""
Contributor:

Perhaps mention that this gunzips the file as well?

Contributor Author:

Done

  ds = ds.cache().shuffle(buffer_size=50000).batch(FLAGS.batch_size).repeat(
      FLAGS.train_epochs)
- (images, labels) = dataset.make_one_shot_iterator().get_next()
+ (images, labels) = ds.make_one_shot_iterator().get_next()
Contributor:

While we're in here, would it make sense to switch to the new style of returning a Dataset directly? (Or perhaps, since 1.5 hasn't landed yet, we should have a TODO to make that switch?)

(Same applies to eval_input_fn() below.)
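
For concreteness, the switch being discussed would look roughly like this; the dataset.train() helper is assumed from this file's module (its sibling dataset.test() appears in a hunk below):

# Old style: the input_fn builds a one-shot iterator and returns tensors.
def train_input_fn():
  ds = dataset.train(FLAGS.data_dir).cache().shuffle(
      buffer_size=50000).batch(FLAGS.batch_size).repeat(FLAGS.train_epochs)
  return ds.make_one_shot_iterator().get_next()

# New style (TF 1.5+): return the Dataset itself and let the Estimator
# create the iterator internally.
def train_input_fn():
  return dataset.train(FLAGS.data_dir).cache().shuffle(
      buffer_size=50000).batch(FLAGS.batch_size).repeat(FLAGS.train_epochs)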

Contributor Author:

Yup, was waiting for 1.5 to land.
Tempted to avoid TODOs in these "best practices" samples, unless you feel strongly.

Contributor:

Fair enough!

@k-w-w (Contributor) commented Jan 2, 2018:

@mrry Does Dataset.cache() cache all the examples into memory (which would defeat the purpose of trying to use less memory)?

@asimshankar (Contributor Author):

@k-w-w - It caches them in CPU memory, not GPU memory. (That said, we could remove the use of cache() here).

@k-w-w (Contributor) commented Jan 2, 2018:

@asimshankar ohh, I see. Thanks for the response!

@mrry (Contributor) commented Jan 2, 2018:

Even with caching it will tend to use less memory, because the payloads of tf.constant() ops end up being stored multiple times in RAM (something that would be good to fix independently...). While you could disable caching, that moves the reading and parsing onto the critical path, and there's a good chance it would make the training process I/O-bound.
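
To illustrate the trade-off being described, assuming the pipeline shown above:

# In-memory cache: after the first pass, parsed examples are served from
# host (CPU) RAM, keeping file I/O and parsing off the critical path.
ds = ds.cache()

# File-backed alternative if host RAM is tight (path is illustrative):
# ds = ds.cache(filename='/tmp/mnist_cache')

Note that cache() comes before shuffle(), so the cached contents are fixed while the shuffle order still changes between epochs.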

@k-w-w (Contributor) left a comment:

Thanks for addressing these issues!

@k-w-w (Contributor) commented Jan 2, 2018:

@mrry good to know, thanks! (looking forward to the dataset performance guide to learn more)

@mrry (Contributor) commented Jan 2, 2018:

@k-w-w On that topic, you can get a sneak peek at the guide here:

https://github.com/tensorflow/tensorflow/blob/master/tensorflow/docs_src/performance/datasets_performance.md

It should be on tensorflow.org once the 1.5 release is published.
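
For a flavor of what the guide covers, its headline advice is to overlap preprocessing with training, e.g. (values illustrative, not tuned; decode_image as in the sketch above):

# Parallelize the decode step and prefetch ahead of the training loop.
ds = ds.map(decode_image, num_parallel_calls=4).prefetch(buffer_size=1)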

@nealwu (Contributor) left a comment:

Looks good! Just a few comments.

@@ -0,0 +1,112 @@
# Copyright 2018 The TensorFlow Authors. All Rights Reserved.
Contributor:

2018! Nice :)

f.name))


def maybe_download(directory, filename):
Contributor:

nit: I don't like the name maybe_download since the 'maybe' part seems very ambiguous. Let's call this attempt_download or just download instead?

Contributor Author:

Renamed to download.

  url = 'https://storage.googleapis.com/cvdf-datasets/mnist/' + filename + '.gz'
  zipped_filename = filename + '.gz'
  zipped_filepath = os.path.join(directory, zipped_filename)
  tf.contrib.learn.datasets.base.maybe_download(zipped_filename, directory, url)
Contributor:

Is there a way we can do this without contrib?

Contributor Author:

Changed to use urllib.request.urlretrieve. This means that the retry logic implemented in
https://github.com/tensorflow/tensorflow/blob/master/tensorflow/contrib/learn/python/learn/datasets/base.py#L189 no longer applies, but I suspect that this retry business is less relevant now that we're using a CVDF mirror.

Contributor:

Sounds good!
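
Putting the thread's resolution together, a self-contained sketch of a contrib-free download helper (Python 3 shown for brevity; the merged code may differ, e.g. by using six for Python 2 compatibility):

import gzip
import os
import shutil
import urllib.request

def download(directory, filename):
  """Download (and gunzip) a file from the MNIST dataset, if needed."""
  filepath = os.path.join(directory, filename)
  if os.path.exists(filepath):
    return filepath
  if not os.path.exists(directory):
    os.makedirs(directory)
  url = 'https://storage.googleapis.com/cvdf-datasets/mnist/' + filename + '.gz'
  zipped_filepath = filepath + '.gz'
  # Unlike tf.contrib.learn.datasets.base.maybe_download, no retry logic.
  urllib.request.urlretrieve(url, zipped_filepath)
  with gzip.open(zipped_filepath, 'rb') as f_in, open(filepath, 'wb') as f_out:
    shutil.copyfileobj(f_in, f_out)
  os.remove(zipped_filepath)
  return filepath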

  def eval_input_fn():
-   return eval_dataset(FLAGS.data_dir).make_one_shot_iterator().get_next()
+   return dataset.test(FLAGS.data_dir).batch(
+       FLAGS.batch_size).make_one_shot_iterator().get_next()
Contributor:

nit: I don't think this is the right code style. Can we split this onto two lines instead?

Contributor Author:

This was the result of running the Python formatter (https://github.com/google/yapf), so it should be right? :)

Contributor:

That's surprising; the part that seemed weird to me was indenting the arguments to the next line and then following up with more function calls. If the formatter says it's good though, should be fine.

@nealwu nealwu changed the title from "[mnist]: Use FixedLengthRecordDatatest" to "[mnist]: Use FixedLengthRecordDataset" Jan 2, 2018
@asimshankar asimshankar merged commit 8e4a1e2 into tensorflow:master Jan 3, 2018
@asimshankar asimshankar deleted the mnist branch January 3, 2018 01:46
Adrrei pushed a commit to Adrrei/models that referenced this pull request Dec 16, 2018
[mnist]: Use FixedLengthRecordDataset