Removed depreciated tensorflow dataset APIs #680

abditag2 · 2018-12-06T01:41:40Z

Some of the examples were using depreciated tensorflow APIs to load the data and these changes fix the problem.

updated the MNIST Dataset API
updated ModeKey

Issue #673

CLAassistant · 2018-12-06T01:41:47Z

All committers have signed the CLA.

tgaddair

Looks good! Thanks for putting this together so quickly. Just a handful of comments.

tgaddair · 2018-12-07T17:59:11Z

examples/tensorflow_mnist.py


 import tensorflow as tf
+import keras


Can we use tf.keras instead? Ideally, we'd like this example to work with standalone TensorFlow without requiring the user to install standalone keras as well.

I initially did that but the problem is tf.keras is not available in TF 1.1.x. I can check for the TF version in the code and import keras or tf.keras..

I would do a version check again v1.4.0 similar to what we do for horovod.tensorflow.keras:

if LooseVersion(tf.__version__) >= LooseVersion("1.4.0"): from tensorflow import keras else: from tensorflow.contrib import keras

tgaddair · 2018-12-07T17:59:19Z

examples/tensorflow_mnist.py

@@ -12,12 +12,17 @@
 # See the License for the specific language governing permissions and
 # limitations under the License.
 # ==============================================================================
-#!/usr/bin/env python
+# !/usr/bin/env python


Nit: remove leading space

tgaddair · 2018-12-07T18:00:46Z

examples/tensorflow_mnist.py

 import horovod.tensorflow as hvd
+import numpy as np
+
 layers = tf.contrib.layers


Is it possible to use tf.layers instead of tf.contrib.layers as well?

Changed this as well.

tgaddair · 2018-12-07T18:01:03Z

examples/tensorflow_mnist_estimator.py

+import os
+import shutil
+
+import keras


As above: tf.keras

same as above.

tgaddair · 2018-12-07T18:03:49Z

examples/tensorflow_mnist.py

+            x_test, y_test) = keras.datasets.mnist.load_data(
+            'MNIST-data-%d' % hvd.rank())
+
+    x_train = np.reshape(x_train, (-1, 784)) / 255


Is this safe in Python2, where integer division is the default? Maybe we can divide by 255.0 to be safe?

tgaddair · 2018-12-07T18:04:23Z

examples/tensorflow_mnist.py

+        # When running tests, if dataset is previously downloaded, it may cause
+        # the tests to fail. In this case, we need to remove the dataset cache
+        # folder first and download the dataset again.
+        cache_dir = os.path.join(os.path.expanduser('~'), '.keras')


Hmmm, is it necessary to remove the dataset every time? This seems expensive.

When running with MPI and with more than 1 process, all the processes try
to download the data and this can cause a race condition.
Multiple processes might simultaneously check if the dataset folder
exists and then try to create the folder and download the data. However,
one of them only succeeds, and the rest fail with an os IOError.

tgaddair · 2018-12-07T18:04:46Z

examples/tensorflow_mnist_estimator.py

+        (train_data, train_labels), (eval_data, eval_labels) = \
+            keras.datasets.mnist.load_data('MNIST-data-%d' % hvd.rank())
+    except OSError as ex:
+        # When running tests, if dataset is previously downloaded, it may cause


Same comment as above: is it necessary to remove all?

tgaddair · 2018-12-07T18:04:58Z

examples/tensorflow_mnist_estimator.py

+            'MNIST-data-%d' % hvd.rank())
+
+    # reshape the features and normalize them between 0 and 1
+    train_data = np.reshape(train_data, (-1, 784)) / 255


Same comment as above: division by int vs float.

alsrgv

Thanks for the PR! Few comments inline.

alsrgv · 2018-12-08T06:17:12Z

examples/tensorflow_mnist.py

 def main(_):
    # Horovod: initialize Horovod.
    hvd.init()

    # Download and load MNIST dataset.
-    mnist = learn.datasets.mnist.read_data_sets('MNIST-data-%d' % hvd.rank())
+    dataset_dir = os.path.join(os.path.dirname(os.path.realpath(__file__)),
+                            'MNIST-data-%d' % hvd.rank())


Fix formatting

alsrgv · 2018-12-08T06:18:15Z

examples/tensorflow_mnist.py

@@ -14,10 +14,20 @@
 # ==============================================================================
 #!/usr/bin/env python

+import os


Move to the next import group

alsrgv · 2018-12-08T06:18:53Z

examples/tensorflow_mnist.py

+
+from distutils.version import LooseVersion
+
+if LooseVersion(tf.__version__) >= LooseVersion("1.4.0"):


I actually think it's OK to assume reasonably fresh TF version in examples - so, just doing tf.keras should be OK (and cleaner).

And if this breaks integration tests for TF 1.1.0, we can fix them by adding code in .travis.yml to conditionally patch the import. This way users don't have to see this code.

…ace condition

tgaddair

LGTM!

abditag2 added 2 commits December 6, 2018 11:27

removed depreciated tensorflow dataset APIs from examples

28b4117

triggering test

5d22d20

tgaddair requested changes Dec 7, 2018

View reviewed changes

abditag2 added 7 commits December 7, 2018 13:31

tensorflow_mnist.py

2ed5cb5

updated tf.contrib.layers -> tf.layers and addresses comments

cd2d019

style fix

118342a

style fix

b3bbbdf

fixed dataset download race condition problem

708d619

using absoule path instead of relative path

62e9395

Added a comment about reshape

8306318

alsrgv reviewed Dec 8, 2018

View reviewed changes

abditag2 added 2 commits December 10, 2018 09:32

fixed style, created cache directory before loading data to prevent r…

aeb33aa

…ace condition

changed travis-ci config

883a236

tgaddair approved these changes Dec 10, 2018

View reviewed changes

alsrgv approved these changes Dec 10, 2018

View reviewed changes

alsrgv merged commit 7d90bd1 into horovod:master Dec 10, 2018

jeffdaily pushed a commit to ROCm/horovod that referenced this pull request Nov 27, 2019

Removed depreciated tensorflow dataset APIs (horovod#680)

5d13eb6

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Removed depreciated tensorflow dataset APIs #680

Removed depreciated tensorflow dataset APIs #680

abditag2 commented Dec 6, 2018

CLAassistant commented Dec 6, 2018 •

edited

Loading

tgaddair left a comment

tgaddair Dec 7, 2018

abditag2 Dec 7, 2018

tgaddair Dec 7, 2018

tgaddair Dec 7, 2018

abditag2 Dec 7, 2018

tgaddair Dec 7, 2018

abditag2 Dec 7, 2018

tgaddair Dec 7, 2018

abditag2 Dec 7, 2018

tgaddair Dec 7, 2018

abditag2 Dec 7, 2018

tgaddair Dec 7, 2018

abditag2 Dec 7, 2018

tgaddair Dec 7, 2018

tgaddair Dec 7, 2018

abditag2 Dec 7, 2018

alsrgv left a comment

alsrgv Dec 8, 2018

abditag2 Dec 10, 2018

alsrgv Dec 8, 2018

alsrgv Dec 8, 2018

alsrgv Dec 8, 2018 •

edited

Loading

tgaddair left a comment


		from distutils.version import LooseVersion

		if LooseVersion(tf.__version__) >= LooseVersion("1.4.0"):

Removed depreciated tensorflow dataset APIs #680

Removed depreciated tensorflow dataset APIs #680

Conversation

abditag2 commented Dec 6, 2018

CLAassistant commented Dec 6, 2018 • edited Loading

tgaddair left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

alsrgv left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

alsrgv Dec 8, 2018 • edited Loading

Choose a reason for hiding this comment

tgaddair left a comment

Choose a reason for hiding this comment

CLAassistant commented Dec 6, 2018 •

edited

Loading

alsrgv Dec 8, 2018 •

edited

Loading