Be more clear about train_step and test_step #1969
Conversation
Make it clearer that `train_step` and `test_step` are also moved to the accelerator. This was not clear before; making it explicit should help people understand TPU efficiency.
Preview: preview and run these notebook edits with Google Colab. Rendered notebook diffs are available on ReviewNB.com.

Format and style: use the TensorFlow docs notebook tools to format for consistent source diffs and lint for style:

$ python3 -m pip install -U --user git+https://github.com/tensorflow/docs

If commits are added to the pull request, synchronize your local branch: git pull origin patch-1
@rchao Can you PTAL? Thanks!
@@ -508,7 +508,7 @@
     "Here's what you need to change in your code:\n",
     "\n",
     "1. Create an instance of the appropriate `tf.distribute.Strategy`.\n",
-    "2. Move the creation of Keras model, optimizer and metrics inside `strategy.scope`.\n",
+    "2. Move the creation of Keras model, optimizer and metrics inside `strategy.scope`. Thus the code in the model's run(), train_step(), and test_step() will all be distributed and executed on the accelerator(s).\n",
I'm not sure what "model's run()" refers to. Also, please backtick the symbols. Thanks!
Done. Made the references explicit by describing them as methods.
(Let me know if I need to be even more explicit.)
Thanks for the suggestion.
Thanks! `train_step()` and `test_step()` make sense to me, but I'm not sure what `run()` refers to because there is no such `run()` method on a Keras model.
Oops, I should have said call method (not run). Sorry for the tardy response... I had to look up the right terminology.
Made the references to the run, train_step and test_step methods more clear. [Do I need to be more explicit that the Keras model is a (sub) class and it does the work with the run, train_step and test_step methods, all of which might be subclassed for a specific model implementation?]
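For context, a minimal sketch of the subclassing pattern being discussed, assuming the standard Keras custom `train_step` idiom in TF 2.x; the model, layers, and losses are hypothetical placeholders:

```python
import tensorflow as tf

class MyModel(tf.keras.Model):
    """Toy subclassed model; call(), train_step() and test_step() can all be overridden."""

    def __init__(self):
        super().__init__()
        self.dense = tf.keras.layers.Dense(1)

    def call(self, inputs, training=False):
        # Forward pass used by both train_step() and test_step().
        return self.dense(inputs)

    def train_step(self, data):
        # One optimization step; it runs on the accelerator(s) when the
        # model was created under strategy.scope().
        x, y = data
        with tf.GradientTape() as tape:
            y_pred = self(x, training=True)
            loss = self.compiled_loss(y, y_pred)
        grads = tape.gradient(loss, self.trainable_variables)
        self.optimizer.apply_gradients(zip(grads, self.trainable_variables))
        self.compiled_metrics.update_state(y, y_pred)
        return {m.name: m.result() for m in self.metrics}

strategy = tf.distribute.MirroredStrategy()
with strategy.scope():
    model = MyModel()
    model.compile(optimizer="adam", loss="mse", metrics=["mae"])
```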
Ping? Is the update (using the proper terminology) still pending? I think I have pushed everything correctly...
Hi, thanks for the ping. I'll see if I can get this merged.