[TF 2.0 API Docs] Docstring for tf.train.experimental.enable_mixed_precision_graph_rewrite #29249

tlkh · 2019-06-01T13:27:14Z

In response to #29241

Improved the docstring for tf.train.experimental.enable_mixed_precision_graph_rewrite:

Added example for using function
Added colab notebook to demonstrate speed-up without performance penalty
Added original graphic for loss scaling (source)
Added more information about graph rewrite operation
Added performance guide
Added exception information
Added more clarification to loss_scale argument
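The loss-scaling idea listed above can be illustrated without TensorFlow. This is a minimal sketch using only Python's standard library: the `to_fp16` helper, the gradient value, and the fixed loss scale are illustrative assumptions, not what the graph rewrite does internally.

```python
import struct


def to_fp16(x: float) -> float:
    """Round a Python float to the nearest IEEE 754 half-precision value."""
    return struct.unpack("<e", struct.pack("<e", x))[0]


grad = 1e-8             # illustrative small gradient, below FP16's smallest subnormal
loss_scale = 2.0 ** 15  # illustrative fixed scale; dynamic loss scaling adjusts this

unscaled = to_fp16(grad)             # underflows to zero in FP16
scaled = to_fp16(grad * loss_scale)  # scaled value is representable in FP16
recovered = scaled / loss_scale      # unscale in higher precision

print(unscaled, scaled, recovered)
```

Scaling the loss scales every gradient by the same factor, so small gradients survive the FP16 backward pass and are divided back down before the weight update.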

A gist with the rendered docstring is here for ease of review.

Thank you, any feedback or criticism is welcome.

perfinion

Great work! Just a few small things to tweak :)

tensorflow/python/training/experimental/mixed_precision.py

perfinion · 2019-06-01T14:04:34Z

Can you change the colours in the image as well? On my monitor the green and blue look fairly similar, so it's hard to distinguish them.

tlkh · 2019-06-01T14:30:52Z

I have made all the requested changes!

tensorflow/python/training/experimental/mixed_precision.py

perfinion · 2019-06-01T14:46:47Z

@rthadur Can you make sure the links are updated when this is merged?

tlkh · 2019-06-01T15:44:17Z

Fixed the pylint 80 char line limit errors.

tensorflow/python/training/experimental/mixed_precision.py

tlkh · 2019-06-02T07:41:20Z

Pushed the amended commit with the requested changes.

recrusader · 2019-07-01T18:57:41Z

@tlkh, when "enable_mixed_precision_graph_rewrite" is enabled, which data type should the network use when the model is defined?
Still FP32? If so, how can I output the FP16 trained model? Thanks!

tlkh · 2019-07-02T18:55:35Z

> Thanks a lot, tlkh. However, if FP16 trained network cannot be saved, it is difficult to use the network outside tensorflow after finishing the training.
> In addition, it seems that ops conversion from FP32 to FP16 can happen on new GPU, like Volta. TF cannot do this conversion for old one.

If you wish to use the network outside of TensorFlow for inference in FP16, typically just converting all the datatypes to FP16 will work. Special considerations are needed only for training. There are other toolkits to help with inference optimization outside of TensorFlow, such as TensorRT.
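To give a feel for what "converting all the datatypes to FP16" costs in precision, here is a minimal stdlib-only sketch. The `cast_to_fp16` helper and the weight values are made up for illustration; a real export would cast tensors with the framework or toolkit in use.

```python
import struct

FP16_MAX = 65504.0  # largest finite half-precision value


def cast_to_fp16(values):
    """Round each float to half precision, rejecting values outside the FP16 range."""
    out = []
    for v in values:
        if abs(v) > FP16_MAX:
            raise OverflowError(f"{v} does not fit in FP16")
        out.append(struct.unpack("<e", struct.pack("<e", v))[0])
    return out


weights_fp32 = [0.125, -1.5, 3.14159, 1000.1]  # made-up weights
weights_fp16 = cast_to_fp16(weights_fp32)

# For values in the normal FP16 range, the relative rounding error is at most 2**-11.
max_rel_err = max(abs(a - b) / abs(a) for a, b in zip(weights_fp32, weights_fp16))
print(weights_fp16, max_rel_err)
```

Whether this rounding error is acceptable depends on the model, so accuracy should be checked after the cast.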

Only Volta and Turing GPUs have the FP16 units and Tensor Cores that can benefit from mixed precision. Older GPUs will not see a benefit, hence the feature is not enabled for them.

@recrusader let's move this discussion to a new GitHub issue if you wish to raise any concerns or suggestions. Keep this thread for discussion specifically about this PR.
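The point about GPU generations can be phrased as a check on CUDA compute capability. This is a sketch under stated assumptions: `has_tensor_cores` is a hypothetical helper, not a TensorFlow API, and the caller is assumed to already know the device's (major, minor) capability. Volta reports 7.0 and Turing 7.5, while Pascal reports 6.x.

```python
def has_tensor_cores(major: int, minor: int) -> bool:
    """Return True for compute capability 7.0 or newer (hypothetical helper).

    Tensor Cores were introduced with Volta, which reports capability 7.0.
    """
    return (major, minor) >= (7, 0)


# Example capabilities: Pascal (6.1), Volta (7.0), Turing (7.5).
for name, capability in [("Pascal", (6, 1)), ("Volta", (7, 0)), ("Turing", (7, 5))]:
    print(name, has_tensor_cores(*capability))
```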

recrusader · 2019-07-02T19:04:23Z

I asked this question about the official model, but no one could give me an answer. Thank you very much! I think I have my answer now.

tlkh · 2019-08-10T05:46:14Z

@perfinion @martinwicke requesting review once again.

Changes made:

The image and example notebook are now hosted by NVIDIA
The docstring wording has been improved and updated/merged with recent changes from master
A docstring has been added to both enable_mixed_precision_graph_rewrite and enable_mixed_precision_graph_rewrite_v1, with the correct distinction made about which kinds of Optimizer each accepts

Thank you!

martinwicke

@MarkDaoust will this render correctly? I see there's no indent on the Args and Returns sections; I'm wondering whether the indent is required.

tlkh · 2019-08-12T17:27:57Z

@martinwicke thanks for pointing that out. Let me fix that anyway.
EDIT: fixed the indent!

MarkDaoust

Hi @tlkh,

It looks like you misunderstood @martinwicke's comment.

All the Args/Returns/Raises blocks need to be indented, or they will not be recognized by our linters, and will render incorrectly on tensorflow.org (They will be interpreted as markdown which will just flatten them into a single paragraph).

Thanks.
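For reference, a docstring with indented Args/Returns/Raises blocks looks roughly like the sketch below. `scale_loss` is a hypothetical function written only to show the layout (two-space indents, as in the TensorFlow code base); it is not part of this PR.

```python
import inspect


def scale_loss(loss, loss_scale=2.0 ** 15):
  """Multiplies a loss value by a loss scale (hypothetical helper).

  Args:
    loss: A float, the unscaled loss.
    loss_scale: A positive float. Defaults to 2 ** 15.

  Returns:
    The scaled loss as a float.

  Raises:
    ValueError: If `loss_scale` is not positive.
  """
  if loss_scale <= 0:
    raise ValueError("loss_scale must be positive")
  return loss * loss_scale


# inspect.getdoc strips the common leading indent but keeps the nested one,
# so each entry stays indented under its "Args:" / "Returns:" / "Raises:" header.
doc = inspect.getdoc(scale_loss)
print(doc)
```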

tensorflow/python/training/experimental/mixed_precision.py

tlkh · 2019-08-13T06:16:37Z

Sorry about that! I fixed it.

tlkh · 2019-08-13T15:41:35Z

> Hi tlkh,
>
> It looks like you misunderstood martinwicke's comment.
>
> All the Args/Returns/Raises blocks need to be indented, or they will not be recognized by our linters, and will render incorrectly on tensorflow.org (They will be interpreted as markdown which will just flatten them into a single paragraph).
>
> Thanks.

@MarkDaoust Hmm, I've addressed the changes, but I'm not sure why GitHub is still showing "1 change requested". Do you need to approve it as well?

Edit: never mind

tensorflow-bot bot added the size:M CL Change Size: Medium label Jun 1, 2019

googlebot added the cla: yes label Jun 1, 2019

tlkh mentioned this pull request Jun 1, 2019

[TF 1.14 API Docs] tf.train.experimental.enable_mixed_precision_graph_rewrite #29241

Closed

perfinion changed the title ~~Docstring for tf.train.experimental.enable_mixed_precision_graph_rewrite~~ [TF 2.0 API Docs] Docstring for tf.train.experimental.enable_mixed_precision_graph_rewrite Jun 1, 2019

perfinion added the type:docs-bug Document issues label Jun 1, 2019

perfinion requested changes Jun 1, 2019

tlkh force-pushed the master branch 2 times, most recently from 1129f0a to 85b7da7 Compare June 1, 2019 14:25

perfinion reviewed Jun 1, 2019

tensorflow/python/training/experimental/mixed_precision.py

perfinion previously approved these changes Jun 1, 2019

tensorflow-bot bot added kokoro:force-run Tests on submitted change ready to pull PR ready for merge process labels Jun 1, 2019

kokoro-team removed the kokoro:force-run Tests on submitted change label Jun 1, 2019

perfinion assigned rthadur Jun 1, 2019

tlkh dismissed perfinion’s stale review via 1d0c165 June 1, 2019 15:08

tlkh force-pushed the master branch 2 times, most recently from 1d0c165 to 025c463 Compare June 1, 2019 15:35

rthadur requested a review from perfinion June 1, 2019 17:06

martinwicke reviewed Jun 1, 2019

tensorflow/python/training/experimental/mixed_precision.py

tlkh force-pushed the master branch from 025c463 to 02e675a Compare June 2, 2019 07:38

perfinion added the kokoro:force-run Tests on submitted change label Jun 2, 2019

kokoro-team removed the kokoro:force-run Tests on submitted change label Jun 2, 2019

rthadur requested a review from martinwicke June 3, 2019 17:48

rthadur removed the ready to pull PR ready for merge process label Jul 8, 2019

tlkh mentioned this pull request Jul 19, 2019

Best practice of using tensor-core on TensorFlow r1.12 #30729

Closed

tlkh mentioned this pull request Aug 9, 2019

Added demo notebook for AMP (image classification) NVIDIA/DeepLearningExamples#151

Merged

tlkh closed this Aug 10, 2019

tlkh force-pushed the master branch from 02e675a to ec2d7ca Compare August 10, 2019 04:33

tlkh reopened this Aug 10, 2019

tlkh force-pushed the master branch from ea68c03 to 0579f5f Compare August 10, 2019 05:44

tlkh force-pushed the master branch 3 times, most recently from 02d2eea to 75f7b2b Compare August 10, 2019 05:54

martinwicke reviewed Aug 12, 2019

tlkh force-pushed the master branch from 75f7b2b to a07b6e0 Compare August 12, 2019 17:42

MarkDaoust requested changes Aug 12, 2019

Improved docstring for tf.train.experimental.enable_mixed_precision_graph_rewrite

ca9e667

tlkh force-pushed the master branch from a07b6e0 to ca9e667 Compare August 13, 2019 06:14

martinwicke approved these changes Aug 13, 2019

tensorflow-bot bot added kokoro:force-run Tests on submitted change ready to pull PR ready for merge process labels Aug 13, 2019

kokoro-team removed the kokoro:force-run Tests on submitted change label Aug 13, 2019

tensorflow-copybara merged commit ca9e667 into tensorflow:master Aug 13, 2019
