[ROCm] Skipping subtests that check support for float64 type in the NN ops #30500

deven-amd · 2019-07-08T17:40:57Z

The ROCm platform currently does not support the float64/double type in the NN ops.

This commit skips the subtests (within the Python unit tests) that exercise this functionality. The skip is guarded by a call to `is_built_with_rocm()`, so these unit tests are not affected in any way when running a TF build that lacks ROCm support (i.e. one not built with `--config=rocm`).
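As a rough illustration of the guard described above, here is a minimal sketch. It is not the code from `nn_test.py`: the helper name `nn_test_dtypes` is hypothetical, the `built_with_rocm` argument stands in for the result of `is_built_with_rocm()`, and plain strings stand in for the `dtypes.*` constants so the snippet runs without TensorFlow.

```python
def nn_test_dtypes(built_with_rocm):
    """Return the dtype names a subtest loop should iterate over.

    `built_with_rocm` is a stand-in for the value returned by
    is_built_with_rocm(); the strings are stand-ins for dtypes.float32 etc.
    """
    dtype_names = ["float32", "float16"]
    if not built_with_rocm:
        # float64 subtests only run on builds without ROCm support.
        dtype_names.append("float64")
    return dtype_names


if __name__ == "__main__":
    for rocm in (False, True):
        print("built_with_rocm =", rocm, "->", nn_test_dtypes(rocm))
```

On a non-ROCm build the full set of dtypes is still covered, which is why the change is expected to be a no-op for CUDA runs apart from the iteration order.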

@tatianashp @whchung @chsigg

tensorflow/python/ops/nn_test.py

whchung

@deven-amd minor cosmetic changes are required.

deven-amd · 2019-07-10T14:27:55Z

I looked at the CI failure logs, and they do not seem related to the changes in this PR.

The one failure in the Linux GPU run is in a test that was modified by this PR. However, the changes in this PR should be a no-op for the CUDA / Linux GPU run, so it does not seem likely that this PR is the cause of the failure.

rthadur · 2019-07-10T17:41:27Z

@deven-amd here is the internal error:
InternalError: 2 root error(s) found.
  (0) Internal: cuDNN Backward Data function launch failure : input shape([1,5,8,7,2]) filter shape([1,2,3,2,3])
    [[node gradients_5/conv_5_grad/Conv3DBackpropInputV2 (defined at /third_party/py/absl/third_party/unittest3_backport/case.py:162) ]]
  (1) Internal: cuDNN Backward Data function launch failure : input shape([1,5,8,7,2]) filter shape([1,2,3,2,3])
    [[node gradients_5/conv_5_grad/Conv3DBackpropInputV2 (defined at /third_party/py/absl/third_party/unittest3_backport/case.py:162) ]]
    [[gradients_5/conv_5_grad/Conv3DBackpropInputV2/_19]]

deven-amd · 2019-07-10T18:02:31Z

@rthadur

I saw the error in the invocation log, but cannot see how the change in this PR could cause it.

For the CUDA run, the only thing that changes with this PR is the ordering of the dtypes in the list: it goes from [dtypes.float64, dtypes.float32, dtypes.float16] to [dtypes.float32, dtypes.float16, dtypes.float64]. The two are functionally equivalent and should not cause any failure (I would think).

Let me push out a change that keeps the original ordering intact, and see if that fixes the error!

deven
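For readers following along, one way the original ordering could be preserved while still dropping float64 on ROCm is sketched below. This is an assumption about the shape of the fix, not the actual follow-up commit: the helper name `dtypes_preserving_order` is hypothetical, `built_with_rocm` stands in for the result of `is_built_with_rocm()`, and strings stand in for the `dtypes.*` constants.

```python
def dtypes_preserving_order(built_with_rocm):
    """Filter float64 out on ROCm builds without reordering the list.

    `built_with_rocm` is a stand-in for is_built_with_rocm(); the strings
    are stand-ins for dtypes.float64, dtypes.float32 and dtypes.float16.
    """
    original_order = ["float64", "float32", "float16"]
    if built_with_rocm:
        # Drop only the unsupported dtype; the rest keep their positions.
        return [name for name in original_order if name != "float64"]
    # Non-ROCm builds see exactly the list they saw before the PR.
    return list(original_order)


if __name__ == "__main__":
    print(dtypes_preserving_order(False))
    print(dtypes_preserving_order(True))
```

With this shape, a CUDA build iterates over the dtypes in the same order as before the PR, so any order-dependent behavior in the test is left untouched.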

deven-amd · 2019-07-11T14:03:09Z

@whchung, please re-approve this to kick off the CI runs.
I need to figure out whether or not my last change fixes the failure in the Linux GPU CI run.

Thanks
deven

deven-amd · 2019-07-11T16:32:35Z

@rthadur, the CI errors are gone :)

rthadur · 2019-07-11T21:30:13Z

@deven-amd thank you. @chsigg, can you please review this again?

tensorflow-bot bot added the size:M CL Change Size: Medium label Jul 8, 2019

googlebot added the cla: yes label Jul 8, 2019

whchung reviewed Jul 8, 2019

whchung requested changes Jul 8, 2019

deven-amd force-pushed the google_upstream_skip_double_dtyp_subtests branch from c8ee4e7 to d315f3b Compare July 8, 2019 18:21

whchung previously approved these changes Jul 8, 2019

tensorflow-bot bot added kokoro:force-run Tests on submitted change ready to pull PR ready for merge process labels Jul 8, 2019

whchung requested a review from chsigg July 8, 2019 18:27

kokoro-team removed the kokoro:force-run Tests on submitted change label Jul 8, 2019

deven-amd dismissed whchung’s stale review via 7276d27 July 8, 2019 18:30

deven-amd force-pushed the google_upstream_skip_double_dtyp_subtests branch from d315f3b to 7276d27 Compare July 8, 2019 18:30

rthadur self-assigned this Jul 9, 2019

rthadur added this to Assigned Reviewer in PR Queue via automation Jul 9, 2019

rthadur added the comp:gpu GPU related issues label Jul 9, 2019

chsigg previously approved these changes Jul 9, 2019

PR Queue automation moved this from Assigned Reviewer to Approved by Reviewer Jul 9, 2019

tensorflow-bot bot added the kokoro:force-run Tests on submitted change label Jul 9, 2019

kokoro-team removed the kokoro:force-run Tests on submitted change label Jul 9, 2019

deven-amd dismissed chsigg’s stale review via ca79b8d July 10, 2019 19:48

deven-amd force-pushed the google_upstream_skip_double_dtyp_subtests branch from 7276d27 to ca79b8d Compare July 10, 2019 19:48

PR Queue automation moved this from Approved by Reviewer to Reviewer Requested Changes Jul 10, 2019

whchung added the kokoro:force-run Tests on submitted change label Jul 11, 2019

kokoro-team removed the kokoro:force-run Tests on submitted change label Jul 11, 2019

rthadur requested review from chsigg and whchung July 11, 2019 21:29

rthadur removed the ready to pull PR ready for merge process label Jul 11, 2019

whchung approved these changes Jul 11, 2019

tensorflow-bot bot added kokoro:force-run Tests on submitted change ready to pull PR ready for merge process labels Jul 11, 2019

PR Queue automation moved this from Reviewer Requested Changes to Approved by Reviewer Jul 11, 2019

kokoro-team removed the kokoro:force-run Tests on submitted change label Jul 11, 2019

rthadur removed the ready to pull PR ready for merge process label Jul 11, 2019

chsigg approved these changes Jul 16, 2019

tensorflow-bot bot added kokoro:force-run Tests on submitted change ready to pull PR ready for merge process labels Jul 16, 2019

kokoro-team removed the kokoro:force-run Tests on submitted change label Jul 16, 2019

tensorflow-copybara merged commit ca79b8d into tensorflow:master Jul 18, 2019

PR Queue automation moved this from Approved by Reviewer to Merged Jul 18, 2019

tensorflow-copybara pushed a commit that referenced this pull request Jul 18, 2019

Merge pull request #30500 from ROCmSoftwarePlatform:google_upstream_s…

f1b70ad

…kip_double_dtyp_subtests PiperOrigin-RevId: 258814196

deven-amd deleted the google_upstream_skip_double_dtyp_subtests branch August 2, 2019 15:01