
[ROCm] Fix for the broken ROCm CSB. #34104

Merged

Conversation

deven-amd
Contributor

The following commit breaks the --config=rocm build

f72695e

The above commit adds a couple of subtests that require GPU support for the `StatefulUniformFullInt` op. ROCm does not currently support that op on the GPU, which causes those subtests to fail.

The "fix" is to skip those subtests on the ROCm platform.


/cc @chsigg @whchung
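
For reference, a minimal sketch of the usual TensorFlow pattern for skipping a test on ROCm, assuming a test class derived from `tf.test.TestCase`; the class and test names below are illustrative, not the actual subtests touched by this PR:

```python
import tensorflow as tf


class RandomTransformTest(tf.test.TestCase):

  def test_random_crop_on_gpu(self):
    # StatefulUniformFullInt has no GPU kernel in ROCm builds, so skip there.
    if tf.test.is_built_with_rocm():
      self.skipTest(
          "StatefulUniformFullInt is not yet supported on the ROCm platform")
    # ... the rest of the subtest only runs on platforms that support the op.


if __name__ == "__main__":
  tf.test.main()
```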

@tensorflow-bot tensorflow-bot bot added the size:S CL Change Size: Small label Nov 8, 2019
@whchung whchung added the kokoro:force-run Tests on submitted change label Nov 8, 2019
@kokoro-team kokoro-team removed the kokoro:force-run Tests on submitted change label Nov 8, 2019
@gbaned gbaned self-assigned this Nov 11, 2019
@gbaned gbaned added the comp:keras Keras related issues label Nov 11, 2019
@gbaned gbaned added this to Assigned Reviewer in PR Queue via automation Nov 11, 2019
@gbaned gbaned added the awaiting review Pull request awaiting review label Nov 12, 2019
@deven-amd
Contributor Author

@chsigg @tanzhenyu gentle ping

Contributor

@tanzhenyu tanzhenyu left a comment


I don't understand why this is needed -- where is `StatefulUniformFullInt` used? I think the layer was only using stateless random ops.

@deven-amd
Contributor Author

The error message we get for the failure indicates that `StatefulUniformFullInt` is being instantiated in the TF graph:

tensorflow.python.framework.errors_impl.NotFoundError: No registered 'StatefulUniformFullInt' OpKernel for 'GPU' devices compatible with node {{node StatefulUniformFullInt}}
  .  Registered:  device='XLA_GPU'; shape_dtype in [DT_FLOAT, DT_DOUBLE, DT_INT32, DT_UINT8, DT_INT16, ..., DT_UINT16, DT_COMPLEX128, DT_HALF, DT_UINT32, DT_UINT64]; dtype in [DT_INT32, DT_INT64, DT_UINT32, DT_UINT64]
  device='XLA_CPU'; shape_dtype in [DT_FLOAT, DT_DOUBLE, DT_INT32, DT_UINT8, DT_INT16, ..., DT_UINT16, DT_COMPLEX128, DT_HALF, DT_UINT32, DT_UINT64]; dtype in [DT_INT32, DT_INT64, DT_UINT32, DT_UINT64]
  device='XLA_CPU_JIT'; shape_dtype in [DT_FLOAT, DT_DOUBLE, DT_INT32, DT_UINT8, DT_INT16, ..., DT_UINT16, DT_COMPLEX128, DT_HALF, DT_UINT32, DT_UINT64]; dtype in [DT_INT32, DT_INT64, DT_UINT32, DT_UINT64]
  device='CPU'; dtype in [DT_UINT64]
  device='CPU'; dtype in [DT_UINT32]
  device='CPU'; dtype in [DT_INT64]
  device='CPU'; dtype in [DT_INT32]
  device='XLA_GPU_JIT'; shape_dtype in [DT_FLOAT, DT_DOUBLE, DT_INT32, DT_UINT8, DT_INT16, ..., DT_UINT16, DT_COMPLEX128, DT_HALF, DT_UINT32, DT_UINT64]; dtype in [DT_INT32, DT_INT64, DT_UINT32, DT_UINT64]
 [Op:StatefulUniformFullInt] name: model_1/random_crop/stateful_uniform_full_int/
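
For context, a hedged repro sketch (not code from this PR): in recent TF 2.x releases, `tf.random.Generator.uniform_full_int` lowers to the `StatefulUniformFullInt` op, so pinning it to a GPU on a build without that kernel raises the `NotFoundError` above. Assumes a TF 2.x install with at least one visible GPU:

```python
import tensorflow as tf

# Assumes a TF 2.x build with at least one visible GPU device.
with tf.device("/GPU:0"):
  gen = tf.random.Generator.from_seed(1234)
  # uniform_full_int emits a StatefulUniformFullInt node; on a ROCm build
  # this raises NotFoundError because no GPU kernel is registered for the op.
  values = gen.uniform_full_int(shape=[2], dtype=tf.uint64)

print(values)
```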

@tensorflowbutler tensorflowbutler removed the awaiting review Pull request awaiting review label Nov 13, 2019
@deven-amd
Contributor Author

@chsigg @tanzhenyu gentle ping

@gbaned gbaned added the awaiting review Pull request awaiting review label Nov 21, 2019
@deven-amd
Contributor Author

@gbaned, anything we can do to help get this PR merged?

PR Queue automation moved this from Assigned Reviewer to Approved by Reviewer Nov 28, 2019
@tensorflow-bot tensorflow-bot bot added kokoro:force-run Tests on submitted change ready to pull PR ready for merge process labels Nov 28, 2019
@kokoro-team kokoro-team removed the kokoro:force-run Tests on submitted change label Nov 28, 2019
@gbaned gbaned removed the awaiting review Pull request awaiting review label Nov 28, 2019
tensorflow-copybara pushed a commit that referenced this pull request Nov 28, 2019
…ocm_fix_191108

PiperOrigin-RevId: 282922989
Change-Id: Id75e811dc0668a448800b712fe86975ac76ae991
@tensorflow-copybara tensorflow-copybara merged commit b1787be into tensorflow:master Nov 28, 2019
PR Queue automation moved this from Approved by Reviewer to Merged Nov 28, 2019
@deven-amd deven-amd deleted the google_upstream_rocm_fix_191108 branch December 20, 2019 17:15