fix hcc linking error caused by __fdividef #25

sunway513 · 2018-06-13T19:43:19Z

This commit fixes linking error with recent HIP updates on, thanks to @AlexVlx .

ROCm-Apps-Test · 2018-06-13T19:43:20Z

Can one of the admins verify this patch?

whchung

Would the proposed change be applicable on CUDA path?

whchung · 2018-06-13T19:48:45Z

ok to test

sunway513 · 2018-06-13T19:48:48Z

Not tested yet -- this is a perfect need from NV CI..

whchung · 2018-06-13T20:00:58Z

@parallelo It seems "ok to test" doesn't really trigger the test.... guess I'm not admin?

parallelo · 2018-06-13T20:31:46Z

Jenkins: ok to test

parallelo · 2018-06-13T20:33:08Z

Let me check the Jenkins logs for the blocker..

parallelo · 2018-06-13T20:41:33Z

Jenkins: add to whitelist

parallelo · 2018-06-13T21:00:02Z

Jenkins logs show:

Pull request #25s author has been whitelisted

Jenkins: test this please

parallelo · 2018-06-13T21:02:28Z

Okay, all set. Build triggered for merge commit.

Next: figure out why the group whitelist isn't working properly with this Jenkins plugin.

whchung · 2018-06-13T21:58:07Z

tensorflow/core/kernels/fused_batch_norm_op.cu.cc

@@ -50,7 +50,7 @@ __global__ void InvVarianceToVarianceKernel(int nthreads, double epsilon,
                                            int sample_size, T* variance) {
  GPU_1D_KERNEL_LOOP(index, nthreads) {
    T inv_var = variance[index];
-    T var = __fdividef(1, inv_var * inv_var) - T(epsilon);


__fdividef() has some magic tricks in CUDA as it appears. I'd recommend using #if / #else to switch logic between CUDA and ROCm before we figure this out.

whchung

__fdividef() has some magic tricks in CUDA as it appears. I'd recommend using #if / #else to switch logic between CUDA and ROCm before we figure this out.

whchung

// instead of #?

fix hcc linking error caused by __fdividef

fix hcc linking error caused by __fdividef

f64bd5d

sunway513 requested a review from whchung June 13, 2018 19:47

whchung reviewed Jun 13, 2018

View reviewed changes

whchung mentioned this pull request Jun 13, 2018

upstream sync 180613 #24

Merged

whchung reviewed Jun 13, 2018

View reviewed changes

Update fused_batch_norm_op.cu.cc

412496e

whchung reviewed Jun 14, 2018

View reviewed changes

whchung merged commit 5dafc62 into develop-upstream Jun 14, 2018

sunway513 pushed a commit that referenced this pull request Jun 20, 2018

Merge pull request #25 from ROCmSoftwarePlatform/fix_fdividef

8001a0c

fix hcc linking error caused by __fdividef

whchung deleted the fix_fdividef branch August 27, 2018 23:38

wormwang mentioned this pull request Jun 26, 2019

[aarch64] Run python meet core dump failure on TF 1.12.2 with rocm 2.4 #506

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix hcc linking error caused by __fdividef #25

fix hcc linking error caused by __fdividef #25

sunway513 commented Jun 13, 2018

ROCm-Apps-Test commented Jun 13, 2018

whchung left a comment

whchung commented Jun 13, 2018

sunway513 commented Jun 13, 2018

whchung commented Jun 13, 2018

parallelo commented Jun 13, 2018

parallelo commented Jun 13, 2018

parallelo commented Jun 13, 2018

parallelo commented Jun 13, 2018

parallelo commented Jun 13, 2018

whchung Jun 13, 2018

whchung left a comment

whchung left a comment

fix hcc linking error caused by __fdividef #25

fix hcc linking error caused by __fdividef #25

Conversation

sunway513 commented Jun 13, 2018

ROCm-Apps-Test commented Jun 13, 2018

whchung left a comment

Choose a reason for hiding this comment

whchung commented Jun 13, 2018

sunway513 commented Jun 13, 2018

whchung commented Jun 13, 2018

parallelo commented Jun 13, 2018

parallelo commented Jun 13, 2018

parallelo commented Jun 13, 2018

parallelo commented Jun 13, 2018

parallelo commented Jun 13, 2018

whchung Jun 13, 2018

Choose a reason for hiding this comment

whchung left a comment

Choose a reason for hiding this comment

whchung left a comment

Choose a reason for hiding this comment