-
Notifications
You must be signed in to change notification settings - Fork 91
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
fix hcc linking error caused by __fdividef #25
Conversation
Can one of the admins verify this patch? |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Would the proposed change be applicable on CUDA path?
ok to test |
Not tested yet -- this is a perfect need from NV CI.. |
@parallelo It seems "ok to test" doesn't really trigger the test.... guess I'm not admin? |
Jenkins: ok to test |
Let me check the Jenkins logs for the blocker.. |
Jenkins: add to whitelist |
Jenkins logs show:
Jenkins: test this please |
Okay, all set. Next: figure out why the group whitelist isn't working properly with this Jenkins plugin. |
@@ -50,7 +50,7 @@ __global__ void InvVarianceToVarianceKernel(int nthreads, double epsilon, | |||
int sample_size, T* variance) { | |||
GPU_1D_KERNEL_LOOP(index, nthreads) { | |||
T inv_var = variance[index]; | |||
T var = __fdividef(1, inv_var * inv_var) - T(epsilon); |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
__fdividef()
has some magic tricks in CUDA as it appears. I'd recommend using #if / #else to switch logic between CUDA and ROCm before we figure this out.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
__fdividef() has some magic tricks in CUDA as it appears. I'd recommend using #if / #else to switch logic between CUDA and ROCm before we figure this out.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
// instead of #?
fix hcc linking error caused by __fdividef
This commit fixes linking error with recent HIP updates on, thanks to @AlexVlx .