Cuda safety fixes #198

lgritz · 2021-08-13T19:58:49Z

In many places (basically everywhere), we had the following idiom:

template<class T>
class Foo {
    IMATH_HOSTDEVICE float bar();    // declaration
};
...
template<class T> float Foo<T>::bar() { return 0.0; } // implementation

But this is wrong! When actually compiled in Cuda mode (possibly
depending on the compiler), you can get errors saying that you can't
overload a `__host__ __device__` declaration with a `__host__`-only
implementation. Which kind of makes sense: the two have to match. So
there were a WHOLE LOT of places where I had to add IMATH_HOSTDEVICE
to the out-of-class implementation.
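
As a minimal sketch of the corrected idiom (the macro definition here is a stand-in assumed for illustration, not necessarily how Imath defines it), the out-of-class implementation repeats the annotation used on the declaration:

```cpp
// Stand-in for Imath's macro (an assumption for this sketch): expands to
// __host__ __device__ under a CUDA compiler and to nothing otherwise.
#if defined(__CUDACC__)
#    define IMATH_HOSTDEVICE __host__ __device__
#else
#    define IMATH_HOSTDEVICE
#endif

template <class T>
class Foo
{
  public:
    IMATH_HOSTDEVICE float bar(); // declaration
};

// The implementation carries the same annotation as the declaration, so
// the two agree when compiled in Cuda mode.
template <class T>
IMATH_HOSTDEVICE inline float
Foo<T>::bar()
{
    return 0.0f;
}
```

In a plain host build the macro expands to nothing, so the same header compiles unchanged.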

Also, in ImathMath.h, we used this idiom:

IMATH_HOSTDEVICE IMATH_DEPRECATED("reason") float foo() { ... }

Now, this seems to work with cudacc, but when you use
`clang++ --language=cuda` to compile Cuda to PTX (it can do that!), clang
really doesn't like it when the `__host__ __device__` comes before the
`[[ deprecated("msg") ]]`; it gives an error about how the deprecated
attribute can't go there. So we have to transpose these so that the
IMATH_DEPRECATED is always the first thing in the declaration (which is
the way we almost always write it anyway).
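
A small sketch of the transposed ordering, with the deprecation attribute first. Both macro definitions and the function names `oldFoo` and `newFoo` are hypothetical stand-ins chosen for this example:

```cpp
// Stand-in macro definitions (assumptions for this sketch).
#if defined(__CUDACC__)
#    define IMATH_HOSTDEVICE __host__ __device__
#else
#    define IMATH_HOSTDEVICE
#endif
#define IMATH_DEPRECATED(msg) [[deprecated (msg)]]

// Deprecation attribute first, then the host/device annotation.
IMATH_DEPRECATED ("use newFoo() instead")
IMATH_HOSTDEVICE inline float
oldFoo()
{
    return 1.0f;
}

// Hypothetical non-deprecated replacement.
IMATH_HOSTDEVICE inline float
newFoo()
{
    return 1.0f;
}
```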

Signed-off-by: Larry Gritz <lg@larrygritz.com>

lgritz · 2021-08-13T20:35:14Z

This PR is against RB-3.1 because I noticed the problem when trying to use 3.1. When accepted, it should also be cherry-picked into master and RB-3.0. (2.x is unnecessary because it predates the HOSTDEVICE annotations.) I would very much appreciate a 3.1.3 release that incorporates these fixes (when convenient), since the Imath headers are quite broken under Cuda without it.

meshula

I hope automation helped with this! Looks thorough and consistent

lgritz · 2021-08-13T21:58:56Z

Some search & replace aid, but mostly manual, and double checking that my OSL Cuda builds against it stopped spitting out error messages. I think I got all the right spots.

cary-ilm · 2021-08-17T22:26:41Z

Other places that appear to be missing IMATH_HOSTDEVICE:

ImathShear.h line 681 (operator*)
ImathSphere.h line 97 (circumscribe)
ImathVec.h line 1422 (Vec3::Vec3)
ImathVec.h line 2068 (Vec4::lengthTiny)

I grepped for "inline" and looked for missing IMATH_HOSTDEVICE.

lgritz · 2021-08-17T22:52:15Z

Thanks for the extra pair of eyes, Cary!

cary-ilm

LGTM

* Cuda safety fixes from #198
* Some minor cuda corrections to address warnings

Signed-off-by: Larry Gritz <lg@larrygritz.com>

PR AcademySoftwareFoundation#198 added IMATH_HOSTDEVICE to the RB-3.1 branch, and PR AcademySoftwareFoundation#202 cherry-picked the change into main, but that cherry-pick somehow lost the IMATH_HOSTDEVICE on Matrix33<T>::invert(bool).

Signed-off-by: Cary Phillips <cary@ilm.com>

meshula approved these changes Aug 13, 2021

cary-ilm reviewed Aug 17, 2021

src/Imath/ImathQuat.h

cary-ilm reviewed Aug 17, 2021

src/Imath/ImathRandom.h

Add missing HOSTDEVICE

714c754

Signed-off-by: Larry Gritz <lg@larrygritz.com>

Add missing HOSTDEVICE

d59c2bf

Signed-off-by: Larry Gritz <lg@larrygritz.com>

cary-ilm approved these changes Aug 17, 2021

cary-ilm merged commit 811f6ea into AcademySoftwareFoundation:RB-3.1 Aug 17, 2021

lgritz deleted the lg-cuda branch August 25, 2021 16:40

cary-ilm mentioned this pull request Aug 26, 2021

Cuda safety fixes #202

Merged

cary-ilm mentioned this pull request Aug 30, 2021

Cherry-pick commits for v3.1.3, plus notes and version bump #205

Merged

cary-ilm added the v3.1.3 label Sep 2, 2021

cary-ilm mentioned this pull request May 19, 2023

Add missing IMATH_HOSTDEVICE to Matrix33<T>::invert(bool) #320

Merged
