You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I noticed a substantial slowdown (30%) of one of my codes after upgrading CUDA.jl to 3.5.
I bisected it to 0bc1191.
It seems that after this commit the __init_driver__ function is no longer called and the -nvptx-fma-level=1 LLVM option is not set, inhibiting fma generation.
I noticed a substantial slowdown (30%) of one of my codes after upgrading CUDA.jl to 3.5.
I bisected it to 0bc1191.
It seems that after this commit the
__init_driver__
function is no longer called and the-nvptx-fma-level=1
LLVM option is not set, inhibiting fma generation.@lcw
The text was updated successfully, but these errors were encountered: