You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
(Generating random numbers has been a bottleneck in a couple of test setups.)
Managing k parallel random states is no big deal, but doing the range reduction is annoying, as vector multiplication support is ehhhh. It occurs to me that, pre-avx512, you can get a 52-bit high multiply out of a float multiply if you direct the rounding mode towards 0 or -inf (and deal with the artifacts of the virtual bit). Probably the interval from 2^53 to 2^63 is not so important.
The text was updated successfully, but these errors were encountered:
(Generating random numbers has been a bottleneck in a couple of test setups.)
Managing k parallel random states is no big deal, but doing the range reduction is annoying, as vector multiplication support is ehhhh. It occurs to me that, pre-avx512, you can get a 52-bit high multiply out of a float multiply if you direct the rounding mode towards 0 or -inf (and deal with the artifacts of the virtual bit). Probably the interval from 2^53 to 2^63 is not so important.
The text was updated successfully, but these errors were encountered: