Don't use integer division for cong #50427

gbaraldi · 2023-07-05T19:34:07Z

Based on the bitmask implementation described in https://www.pcg-random.org/posts/bounded-rands.html. The are potentially more benefits here to switching to a 32 bit only implementation similar to the one in the conclusion of the post, specially because some users of cong only need 32 bits or probably even 16 bits

instead do a bitwise and and resample

oscardssmith · 2023-07-05T19:39:57Z

src/julia_internal.h

-    return *seed % max;
+    uint64_t mask = ~(uint64_t)0;
+    --max;
+    mask >>= __builtin_clzll(max|1);


ideally we would be passing in the mask also but I expect this to still be better

true, we could attempt to keep this as the unbias_cong argument that it was before, but is it worth it? This seems cheap enough.

vtjnash · 2023-07-05T20:07:43Z

src/julia_internal.h

+    mask >>= __builtin_clzll(max|1);
+    uint64_t x;
+    do {
+        while ((*seed = 69069 * (*seed) + 362437) > unbias)


this unbias term is now possibly biasing your results slightly and should be removed
(it is from the algorithm "Division with Rejection (Unbiased)" or equivalently "Debiased Modulo (Twice)" previously)

Do we want to keep the api of rand_ptls and just throwaway that argument, or do I potentially just break stuff?

just change the API

vchuravy · 2023-07-05T21:33:47Z

base/partr.jl

-function unbias_cong(max::UInt32)
-    return typemax(UInt32) - ((typemax(UInt32) % max) + UInt32(1))
-end
+cong(max::UInt32) = ccall(:jl_rand_ptls, UInt32, (UInt32,), max) + UInt32(1)


Could we implement it in Julia?

We can as well, though I wonder if we can potentially avoid the PTLS entirely.

oscardssmith · 2023-07-10T13:37:00Z

Do we have any benchmarks for this? Otherwise, LGTM.

gbaraldi · 2023-07-10T13:37:48Z

On my computer the rand_ptls call went from 70 to 50 ns.

gbaraldi · 2023-07-10T15:10:27Z

I haven't done extensive profiling though, it might be possible to make this even faster, but I didn't look too deeply.

Don't use integer division for cong,

a472948

instead do a bitwise and and resample

gbaraldi assigned vchuravy and oscardssmith and unassigned vchuravy and oscardssmith Jul 5, 2023

gbaraldi requested review from vchuravy and oscardssmith July 5, 2023 19:36

oscardssmith reviewed Jul 5, 2023

View reviewed changes

vtjnash reviewed Jul 5, 2023

View reviewed changes

Remove unbias argument and change the API

ffc23a3

vtjnash approved these changes Jul 5, 2023

View reviewed changes

vchuravy reviewed Jul 5, 2023

View reviewed changes

Merge branch 'master' into faster-cong

f2a1c10

Merge branch 'master' into faster-cong

79d09a0

vtjnash added the status:merge me PR is reviewed. Merge when all tests are passing label Jul 24, 2023

gbaraldi merged commit 6f6439e into JuliaLang:master Jul 24, 2023
7 checks passed

oscardssmith removed the status:merge me PR is reviewed. Merge when all tests are passing label Jul 24, 2023

maleadt mentioned this pull request Apr 16, 2024

Task scheduling regression #54101

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Don't use integer division for cong #50427

Don't use integer division for cong #50427

gbaraldi commented Jul 5, 2023

oscardssmith Jul 5, 2023

vtjnash Jul 5, 2023

vtjnash Jul 5, 2023 •

edited

gbaraldi Jul 5, 2023

vtjnash Jul 5, 2023

vchuravy Jul 5, 2023

gbaraldi Jul 5, 2023

oscardssmith commented Jul 10, 2023

gbaraldi commented Jul 10, 2023

gbaraldi commented Jul 10, 2023

Don't use integer division for cong #50427

Don't use integer division for cong #50427

Conversation

gbaraldi commented Jul 5, 2023

oscardssmith Jul 5, 2023

Choose a reason for hiding this comment

vtjnash Jul 5, 2023

Choose a reason for hiding this comment

vtjnash Jul 5, 2023 • edited

Choose a reason for hiding this comment

gbaraldi Jul 5, 2023

Choose a reason for hiding this comment

vtjnash Jul 5, 2023

Choose a reason for hiding this comment

vchuravy Jul 5, 2023

Choose a reason for hiding this comment

gbaraldi Jul 5, 2023

Choose a reason for hiding this comment

oscardssmith commented Jul 10, 2023

gbaraldi commented Jul 10, 2023

gbaraldi commented Jul 10, 2023

vtjnash Jul 5, 2023 •

edited