Several Improvements to the random package #3135

mfelsche · 2019-04-11T21:19:21Z

I packaged those changes in 1 PR but left them as separate commits. If any of the changes is controversial or should be split, i am happy to do so.

Add XorOshiro128** and SplitMix64 PRNGs http://xoshiro.di.unimi.it/
Update XorOshiro128+ paramaters as noted in http://xoshiro.di.unimi.it/xoroshiro128plus.c
Add from_64bits constructor to 128 bit state PRNGs that uses SplitMix64 as suggested here: http://xoshiro.di.unimi.it/
Use fixed-point inversion for generating bounded random numbers using Random.int. But fall back to slower floating-point multiplication technique if no native 128 bit ops are supported.
Add Random.int_unbiased function for generating an unbiased bounded random number. For details see: http://www.pcg-random.org/posts/bounded-rands.html This one is slightly slower as Random.int.
Added tests and benchmarks

Example benchmark run on my machine:

Benchmark                                   mean            median   deviation  iterations
random/int                                  3 ns              3 ns      ±0.31%     3000000
random/int-unbiased                        25 ns             25 ns      ±0.45%     2000000
random/int-fp-mult                         17 ns             17 ns      ±0.72%     3000000
random/next/mt                             11 ns             11 ns      ±0.69%     3000000
random/next/xorshift128+                    3 ns              3 ns      ±0.47%     3000000
random/next/xoroshiro128+                   3 ns              3 ns      ±0.67%     3000000
random/next/xoroshiro128**                  3 ns              3 ns      ±0.17%     3000000
random/next/splitmix64                      4 ns              4 ns      ±0.73%     3000000

Legend: random/int is the new fixed-point inversion method. random/int-fp-mult is the old Random.int implementation.

srenatus

LGTM. Just some questions inline. Haven't checked the algorithms. ⚠

srenatus · 2019-04-13T07:21:02Z

packages/random/_test.pony

+
+  fun apply(h: TestHelper) =>
+    let xoroshiro128 = XorOshiro128StarStar(5489)
+    h.assert_eq[U64](xoroshiro128.next(), 529225608228480)


Would it be possible (and would you consider it worth doing) to instead iterate over a list of U64 instead of repeating this line over and over?

Also, are these test vectors from the papers, or from some other implementation? I suppose adding a comment where the numbers come from could be useful in the future.

The numbers are from executions of the reference implementation in C. I added comments about each source.

I also initially wanted to use a list to iterate over, but i decided against it, as the effort would have been roughly the same and the current version has the advantage that the error message in case of failure will tell you at which position it failed, without counting lines while iterating through a list.

Thanks! You're right about that line no detail, what you've chosen is better! =)

srenatus · 2019-04-13T07:42:20Z

packages/random/benchmarks/main.pony

+    DoNotOptimise[U64](x)
+    DoNotOptimise.observe()
+
+class iso RandomBenchmark[T: Random ref] is MicroBenchmark


jemc · 2019-04-16T16:09:24Z

packages/random/xorshift.pony

@@ -10,6 +10,15 @@ class XorShift128Plus is Random
  var _x: U64
  var _y: U64

+  new from_64bit(x: U64 = 5489) =>


Maybe from_u64 would be more consistent with other APIs?

Same comment applies to other additions.

SeanTAllen · 2019-04-23T16:13:03Z

@mfelsche should this be squashed? should it be merged? it's unclear to me. i'll leave it to for you to merge/squash as needed.

mfelsche · 2019-04-29T18:34:32Z

Sorry, I was off for a little while. I am gonna add manual changelog entries and squash while providing a proper commit message.

instead of floating-point multiplication. But only use it if the platform supports native 128 bit ops, as we need to multiply with U128, because we cannot (yet) usesmaller types when generating e.g. U32 with Random.int. Waiting for specialized generic functions here. Also added some benchmarks to show the performance increase. http://www.pcg-random.org/posts/bounded-rands.html states it is just as biased as fp-mult version and a quick local test, generating a histogram showed proper uniform distribution. Also added 'int_unbiased a slower but unbiased function for generating bounded random numbers.

xoroshiro128** is a new xoroshiro variant with less bias in the low bits than xoroshiro128** but also with a small performance impact. Also use new parameters for xoroshiro128+ as described in the NOTE here: http://xoshiro.di.unimi.it/xoroshiro128plus.c splitmix64 is a PRNG which only needs 64 bits of state and is primarily used to seed xoroshiro and xorshift generators from 64 bits only using the new 'from_64bits' constructors on those. splitmix64 should only be used when 64 bit of state is a hard requirement, otherwise use xoroshiro128+ or xoroshiro128**.

srenatus approved these changes Apr 13, 2019

View reviewed changes

jemc reviewed Apr 16, 2019

View reviewed changes

jemc approved these changes Apr 16, 2019

View reviewed changes

mfelsche added 5 commits April 29, 2019 20:35

add comments to tests about source of random numbers to test agains

4cef79e

rename from_64_bits constructors to from_u64

e136f2a

[skip ci] added manual changelog entry

63c5237

mfelsche force-pushed the random-int-range branch from 04fa65e to 63c5237 Compare April 29, 2019 18:43

mfelsche merged commit 18a5c87 into master Apr 29, 2019

mfelsche deleted the random-int-range branch April 29, 2019 18:52

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Several Improvements to the random package #3135

Several Improvements to the random package #3135

mfelsche commented Apr 11, 2019

srenatus left a comment

srenatus Apr 13, 2019

srenatus Apr 13, 2019

mfelsche Apr 14, 2019

srenatus Apr 15, 2019

srenatus Apr 13, 2019

jemc Apr 16, 2019

SeanTAllen commented Apr 23, 2019

mfelsche commented Apr 29, 2019

Several Improvements to the random package #3135

Several Improvements to the random package #3135

Conversation

mfelsche commented Apr 11, 2019

srenatus left a comment

Choose a reason for hiding this comment

srenatus Apr 13, 2019

Choose a reason for hiding this comment

srenatus Apr 13, 2019

Choose a reason for hiding this comment

mfelsche Apr 14, 2019

Choose a reason for hiding this comment

srenatus Apr 15, 2019

Choose a reason for hiding this comment

srenatus Apr 13, 2019

Choose a reason for hiding this comment

jemc Apr 16, 2019

Choose a reason for hiding this comment

SeanTAllen commented Apr 23, 2019

mfelsche commented Apr 29, 2019