64 bit precision #4

Matt-Esch · 2015-12-30T14:21:41Z

18446744073709551616 is not a number you can represent in JS. The logic of the code appears to assume that you can represent a full 64-bit number.
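
For reference, a small sketch of where exact integer representation stops in JS. One nuance: 18446744073709551616 is 2^64, a power of two, so that particular value is exactly representable as a double. It is neighbouring 64-bit integers such as 2^64 - 1 that are not. All variable names here are illustrative only.

```javascript
// Largest integer n such that n and n + 1 are both exactly representable.
const exactLimit = Number.MAX_SAFE_INTEGER;          // 2^53 - 1

// Above 2^53 odd integers are lost: 2^53 + 1 rounds back to 2^53.
const two53 = Math.pow(2, 53);
const collides = (two53 + 1 === two53);

// 2^64 - 1, assembled from two 32-bit halves, rounds up to 2^64.
const maxUint64 = 0xFFFFFFFF * Math.pow(2, 32) + 0xFFFFFFFF;
const roundsUp = (maxUint64 === Math.pow(2, 64));

console.log(exactLimit, collides, roundsUp);
```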

AndreasMadsen · 2015-12-30T15:30:54Z

That is true. However, that code only scales the 64-bit number to a double in [0, 1), and a double can't hold a 64-bit integer anyway. The end result is that some bits are truncated, but the randomness is maintained.

That being said, I think there is a risk that the output range is [0, 1] because of this truncation. To solve this, I think one should do as suggested at http://xorshift.di.unimi.it/:

x = UINT64_C(0x3FF) << 52 | x >> 12;
double d = *((double *)&x) - 1.0;
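
A rough JavaScript equivalent of the C snippet above, as a sketch only. It assumes the 64-bit state arrives as two unsigned 32-bit halves (`hi`, `lo`); `doubleFromBits` is a hypothetical name, not the library's actual code. The last lines also show the naive scaling reaching exactly 1.

```javascript
// Scratch buffer reused between calls; DataView reads and writes big-endian by default.
const view = new DataView(new ArrayBuffer(8));

// Hypothetical helper: hi and lo are the upper and lower 32 bits of the 64-bit state.
function doubleFromBits(hi, lo) {
  // The 64-bit "x >> 12", split over two 32-bit words.
  const fracHi = hi >>> 12;                         // top 20 mantissa bits
  const fracLo = ((hi << 20) | (lo >>> 12)) >>> 0;  // low 32 mantissa bits
  // "0x3FF << 52" lands in the upper word as 0x3FF00000 (sign 0, exponent 1023).
  view.setUint32(0, (0x3FF00000 | fracHi) >>> 0);
  view.setUint32(4, fracLo);
  // The bits now encode a double in [1, 2); subtracting 1 maps it to [0, 1).
  return view.getFloat64(0) - 1.0;
}

// Naive scaling of the largest 64-bit value rounds up to exactly 1.
const naiveMax = (0xFFFFFFFF * Math.pow(2, 32) + 0xFFFFFFFF) / Math.pow(2, 64);
// The bit-level construction stays strictly below 1 for the same input.
const safeMax = doubleFromBits(0xFFFFFFFF, 0xFFFFFFFF);
console.log(naiveMax === 1, safeMax < 1);
```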

AndreasMadsen · 2016-01-02T17:24:19Z

51d7ce3 fixes the issue. Unfortunately it is about 5 times slower than before.

/cc @emilbayes any ideas

AndreasMadsen · 2016-01-08T16:13:42Z

A faster solution might be to construct the IEEE double indirectly using Math.pow(2, x), since 2^x is exactly representable in IEEE 754.

The solution that I attempted looks like this:

// :: t2 = randomint()
var t2 = this.randomint();
var t2U = t2[0];
var t2L = t2[1];

// :: s = t2 >> 12
var a1 = 12;
var m1 = 0xFFFFFFFF >>> (32 - a1);
var xU = t2U >>> a1;
var xL = (t2L >>> a1) | ((t2U & m1) << (32 - a1));

var x = xU * Math.pow(2, 32) + xL;
return x * Math.pow(2, -52);

and has almost no performance penalty.

However, there are some cases where this is not exactly equal to the explicit casting method that uses a Buffer object. I don't know why this is, but I suspect that V8 chooses a suboptimal representation of the initial double xU * Math.pow(2, 32) + xL.
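
One possible cause, offered as an assumption rather than anything established in this thread: JavaScript's `|` operator returns a signed 32-bit integer, so `xL` in the snippet above goes negative whenever bit 11 of `t2U` is set. The sketch below wraps that snippet in a hypothetical `attempt(hi, lo)` and compares it with a variant that adds `>>> 0`.

```javascript
// The posted arithmetic, with t2U/t2L passed in as hi/lo (hypothetical wrapper).
function attempt(hi, lo) {
  const xU = hi >>> 12;
  const xL = (lo >>> 12) | ((hi & 0xFFF) << 20);   // signed 32-bit result
  return (xU * Math.pow(2, 32) + xL) * Math.pow(2, -52);
}

// Same arithmetic, but xL is forced back to an unsigned value.
function attemptUnsigned(hi, lo) {
  const xU = hi >>> 12;
  const xL = ((lo >>> 12) | ((hi & 0xFFF) << 20)) >>> 0;
  return (xU * Math.pow(2, 32) + xL) * Math.pow(2, -52);
}

// With bit 11 of hi set, the signed variant ends up 2^32 too low before scaling.
const signedResult = attempt(0x00000800, 0);
const unsignedResult = attemptUnsigned(0x00000800, 0);
console.log(signedResult, unsignedResult);
```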

This is quite tricky to debug because values may have multiple representations in IEEE. For example, I see that depending on the casting method 2048 * Math.pow(2, -52) is represented as both:

buffer:   00111111 11110000 00000000 00000000 00000000 00000000 00001000 00000000
math.pow: 00111101 01100000 00000000 00000000 00000000 00000000 00000000 00000000
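
To inspect patterns like the two above, a double can be written into a `DataView` and read back as two 32-bit words. This is a minimal sketch, and `bitsOf` is a hypothetical helper. Read this way, the first pattern (exponent field 0x3FF) is 1 + 2048 * 2^-52, that is, the value before the final `- 1.0`, while the second is 2048 * 2^-52 itself.

```javascript
// Hypothetical helper: returns the 64 bits of a double as 16 hex digits.
function bitsOf(x) {
  const view = new DataView(new ArrayBuffer(8));
  view.setFloat64(0, x);                                   // big-endian by default
  const hi = view.getUint32(0).toString(16).padStart(8, '0');
  const lo = view.getUint32(4).toString(16).padStart(8, '0');
  return hi + lo;
}

const small = 2048 * Math.pow(2, -52);
const powBits = bitsOf(small);        // matches the "math.pow" row
const bufferBits = bitsOf(1 + small); // matches the "buffer" row
console.log(powBits, bufferBits);
```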

fanatid · 2016-02-11T16:37:48Z

@AndreasMadsen I implemented the transformation without Buffer and got a speedup of ~4x.
~~https://github.com/fanatid/xorshift/blob/a32712184697836d31d432735f1a8128f12e1277/lib/xorshift.js#L21~~ https://github.com/fanatid/xorshift.js/blob/45a74a0224e97c61bacd0bde3782f4f5e766539d/lib/xorshift.js#L23

AndreasMadsen · 2016-02-11T17:57:03Z

@fanatid Wow, that is amazing. I will try to see if I can understand it. The IEEE 754 multiplication algorithms are still quite mysterious to me.

fanatid · 2016-02-12T06:16:55Z

@AndreasMadsen there is a simple formula: sign * mantissa * 2 ** exponent
The sign is encoded in the first bit, the exponent in the next 11, and the mantissa in the last 52, for 64 bits in total.
0x3ff is 0b001111111111: the leading zero bit says that the sign is +, and the exponent is encoded as 1023, which gives 2**(1023 - ((2**11 - 1) >> 1)) = 2**0 = 1. Finally, the mantissa is the random number received from xorshift (right-shifted by 12 bits): 1 * 1 * mantissa * Math.pow(2, -52).
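
The layout described above can be sketched as a manual decoder. This assumes a normal (non-zero, non-subnormal, finite) double split into two unsigned 32-bit words, and it spells out the implicit leading 1 that the short formula leaves out: the stored 52 bits are the fraction of a significand in [1, 2). `decodeDouble` is a hypothetical name.

```javascript
// Hypothetical decoder for normal doubles given as two unsigned 32-bit words.
function decodeDouble(hi, lo) {
  const sign = (hi >>> 31) === 0 ? 1 : -1;                  // 1 sign bit
  const exponent = (hi >>> 20) & 0x7FF;                     // 11 bits, biased by 1023
  const fraction = (hi & 0xFFFFF) * Math.pow(2, 32) + lo;   // 52 fraction bits
  // Significand is 1.fraction; scale it by the unbiased power of two.
  return sign * (1 + fraction * Math.pow(2, -52)) * Math.pow(2, exponent - 1023);
}

// Exponent field 0x3FF with random fraction bits always lands in [1, 2).
const one = decodeDouble(0x3FF00000, 0);
const almostTwo = decodeDouble(0x3FFFFFFF, 0xFFFFFFFF);
console.log(one, almostTwo);
```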

AndreasMadsen · 2016-02-12T08:26:36Z

That part I do understand, but it doesn't explain how one can manipulate the bits by multiplication and addition. Sometimes multiplication will change the mantissa, other times it will change the exponent.

For example, multiplication by 2 could bitshift the mantissa by one or subtract one from the exponent. Both are correct, but bitshifting could have consequences for the existing precision, while subtracting could have consequences for the future precision.
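
For normal doubles (away from overflow and subnormals), multiplying by 2 is exact: only the exponent field changes, going up by one, and the 52 fraction bits are left alone. A small sketch to check this; `fields` is a hypothetical helper.

```javascript
// Hypothetical helper: splits a double into its raw IEEE 754 fields.
function fields(x) {
  const view = new DataView(new ArrayBuffer(8));
  view.setFloat64(0, x);               // big-endian by default
  const hi = view.getUint32(0);
  const lo = view.getUint32(4);
  return {
    sign: hi >>> 31,
    exponent: (hi >>> 20) & 0x7FF,     // still biased by 1023
    fractionHi: hi & 0xFFFFF,          // top 20 fraction bits
    fractionLo: lo,                    // low 32 fraction bits
  };
}

const before = fields(0.1);
const after = fields(0.1 * 2);
console.log(before, after);
```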

AndreasMadsen · 2016-02-12T08:41:11Z

I inserted your code into my test/debug setup. It works as you say. But interestingly enough, it does produce different binary representations.

value: 0.000011444093673373956
buffer:   00111111 11110000 00000000 00001100 00000000 00000000 00100001 00000011
math.pow: 00111110 11101000 00000000 00000000 01000010 00000110 00000000 00000000

I understand that the two representations are the same, but in my implementation they sometimes were exactly the same, because it added to the exponent when it should have multiplied the mantissa, or vice versa.

It looks good, but I will have to test this on a ginormous dataset to be sure.

fanatid · 2016-02-12T08:57:46Z

I think we shouldn't think about IEEE 754 at all. 9007199254740991 is the max integer representable in JS (53 bits); knowing this, we can write ~~(high * 0x00300000 + (low >>> 11)) * Math.pow(2, -53)~~
EDIT: oops, should be: (high * 0x00200000 + (low >>> 11)) * Math.pow(2, -53)
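
The corrected expression as a function, in a minimal sketch with hypothetical names. It keeps the top 53 bits of the state, whereas the `x >> 12` approach earlier in the thread keeps 52, so the two cannot agree bit for bit on every input.

```javascript
// The corrected expression from this comment: top 53 bits scaled by 2^-53.
function toDouble53(high, low) {
  return (high * 0x00200000 + (low >>> 11)) * Math.pow(2, -53);
}

// Hypothetical 52-bit counterpart, arithmetically equal to (x >> 12) * 2^-52.
function toDouble52(high, low) {
  return (high * 0x00100000 + (low >>> 12)) * Math.pow(2, -52);
}

// The largest input still maps below 1.
const largest = toDouble53(0xFFFFFFFF, 0xFFFFFFFF);

// Bit 11 of the low word survives in the 53-bit variant only.
const extraBit53 = toDouble53(0, 0x800);
const extraBit52 = toDouble52(0, 0x800);
console.log(largest < 1, extraBit53, extraBit52);
```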

AndreasMadsen · 2016-09-24T11:35:21Z

I never could get @fanatid's solution to match the reference. The current implementation uses a slow buffer approach.

LMLB · 2017-10-01T18:22:32Z

How about this:

  var t2 = this.randomint();
  // 2.220446049250313e-16 = Math.pow(2, -52)
  // 2.3283064365386963e-10 = Math.pow(2, -32)
  return t2[0] * 2.3283064365386963e-10 + (t2[1] >>> 12) * 2.220446049250313e-16;

I hard-coded the numbers for performance.
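
A quick check of this expression against the bit-casting approach, as a sketch: `fast` wraps the line above with `t2[0]` and `t2[1]` passed as `hi` and `lo`, and `reference` is a hypothetical `DataView` version of the `0x3FF << 52 | x >> 12` trick, not the library's actual code. Both sides amount to a 52-bit integer times 2^-52, so they should agree exactly.

```javascript
// The proposed expression, with the two 32-bit halves as arguments.
function fast(hi, lo) {
  return hi * 2.3283064365386963e-10 + (lo >>> 12) * 2.220446049250313e-16;
}

// Hypothetical reference: build a double in [1, 2) from the bits, then subtract 1.
const view = new DataView(new ArrayBuffer(8));
function reference(hi, lo) {
  view.setUint32(0, (0x3FF00000 | (hi >>> 12)) >>> 0);   // exponent + top 20 bits
  view.setUint32(4, ((hi << 20) | (lo >>> 12)) >>> 0);   // low 32 mantissa bits
  return view.getFloat64(0) - 1.0;
}

// A handful of hand-picked inputs, including both extremes.
const samples = [
  [0, 0],
  [0xFFFFFFFF, 0xFFFFFFFF],
  [0x00000800, 0],
  [0x12345678, 0x9ABCDEF0],
  [1, 0xFFF],
];
const allEqual = samples.every(([hi, lo]) => fast(hi, lo) === reference(hi, lo));
console.log(allEqual);
```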

Also:

> I inserted your code into my test/debug setup. It works as you say. But interestingly enough, it does produce different binary representations.
>
> value: 0.000011444093673373956
> buffer:   00111111 11110000 00000000 00001100 00000000 00000000 00100001 00000011
> math.pow: 00111110 11101000 00000000 00000000 01000010 00000110 00000000 00000000
>
> I understand that the two representations are the same, but in my implementation they sometimes were exactly the same, because it added to the exponent when it should have multiplied the mantissa, or vice versa.
>
> It looks good, but I will have to test this on a ginormous dataset to be sure.

They are not the same. "buffer" is actually 1.0000114440936734. In IEEE 754, every value has a unique binary representation, except NaN.
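
The two quoted patterns can be decoded directly; a minimal sketch with a hypothetical `fromBits` helper. The "buffer" pattern decodes to exactly one more than the "math.pow" pattern, consistent with it being the value before the final `- 1.0`.

```javascript
// Hypothetical helper: reinterprets two unsigned 32-bit words as a double.
function fromBits(hi, lo) {
  const view = new DataView(new ArrayBuffer(8));
  view.setUint32(0, hi);               // big-endian by default
  view.setUint32(4, lo);
  return view.getFloat64(0);
}

// buffer:   00111111 11110000 00000000 00001100 | 00000000 00000000 00100001 00000011
const bufferValue = fromBits(0x3FF0000C, 0x00002103);
// math.pow: 00111110 11101000 00000000 00000000 | 01000010 00000110 00000000 00000000
const powValue = fromBits(0x3EE80000, 0x42060000);

console.log(bufferValue, powValue, bufferValue - 1 === powValue);
```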

AndreasMadsen · 2017-10-01T19:01:45Z

> How about this:

Excellent, that appears to work. If you want street creds you can submit a PR. But without the hard-coded optimization, V8 optimizes it for you :)

> They are not the same. "buffer" is actually 1.0000114440936734. In IEEE 754, every value has a unique binary representation, except NaN.

Yeah, I realized that later, as I became wiser with the years.

LMLB · 2017-10-01T19:21:47Z

> But without the hard-coded optimization, V8 optimizes it for you :)

When it comes to browsers, at least one doesn't (IE11). :)

> If you want street creds you can submit a PR.

Sure, why not: #11

AndreasMadsen closed this as completed Sep 24, 2016
