Incorrect range calculation in mcode_alloc() #282

akopytov · 2017-02-23T19:16:08Z

The random address selection loop in mcode_alloc() unnecessarily halves the available allocation range:

    /* Next try probing pseudo-random addresses. */
    do {
      hint = (0x78fb ^ LJ_PRNG_BITS(J, 15)) << 16;  /* 64K aligned. */
    } while (!(hint + sz < range));
    hint = target + hint - (range>>1);

Two problems here:

1. We probably want LJ_PRNG_BITS(J, LJ_TARGET_JUMPRANGE-16) there, otherwise we may waste cycles on architectures with LJ_TARGET_JUMPRANGE < 31.
2. To get a block within [target - range, target + range) (since range is already half the available jump range), we want to check hint + sz against range * 2 and then subtract range.
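The arithmetic behind the second point can be sketched as follows. This is an illustration only, not LuaJIT code: `Window`, `window_original` and `window_proposed` are hypothetical names, and the sketch ignores the block size `sz` and the 64K alignment.

```c
#include <assert.h>
#include <stdint.h>

/* Hypothetical helper type: a half-open address window [lo, hi). */
typedef struct { uintptr_t lo, hi; } Window;

/* Loop as quoted in the report: hint is drawn from [0, range) and then
** shifted down by range/2, so the window is only `range` bytes wide. */
static Window window_original(uintptr_t target, uintptr_t range)
{
  Window w;
  assert(target >= range && target <= UINTPTR_MAX - range);
  w.lo = target - (range >> 1);
  w.hi = target + range - (range >> 1);
  return w;
}

/* Proposed check: hint is drawn from [0, 2*range) and shifted down by
** range, so the window is 2*range bytes wide and centered on target. */
static Window window_proposed(uintptr_t target, uintptr_t range)
{
  Window w;
  assert(target >= range && target <= UINTPTR_MAX - range);
  w.lo = target - range;
  w.hi = target + range;
  return w;
}
```

For any valid `target` and `range`, the proposed window is twice as wide as the original one, which is the factor of two described in the report.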

This is likely a minor issue on x86, where the available jump range is large. It becomes a serious problem on architectures where the range is already small (e.g. ARM64 with its ±128 MB range) and many threads run in parallel, which amplifies the mmap() contention that I reported earlier on the mailing list.


akopytov · 2017-02-24T06:26:27Z

Updated the report to remove the first part -- that code in mcode_alloc() is actually correct.

akopytov · 2017-02-25T11:04:18Z

There are other issues in mcode_alloc() contributing to mmap() contention. I'm going to report them separately.

Since 'range' in mcode_alloc() is calculated based on LJ_TARGET_JUMPRANGE-1, i.e. it is already half the available jump range, don't divide it by 2 again for randomized allocations. Also fix the number-of-bits argument to LJ_PRNG_BITS() so that it does not generate excess bits on architectures with LJ_TARGET_JUMPRANGE < 31. That wouldn't play well with the 0x78fb constant being XORed with the generated random number, apparently to improve PRNG properties, so that part has been removed. Improving the PRNG will be addressed separately.
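A minimal, self-contained sketch of the probing loop as the description above suggests it would look. It is an approximation under stated assumptions, not the applied patch: `prng_bits` is a hypothetical xorshift32 stand-in for LJ_PRNG_BITS(), `probe_address` and `jumprange_bits` (standing in for LJ_TARGET_JUMPRANGE) are illustrative names, and `target` and `range` are assumed to be multiples of 64K.

```c
#include <assert.h>
#include <stdint.h>

/* Hypothetical stand-in for LJ_PRNG_BITS(): returns `bits` pseudo-random
** bits from a xorshift32 generator (not LuaJIT's PRNG). */
static uint32_t prng_state = 0x12345678u;

static uint32_t prng_bits(int bits)
{
  assert(bits > 0 && bits < 32);
  prng_state ^= prng_state << 13;
  prng_state ^= prng_state >> 17;
  prng_state ^= prng_state << 5;
  return prng_state >> (32 - bits);
}

/* Pick a start address so that the block [addr, addr+sz) lies within
** [target-range, target+range). The offset is a multiple of 64K, so the
** result is 64K-aligned when target and range are (an assumption here). */
static uintptr_t probe_address(uintptr_t target, uintptr_t range,
                               uintptr_t sz, int jumprange_bits)
{
  uintptr_t hint;
  assert(jumprange_bits > 16 && jumprange_bits <= 31);
  assert(target >= range && sz < range + range);
  do {
    /* Only jumprange_bits-16 random bits are needed after the 64K shift. */
    hint = (uintptr_t)prng_bits(jumprange_bits - 16) << 16;
  } while (!(hint + sz < range + range));  /* Check against the full window. */
  return target + hint - range;            /* Shift by range, not range/2. */
}
```

With `jumprange_bits = 28` and `range = 1 << 27`, most random draws already fall inside the window, so the loop rarely retries.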

MikePall · 2017-03-08T22:08:50Z

Applied. Thanks!

This was referenced Feb 25, 2017

mcode_alloc() doing busywork on exhausted allocation pool #283

Closed

LJ_PRNG_BITS() is too weak for multi-threaded applications #284

Closed

Increase available allocation range in mcode_alloc() #285

Open

MikePall added 2.0 2.1 bug labels Mar 8, 2017

MikePall closed this as completed Mar 8, 2017

hrsantiago mentioned this issue Nov 30, 2022

Windows x64 performance issue, high number of exceptions #781

Closed
