linear_congruential_engine: Fixes for __lce_alg_picker #81080

LRFLEW · 2024-02-08T02:52:08Z

This fixes two major mistakes in the implementation of linear_congruential_engine that allowed it to produce incorrect output. Specifically, these mistakes are in __lce_alg_picker, which is used to determine whether Schrage's algorithm is valid and needed.

The first mistake is in the definition of _OverflowOK. The code comment and the description of D65041 both indicate that it's supposed to be true iff m is a power of two. However, the definition used does not work out to that, and instead is true whenever m is even. This could result in linear_congruential_engine using an invalid implementation, as it would incorrectly assume that any integer overflow can't change the result. I changed the implementation to one that accurately checks if m is a power of two. Technically, this implementation has an edge case where it considers 0 to be a power of two, but in this case this is actually accurate behavior, as m = 0 indicates a modulus of 2^w where w is the size of result_type in bits, which is a power of two.

The second mistake is in the static assert. The original static assert erroneously included an unnecessary a != 0 || m != 0. Combined with the || !_MightOverflow, this actually resulted in the static assert being impossible to fail. Applying De Morgan's law and expanding _MightOverflow gives that the only way this static assert can be triggered is if a == 0 && m == 0 && a != 0 && m != 0 && ..., which clearly cannot be true. I simply removed the explicit checks against a and m, as the intended checks are already included in _MightOverflow and _SchrageOK, and their inclusion doesn't provide any obvious semantic benefit.

This should fix all the current instances where linear_congruential_engine uses an invalid implementation. This technically isn't a complete implementation, though, since the static assert will cause some instantiations of linear_congruential_engine not disallowed by the standard from compiling. However, this should still be an improvement, as all compiling instantiations of linear_congruential_engine should use a valid implementation. Fixing the cases where the static assert triggers will require adding additional implementations, some of which will be fairly non-trivial, so I'd rather leave those for another PR so they don't hold up these more important fixes.

Fixes #33554

github-actions · 2024-02-08T02:52:23Z

Thank you for submitting a Pull Request (PR) to the LLVM Project!

This PR will be automatically labeled and the relevant teams will be
notified.

If you wish to, you can add reviewers by using the "Reviewers" section on this page.

If this is not working for you, it is probably because you do not have write
permissions for the repository. In which case you can instead tag reviewers by
name in a comment by using @ followed by their GitHub username.

If you have received no comments on your PR for a week, you can request a review
by "ping"ing the PR by adding a comment “Ping”. The common courtesy "ping" rate
is once a week. Please remember that you are asking for valuable time from other developers.

If you have further questions, they may be answered by the LLVM GitHub User Guide.

You can also ask questions in a comment on this PR, on the LLVM Discord or on the forums.

llvmbot · 2024-02-08T02:52:50Z

@llvm/pr-subscribers-libcxx

Author: None (LRFLEW)

Changes

This fixes two major mistakes in the implementation of linear_congruential_engine that allowed it to produce incorrect output. Specifically, these mistakes are in __lce_alg_picker, which is used to determine whether Schrage's algorithm is valid and needed.

The first mistake is in the definition of _OverflowOK. The code comment and the description of D65041 both indicate that it's supposed to be true iff m is a power of two. However, the definition used does not work out to that, and instead is true whenever m is even. This could result in linear_congruential_engine using an invalid implementation, as it would incorrectly assume that any integer overflow can't change the result. I changed the implementation to one that accurately checks if m is a power of two. Technically, this implementation has an edge case where it considers 0 to be a power of two, but in this case this is actually accurate behavior, as m = 0 indicates a modulus of 2^w where w is the size of result_type in bits, which is a power of two.

The second mistake is in the static assert. The original static assert erroneously included an unnecessary a != 0 || m != 0. Combined with the || !_MightOverflow, this actually resulted in the static assert being impossible to fail. Applying De Morgan's law and expanding _MightOverflow gives that the only way this static assert can be triggered is if a == 0 && m == 0 && a != 0 && m != 0 && ..., which clearly cannot be true. I simply removed the explicit checks against a and m, as the intended checks are already included in _MightOverflow and _SchrageOK, and their inclusion doesn't provide any obvious semantic benefit.

This should fix all the current instances where linear_congruential_engine uses an invalid implementation. This technically isn't a complete implementation, though, since the static assert will cause some instantiations of linear_congruential_engine not disallowed by the standard from compiling. However, this should still be an improvement, as all compiling instantiations of linear_congruential_engine should use a valid implementation. Fixing the cases where the static assert triggers will require adding additional implementations, some of which will be fairly non-trivial, so I'd rather leave those for another PR so they don't hold up these more important fixes.

Full diff: https://github.com/llvm/llvm-project/pull/81080.diff

1 Files Affected:

(modified) libcxx/include/__random/linear_congruential_engine.h (+3-3)

diff --git a/libcxx/include/__random/linear_congruential_engine.h b/libcxx/include/__random/linear_congruential_engine.h
index 51f6b248d8f974..5f2f845c98b141 100644
--- a/libcxx/include/__random/linear_congruential_engine.h
+++ b/libcxx/include/__random/linear_congruential_engine.h
@@ -31,10 +31,10 @@ template <unsigned long long __a,
           unsigned long long __m,
           unsigned long long _Mp,
           bool _MightOverflow = (__a != 0 && __m != 0 && __m - 1 > (_Mp - __c) / __a),
-          bool _OverflowOK    = ((__m | (__m - 1)) > __m),                    // m = 2^n
-          bool _SchrageOK     = (__a != 0 && __m != 0 && __m % __a <= __m / __a)> // r <= q
+          bool _OverflowOK    = ((__m & (__m - 1)) == 0ull),                           // m = 2^n
+          bool _SchrageOK     = (__a != 0 && __m != 0 && __m % __a <= __m / __a)>      // r <= q
 struct __lce_alg_picker {
-  static_assert(__a != 0 || __m != 0 || !_MightOverflow || _OverflowOK || _SchrageOK,
+  static_assert(!_MightOverflow || _OverflowOK || _SchrageOK,
                 "The current values of a, c, and m cannot generate a number "
                 "within bounds of linear_congruential_engine.");

github-actions · 2024-02-08T02:54:33Z

✅ With the latest revision this PR passed the C/C++ code formatter.

LRFLEW · 2024-02-08T03:00:42Z

I don't know why the formatter wants the comments to be unaligned (seems like a bizarre choice to me), but if that's what the project wants, I'll make the change.

LRFLEW · 2024-02-08T05:32:41Z

Looks like the failed tests are because the tests in rand.eng.lcong end up constructing engines that trigger the static assert. I'll look to update the tests specifically to avoid those cases for now.

LRFLEW · 2024-02-08T08:24:47Z

I've pushed some updated regression tests to account for the build failures. I reorganized some of the tests in rand.eng.lcong to include more test cases, while commenting out the test cases that are currently unsupported due to the static assert.

I also went ahead and added a specific regression test for the _OverflowOK fix, using the example I mentioned in #33554. I simply added it to rand.eng.lcong/alg.pass.cpp instead of making a new file because it seems to me like it's all testing the same idea (using the right algorithm to avoid integer overflow breaking the result), and figure it's better to keep like-tests together. If you want me to split this out to a separate file, let me know.

libcxx/test/std/numerics/rand/rand.eng/rand.eng.lcong/alg.pass.cpp

Fixes _OverflowOK and the static_assert to actually check for what they're intended to check.

ldionne

Thanks for the fix, I think this LGTM with a green CI -- I don't have much to add.

Let's add a release note explaining that we changed the value returned by this utility to be Standards conforming, but that it's technically an ABI break to do that. I think this falls within the realm of what is reasonable to change, especially since the status quo is that we disagree with other implementations.

LRFLEW · 2024-02-09T00:46:33Z

I looked into the CI failures, and they all seem to say "The runner has received a shutdown signal." I'm guessing they timed out from the CI as a whole taking too long. I don't see anything I need to address so far, so those tasks may just need to be restarted. I don't think I have the permissions needed to restart them, so I don't think I can do anything about it myself right now.

mordante · 2024-02-09T17:15:04Z

Thanks for working on this! I'll restart the failed jobs.

github-actions · 2024-02-15T18:46:40Z

@LRFLEW Congratulations on having your first Pull Request (PR) merged into the LLVM Project!

Your changes will be combined with recent changes from other authors, then tested
by our build bots. If there is a problem with a build, you may recieve a report in an email or a comment on this PR.

Please check whether problems have been caused by your change specifically, as
the builds can include changes from many authors. It is not uncommon for your
change to be included in a build that fails due to someone else's changes, or
infrastructure issues.

How to do this, and the rest of the post-merge process, is covered in detail here.

If your change does cause a problem, it may be reverted, or you can revert it yourself.
This is a normal part of LLVM development. You can fix your changes and open a new PR to merge them again.

If you don't get any reports, no action is required from you. Your changes are working as expected, well done!

…low (#81583) This PR is a followup to #81080. This PR makes two major changes to how the LCG operation is computed: The first is that I added an additional case where `ax + c` might overflow the intermediate variable, but `ax` by itself won't. In this case, it's much better to use `(ax mod m) + c mod m` than the previous behavior of falling back to Schrage's algorithm. The addition modulo is done in the same way as when using Schrage's algorithm (i.e. `x += c - (x >= m - c)*m`), but the multiplication modulo is calculated directly, which is faster. The second is that I added handling for the case where the `ax` intermediate might overflow, but Schrage's algorithm doesn't apply (i.e. r > q). In this case, the only real option is to increase the precision of the intermediate values. The good news is that - for `x`, `a`, and `c` being n-bit values - `ax + c` will never overflow a 2n-bit intermediary, meaning this promotion can only happen once, and will always be able to use the simplest implementation. This is already the case for 16-bit LCGs, as libcxx chooses to compute them with 32-bit intermediate values. For 32-bit LCGs, I simply added code similar to the 16-bit case to use the existing 64-bit implementations. Lastly, for 64-bit LCGs, I wrote a case that calculates it using `unsigned __int128` if it is available to use. While this implementation covers a *lot* of the missing cases from #81080, this still won't compile **every** possible `linear_congruential_engine`. Specifically, if `a`, `c`, and `m` are chosen such that it needs 128-bit integers, but the platform doesn't support `__int128` (eg. 32-bit x86), then it will fail to compile. However, this is a fairly rare case to see actually used, and libcxx would be in good company with this, as [libstdc++ also fails to compile under these circumstances](https://gcc.gnu.org/bugzilla/show_bug.cgi?id=87744). Fixing **this** gap would require even **more** work of further complexity, so that would probably be best handled by a different PR (I'll put more details on what that PR would entail in a comment).

…low (llvm#81583) This PR is a followup to llvm#81080. This PR makes two major changes to how the LCG operation is computed: The first is that I added an additional case where `ax + c` might overflow the intermediate variable, but `ax` by itself won't. In this case, it's much better to use `(ax mod m) + c mod m` than the previous behavior of falling back to Schrage's algorithm. The addition modulo is done in the same way as when using Schrage's algorithm (i.e. `x += c - (x >= m - c)*m`), but the multiplication modulo is calculated directly, which is faster. The second is that I added handling for the case where the `ax` intermediate might overflow, but Schrage's algorithm doesn't apply (i.e. r > q). In this case, the only real option is to increase the precision of the intermediate values. The good news is that - for `x`, `a`, and `c` being n-bit values - `ax + c` will never overflow a 2n-bit intermediary, meaning this promotion can only happen once, and will always be able to use the simplest implementation. This is already the case for 16-bit LCGs, as libcxx chooses to compute them with 32-bit intermediate values. For 32-bit LCGs, I simply added code similar to the 16-bit case to use the existing 64-bit implementations. Lastly, for 64-bit LCGs, I wrote a case that calculates it using `unsigned __int128` if it is available to use. While this implementation covers a *lot* of the missing cases from llvm#81080, this still won't compile **every** possible `linear_congruential_engine`. Specifically, if `a`, `c`, and `m` are chosen such that it needs 128-bit integers, but the platform doesn't support `__int128` (eg. 32-bit x86), then it will fail to compile. However, this is a fairly rare case to see actually used, and libcxx would be in good company with this, as [libstdc++ also fails to compile under these circumstances](https://gcc.gnu.org/bugzilla/show_bug.cgi?id=87744). Fixing **this** gap would require even **more** work of further complexity, so that would probably be best handled by a different PR (I'll put more details on what that PR would entail in a comment).

LRFLEW requested a review from a team as a code owner February 8, 2024 02:52

llvmbot added the libc++ libc++ C++ Standard Library. Not GNU libstdc++. Not libc++abi. label Feb 8, 2024

LRFLEW force-pushed the lcg-ovfl-fix branch from 9f9d69b to c3137ea Compare February 8, 2024 03:56

LRFLEW mentioned this pull request Feb 8, 2024

Linear congruential generator produces different values than gcc and msvc #33554

Closed

LRFLEW force-pushed the lcg-ovfl-fix branch from c3137ea to 1032be4 Compare February 8, 2024 08:20

LRFLEW force-pushed the lcg-ovfl-fix branch 2 times, most recently from a8eb596 to 7edc51a Compare February 8, 2024 10:00

jwakely reviewed Feb 8, 2024

View reviewed changes

libcxx/test/std/numerics/rand/rand.eng/rand.eng.lcong/alg.pass.cpp Outdated Show resolved Hide resolved

jwakely reviewed Feb 8, 2024

View reviewed changes

libcxx/test/std/numerics/rand/rand.eng/rand.eng.lcong/alg.pass.cpp Outdated Show resolved Hide resolved

LRFLEW force-pushed the lcg-ovfl-fix branch 2 times, most recently from 172e790 to 87e3c8d Compare February 8, 2024 12:03

Fixes for __lce_alg_picker

d1ea651

Fixes _OverflowOK and the static_assert to actually check for what they're intended to check.

LRFLEW force-pushed the lcg-ovfl-fix branch from 87e3c8d to d1ea651 Compare February 8, 2024 12:04

ldionne approved these changes Feb 8, 2024

View reviewed changes

LRFLEW mentioned this pull request Feb 13, 2024

linear_congruential_engine: add using more precision to prevent overflow #81583

Merged

mordante merged commit fc027e1 into llvm:main Feb 15, 2024
51 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

linear_congruential_engine: Fixes for __lce_alg_picker #81080

linear_congruential_engine: Fixes for __lce_alg_picker #81080

LRFLEW commented Feb 8, 2024 •

edited by ldionne

Loading

github-actions bot commented Feb 8, 2024

llvmbot commented Feb 8, 2024

github-actions bot commented Feb 8, 2024 •

edited

Loading

LRFLEW commented Feb 8, 2024

LRFLEW commented Feb 8, 2024

LRFLEW commented Feb 8, 2024

ldionne left a comment

LRFLEW commented Feb 9, 2024

mordante commented Feb 9, 2024

github-actions bot commented Feb 15, 2024

linear_congruential_engine: Fixes for __lce_alg_picker #81080

linear_congruential_engine: Fixes for __lce_alg_picker #81080

Conversation

LRFLEW commented Feb 8, 2024 • edited by ldionne Loading

github-actions bot commented Feb 8, 2024

llvmbot commented Feb 8, 2024

github-actions bot commented Feb 8, 2024 • edited Loading

LRFLEW commented Feb 8, 2024

LRFLEW commented Feb 8, 2024

LRFLEW commented Feb 8, 2024

ldionne left a comment

Choose a reason for hiding this comment

LRFLEW commented Feb 9, 2024

mordante commented Feb 9, 2024

github-actions bot commented Feb 15, 2024

LRFLEW commented Feb 8, 2024 •

edited by ldionne

Loading

github-actions bot commented Feb 8, 2024 •

edited

Loading