libafl_bolts: some improvements to the `rands` module #2086

flyingmutant · 2024-04-21T10:56:30Z

First, I'd like to thank you for an excellent library!

I started playing around with libafl and reading the source code, and quickly noticed that there are some things that could be improved in the rands module of libafl_bolts. I've tried to make the commits in this PR relatively self-contained:

f5ef082 fixes seeding problems,
8298cb7 improves the speed of bounded number generation,
9951535, 40a45d6 and 19bbf44 add convenient utility functions,
fbea313 adds SFC64 PRNG.

Would gladly discuss and rework anything that raises concerns.

Seeding with splitmix64 is a good way to avoid starting with low-entropy PRNG states, and is explicitly recommended by the authors of both xoshiro256++ and Romu. While at it, give the xoshiro256++ PRNG its proper name.

SFC64 is a well-established and well-understood PRNG designed by Chris Doty-Humphrey, the author of PractRand. It has been tested quite a lot over the years, and to date has no known weaknesses. Compared to xoshiro256++, it is slightly faster and is likely to be a more future-proof design (xoshiro/xoroshiro family of generators come with quite long history of [flaws][1] found over the years). Compared to Romu, it is slightly slower, but guarantees absense of bias, minimum period of at least 2^64 for any seed, and non-overlapping streams for different seeds. [1]: https://tom-kaitchuck.medium.com/designing-a-new-prng-1c4ffd27124d

domenukk · 2024-04-22T08:27:27Z

libafl_bolts/src/rands.rs

+#[must_use]
+pub fn fast_bound(rand: u64, n: u64) -> u64 {
+    debug_assert_ne!(n, 0);
+    let mul = u128::from(rand).wrapping_mul(u128::from(n));


I don't know where I left the original comment, but: would u128 be an issue on 32 bit systems?
Also, this method should maybe be #[inline](?)

The compiler will inline it if it's good to do so, no worries.

I'll check if #[inline] can help with cross-crate inlining without requiring LTO.

Regarding speed, how serious are we about 32-bit performance? Most of the PRNGs in the module already assume 64-bit system (and Lehmer64 even uses an 128x64 multiply).

What we can do is change the signature to fast_bound(rand: u64, n: usize), and on 32-bit system fall back to full-width 32x32 multiply (as suggested by Lemire), instead of 64x64 like we do on 64-bit. This will give us the maximum possible speed on 32-bit systems, at the expense of more bias. I'd argue that this amount of bias is not a problem; usually n is much smaller than 2^32.

It looks like #[inline] should probably be added to everything non-generic performance-critical in the rands module, will do.

I like a 32 bit fallback since people sometimes run things on arm or even x86 32 bit code

domenukk · 2024-04-22T08:30:17Z

Looks good overall! Just a stupid question: aren't the canges to floats for probabilities potentially slower than integers everywhere?

domenukk · 2024-04-22T09:27:03Z

libafl_bolts/src/rands.rs

    #[must_use]
    pub fn with_seed(seed: u64) -> Self {
-        Self { val: seed }
+        let mut rand = XkcdRand { val: 0 };


We usually use Self in this case. Also, I don't get the reason for this specific change

The reason for this change is to ensure that regardless of the seeding, the output of next_float() or below() is reasonable (and changes with changes to the seed). With direct seeding without splitmix e.g. below(2^32) will return 0 for all 32-bit bit seeds.

Won't the output of next_float and below always be the same for XkcdRand anyway?

The difference is how the output varies with different seeds. With direct seeding:

seed 0 => below(100) = 0 seed 1 => below(100) = 0 seed ... => below(100) = 0 seed 2^32 => below(100) = 0

This makes it impossible to find a simple seed that gives the below(100) value you are aiming for. With splitmix, conceptually:

seed 0 => below(100) = 37 seed 1 => below(100) = 5 seed ... => below(100) = 81 seed 2^32 => below(100) = 17

Without splitmix-ed XkcdRand, 2e53c99 would have to change its test usage to another PRNG with more robust seeding. If that is preferrable, I can leave XkcdRand as-is and change the test instead.

XkcdRand is a joke type... https://xkcd.com/221/

domenukk · 2024-04-22T10:21:28Z

utils/libafl_benches/benches/rand_speeds.rs

    let mut romu = RomuDuoJrRand::with_seed(1);
    let mut lehmer = Lehmer64Rand::with_seed(1);
    let mut romu_trio = RomuTrioRand::with_seed(1);

+    c.bench_function("sfc64", |b| b.iter(|| black_box(sfc64.next())));


Random question, how does this guy do in the benchmark?

Sub-nanosecond u64 generation on modern machines (I am fighting with turbo boost on my laptop, so can't get stable numbers to report right now). Slower than Romu (which in general is considered risky due to having no guarantees regarding bias or cycle length), but faster than xoshiro256++ (which is among the fastest widely-used general-purpose PRNGs today).

I'd say this is as fast as you can go without sacrificing something (like Romu and Lehmer64 do).

flyingmutant · 2024-04-22T18:37:06Z

Looks good overall! Just a stupid question: aren't the canges to floats for probabilities potentially slower than integers everywhere?

If we are realistically targeting machines with no FPU, then yes. If that is the case, I can revert this back to integer percentages. My main reason to go with floats was that they are more natural to use and more convenient, especially when computing the probabilities.

domenukk · 2024-04-23T13:38:04Z

Think this looks good for now! if you have more things you want to change (like, 32 bit fallbacks or similar), feel free to open a new PR. Thx! :)

flyingmutant added 7 commits April 21, 2024 01:46

rands: use splitmix64 for seeding

f5ef082

Seeding with splitmix64 is a good way to avoid starting with low-entropy PRNG states, and is explicitly recommended by the authors of both xoshiro256++ and Romu. While at it, give the xoshiro256++ PRNG its proper name.

rands: use fast_bound() to generate number in range

8298cb7

rands: add top-level choose()

40a45d6

rands: add Rand::next_float()

9951535

rands: add Rand::coinflip() helper

19bbf44

libafl: unbreak tests that relied on direct seeding

2e53c99

domenukk reviewed Apr 22, 2024

View reviewed changes

domenukk merged commit e1b8c9b into AFLplusplus:main Apr 23, 2024
97 of 98 checks passed

flyingmutant deleted the rand branch April 23, 2024 19:23

flyingmutant mentioned this pull request Apr 23, 2024

libafl_bolts: more rands improvements #2096

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

libafl_bolts: some improvements to the `rands` module #2086

libafl_bolts: some improvements to the `rands` module #2086

flyingmutant commented Apr 21, 2024

domenukk Apr 22, 2024

addisoncrump Apr 22, 2024

flyingmutant Apr 22, 2024

flyingmutant Apr 22, 2024

domenukk Apr 23, 2024

domenukk commented Apr 22, 2024

domenukk Apr 22, 2024

flyingmutant Apr 22, 2024

domenukk Apr 23, 2024

flyingmutant Apr 23, 2024

addisoncrump Apr 23, 2024

domenukk Apr 22, 2024

flyingmutant Apr 22, 2024

flyingmutant commented Apr 22, 2024

domenukk commented Apr 23, 2024

libafl_bolts: some improvements to the rands module #2086

libafl_bolts: some improvements to the rands module #2086

Conversation

flyingmutant commented Apr 21, 2024

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

domenukk commented Apr 22, 2024

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

flyingmutant commented Apr 22, 2024

domenukk commented Apr 23, 2024

libafl_bolts: some improvements to the `rands` module #2086

libafl_bolts: some improvements to the `rands` module #2086