random_seed.c: prefer getrandom() / /dev/{,u}random over arc4random() #832

jacmet · 2023-09-15T14:59:28Z

arc4random(3) is problematic to use during early boot, as it (as implemented by glibc or libbsd) uses getrandom(2) without the GRND_NONBLOCK flag, causing it to block until the kernel entropy pool is initialized, which may take a very long time.

One example of this is cryptsetup(8), which may be used for bringing up a LUKS encrypted root filesystem during early boot.

So instead prefer getrandom(.., GRND_NONBLOCK) (which will fail before the pool is initialized) and /dev/urandom (which will succeed, but cause the kernel to print a warning about access before the pool is initialized) before falling back to arc4random().

arc4random(3) is problematic to use during early boot, as it (as implemented by glibc or libbsd) uses getrandom(2) without the GRND_NONBLOCK flag, causing it to block until the kernel entropy pool is initialized, which may take a very long time. One example of this is cryptsetup(8), which may be used for bringing up a LUKS encrypted root filesystem during early boot. So instead prefer getrandom(.., GRND_NONBLOCK) (which will fail before the pool is initialized) and /dev/urandom (which will succeed, but cause the kernel to print a warning about access before the pool is initialized) before falling back to arc4random(). Signed-off-by: Peter Korsgaard <peter@korsgaard.com>

hawicz · 2023-09-15T22:20:57Z

Rather than fiddling around with the order in json_c_get_random_seed() to try to guess at what will work for all users of the json-c API, I feel like it's be better to provide a way for programs that know they're going to run in environments where random number generation is broken to explicitly adjust the behavior. e.g. add something like a json_c_avoid_blocking_randomness(int) and if that's set then skip arc4random() (and any other randomness function that might be determined to block).

jacmet · 2023-09-16T11:49:10Z

Rather than fiddling around with the order in json_c_get_random_seed() to try to guess at what will work for all users of the json-c API, I feel like it's be better to provide a way for programs that know they're going to run in environments where random number generation is broken to explicitly adjust the behavior. e.g. add something like a json_c_avoid_blocking_randomness(int) and if that's set then skip arc4random() (and any other randomness function that might be determined to block).

That is also fine, but significantly more effort (E.G. new json-c release, affected reverse dependencies have to (conditionally) add the call, new releases of those projects and integration in distributions / build systems.

What was the reason exactly for adding (and preferring) arc4random() in the first place? From the https://github.com/json-c/json-c/blob/master/ChangeLog#L83-L85 I see:

Use getrandom() if available; with GRND_NONBLOCK to allow use of json-c
very early during boot, such as part of cryptsetup.
Use arc4random() if it's available.

So there was already focus on the cryptsetup use case when it was added. How important is the quality of the random seed? From a quick look it seems as if it is only used for the hash table lookups, E.G. unlikely to be an issue for early userspace.

hawicz · 2023-09-17T18:55:23Z

Yes, it's "just" to initialize the hash function to make it more difficult for an attacker to generate content that would be subject to collisions, thus making a DoS more likely.
If we're going to make an effort to mitigate such an attack, we should use the best (cryptographically secure, fast) random number source that's available, which AIUI is arc4random.

Actually, looking at what arc4random actually does, it seems like the right fix for cryptsetup and other early/broken environment use cases is to upgrade to a version of libbsd that actually does pass GRND_NONBLOCK: https://git.hadrons.org/cgit/libbsd.git/tree/src/getentropy_linux.c#n206
The first version of libbsd with that code looks like it's 0.9.0, from 2018.

There's not much point in moving arc4random to the end of the list of things we try in json-c, because we'd effectively never end up using it. Perhaps we can detect whether the arc4random that's available is from a "too early" libbsd, and just not try to use it at all in that case, though I don't actually know a reliable way to do that offhand.

jacmet · 2023-09-17T20:45:51Z

Yes, it's "just" to initialize the hash function to make it more difficult for an attacker to generate content that would be subject to collisions, thus making a DoS more likely. If we're going to make an effort to mitigate such an attack, we should use the best (cryptographically secure, fast) random number source that's available, which AIUI is arc4random.

Actually, looking at what arc4random actually does, it seems like the right fix for cryptsetup and other early/broken environment use cases is to upgrade to a version of libbsd that actually does pass GRND_NONBLOCK: https://git.hadrons.org/cgit/libbsd.git/tree/src/getentropy_linux.c#n206 The first version of libbsd with that code looks like it's 0.9.0, from 2018.

There's not much point in moving arc4random to the end of the list of things we try in json-c, because we'd effectively never end up using it. Perhaps we can detect whether the arc4random that's available is from a "too early" libbsd, and just not try to use it at all in that case, though I don't actually know a reliable way to do that offhand.

Sorry, but I don't quite get that. All the different interfaces to random numbers (on Linux at least) boil down to the same thing (the kernel random number generator). They do not return "better" or "worse" random numbers, they only differ in the behaviour when the kernel entropy pool is not yet initialized or empty:

getrandom() reads from (the equivalent of) /dev/random or /dev/urandom. If the kernel random number entropy pool is not initialized yet (E.G. the early userspace situation), then it will block if called without GRND_NONBLOCK, and return -1 if called with it. E.G. from man getrandom(2):

GRND_NONBLOCK
              By default, when reading from the random source, getrandom() blocks if no random bytes are available, and when reading from the urandom source, it blocks if the entropy pool has not yet been initialized.  If the GRND_NONBLOCK flag  is
              set, then getrandom() does not block in these cases, but instead immediately returns -1 with errno set to EAGAIN.

/dev/{u,}random reads from the kernel random number entropy pool. If the entropy pool is not initialized yet, then /dev/random will block, /dev/urandom not
arc4random calls getrandom() without the GRND_NONBLOCK flag, E.G. it blocks until the pool is initialized

So I don't quite get why we should prefer arc4random(). If it is for non-Linux setups, then they won't have the getrandom() and /dev/{,u}random interfaces, so the ordering doesn't matter.

Using arc4random() from a newer version of libbsd on Linux would just mean that it would end up calling getrandom(,, GRND_NONBLOCK) which would return -1, and it would fall back to using /dev/urandom, exactly as my proposed patch.

jacmet · 2023-09-23T09:27:28Z

@hawicz ping?

jacmet · 2023-11-18T16:10:18Z

@hawicz ping? How can we make progress on this?

jacmet · 2023-12-01T10:26:19Z

@hawicz it would be great to get this merged, thanks

jacmet · 2023-12-22T14:30:41Z

@hawicz ping?

hawicz · 2023-12-24T21:04:13Z

The reason I prefer to keep arc4random first is so json-c makes use of the code that is purpose built to do randomness generation correctly, and thus users of json-c gain a bit of system-wide consistency in how random numbers are generated, and I don't need to think as much about fallbacks or relative suitability and order of the other mechanisms that random_seed.c might call.
arc4random appears to be the "popular" modern choice for generating randomness, and though I'm usually loath to pick something just based on popularity, in this case it seems that this will gain the potential benefit of future improvements/bug-fixes to the RNG without the need to change json-c itself.
If there's a problem with randomness generation, it seems more sensible to upgrade the RNG function rather than all the users of that function, i.e. address the root cause. In the problematic cryptsetup case, is there something that would prevent fixing the blocking issue by upgrading to a newer libbsd (or now glibc,, since apparently >=2.36 includes it)

jacmet · 2023-12-26T08:54:13Z

On Linux, arc4random, getrandom, /dev/{u,}random all get their data from the same entropy pool, so there is no difference in the "quality" of the output. In Glibc and libbsd, arc4random is implemented using getrandom(). The big problem with arc4random() is that it is isn't suitable for early user space because it doesn't take any flags / blocks for a (potentially very long time) for the kernel entropy pool to be filled (E.G. it calls getrandom() without the GRND_NONBLOCK flags):

GRND_NONBLOCK
              By default, when reading from the random source, getrandom() blocks if no random bytes are available, and when reading from the urandom source, it blocks if the entropy pool has not yet been initialized.  If the GRND_NONBLOCK flag  is
              set, then getrandom() does not block in these cases, but instead immediately returns -1 with errno set to EAGAIN.

For a lot (most?) JSON-C users during early boot, calling getrandom(, GRND_NONBLOCK) (which fails fast with -1) and then falling back to /dev/urandom (which succeeds, even though theoretically the "randomness" cannot be guaranteed yet) is very much preferably to hanging for a very long time until arc4random() returns, as the hash collision problem the random seed is supposed to protect against is not an issue (E.G. cryptsetup uses JSON-C to read a JSON formatted configuration file of a few KB).

Once early boot is completed and the kernel entropy pool is initialized, calling getrandom() or arc4random() is completely equivalent. The difference is only about early user space where arc4random() is not suitable and getrandom() / dev/urandom is.

arc4random() is arguably a legacy function from BSD, so I am surprised to hear that you think it is the popular choice, except that it is a bit more portable given libbsd.

jacmet · 2024-02-05T17:17:48Z

@hawicz ping?

jacmet · 2024-02-29T14:24:10Z

@hawicz Can we make progress on this issue please?

jacmet · 2024-03-22T12:08:45Z

what can I do to make progress on this?

jacmet · 2024-06-13T13:08:29Z

@hawicz ping?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

random_seed.c: prefer getrandom() / /dev/{,u}random over arc4random() #832

random_seed.c: prefer getrandom() / /dev/{,u}random over arc4random() #832

jacmet commented Sep 15, 2023

hawicz commented Sep 15, 2023

jacmet commented Sep 16, 2023

hawicz commented Sep 17, 2023

jacmet commented Sep 17, 2023

jacmet commented Sep 23, 2023

jacmet commented Nov 18, 2023

jacmet commented Dec 1, 2023

jacmet commented Dec 22, 2023

hawicz commented Dec 24, 2023

jacmet commented Dec 26, 2023

jacmet commented Feb 5, 2024

jacmet commented Feb 29, 2024

jacmet commented Mar 22, 2024

jacmet commented Jun 13, 2024

random_seed.c: prefer getrandom() / /dev/{,u}random over arc4random() #832

Are you sure you want to change the base?

random_seed.c: prefer getrandom() / /dev/{,u}random over arc4random() #832

Conversation

jacmet commented Sep 15, 2023

hawicz commented Sep 15, 2023

jacmet commented Sep 16, 2023

hawicz commented Sep 17, 2023

jacmet commented Sep 17, 2023

jacmet commented Sep 23, 2023

jacmet commented Nov 18, 2023

jacmet commented Dec 1, 2023

jacmet commented Dec 22, 2023

hawicz commented Dec 24, 2023

jacmet commented Dec 26, 2023

jacmet commented Feb 5, 2024

jacmet commented Feb 29, 2024

jacmet commented Mar 22, 2024

jacmet commented Jun 13, 2024