Clear sensitive memory without getting optimized out #636

real-or-random · 2019-06-06T20:49:54Z

This is a rebase of #448, including a few changes (mostly implementing what I described in #185). I haven't tested this on Windows and I actually haven't reviewed this in detail so far, but people can start to have a look.

Fixes #185 and closes #448.

real-or-random · 2019-06-06T20:51:29Z

src/util.h

+#if defined(_MSC_VER)
+    /* SecureZeroMemory is guaranteed not to be optimized out by MSVC. */
+    SecureZeroMemory(ptr, n);
+#elif defined(__GNUC__)


I couldn't figure out when GCC started to support assembly but it's some version <= 2.95... So if you have such an old compiler, I guess you're on your own.

GCC 2.95 is likely the earliest we'd would ever encounter someone wanting to support. (Haiku OS uses it still, I believe :) )

Yeah, I think this is fine.

real-or-random · 2019-06-06T20:52:21Z

src/util.h

 #include <stdlib.h>
 #include <stdint.h>
 #include <stdio.h>

+#if defined(_MSC_VER)
+// For SecureZeroMemory
+#include <Windows.h>


I think that's okay but I'm not entirely sure if we want that. It's used in Bitcoin Core too.

I would be that this should be version gated, I'll see if I can figure out where thats supported to.

gmaxwell · 2019-06-06T23:49:30Z

I think using a sizeof on an argument is likely to introduce almost impossible to detect bugs when e.g. someone hands it the first element of an array (esp one passed from another function). It doesn't feel particularly typesafe to me.

Is there a reason why you preferred this approach to having every subsystem (like field, scalar) define a clear function for its relevant type?

real-or-random · 2019-06-07T06:51:37Z

No, I don't prefer that to be honest. It's just the approach taken in #448, and I was somewhat worried about it, too.

I'll try to change it.

real-or-random · 2019-06-12T14:40:32Z

I removed the macro from the PR. Also I removed _fe_set_zero from the PR (one can use _fe_set_int).

I'm happy to take suggestions for additional code locations where memory should be cleared.

real-or-random · 2019-06-12T14:43:32Z

src/util.h

+     * just not remove it entirely. See "Dead Store Elimination (Still) Considered Harmful" by
+     * Yang et al. (USENIX Security 2017) for more background.
+     */
+    memset(ptr, 0, len);


Now that "clearing" is cleanly separated from "setting to zero", it's a good idea to replace 0 with a different value here for testing that the code does not rely on the fact that the "_clear" functions set to 0. We could actually always use a different value here, or use a different value in (some runs of) tests. What do people think about this?

Did you try checking the binary that it's actually not optimized out?

Not yet. But yes, we should do this with a few compilers.

it's a good idea to replace 0 with a different value here for testing

I don't think it's a good idea to use a different value in testing. But I don't see a downside to just replace 0 with 42. This also has the (slight) advantage that it'll make it easier to detect now improper usage of *_clear when rebasing -zkp on top of this. -zkp uses *_clear for initializing in many places.

I like the idea of using a non-zero constant byte value. Using 0 is more likely to result in silent failures when uninitialized memory is accidentally used.

real-or-random · 2019-06-12T14:47:47Z

src/field.h

@@ -49,10 +49,11 @@ static int secp256k1_fe_normalizes_to_zero(secp256k1_fe *r);
 *  implementation may optionally normalize the input, but this should not be relied upon. */
 static int secp256k1_fe_normalizes_to_zero_var(secp256k1_fe *r);

-/** Set a field element equal to a small integer. Resulting field element is normalized. */
+/** Set a field element equal to a small integer. Resulting field element is normalized; it has
+ *  magnitude 0 if a == 0, and magnitude 1 otherwise. */
 static void secp256k1_fe_set_int(secp256k1_fe *r, int a);


(I wondered why this accepts int. An unsigned type seems more appropriate. For type safety, we would even want to exclude too large values and use something as small as unsigned char. But I guess this is slower and just not worth the hassle)?

Agree, though this seems like something for another patch.

elichai · 2019-06-12T15:17:22Z

What about clearing the memory of the temporary Field Elements in the GE addition? https://github.com/bitcoin-core/secp256k1/blob/master/src/group_impl.h#L419 (and I think most of this file)

real-or-random · 2019-06-12T22:10:42Z

Thanks, collecting more:
https://github.com/bitcoin-core/secp256k1/blob/master/src/scalar_impl.h#L71
A lot in https://github.com/bitcoin-core/secp256k1/blob/master/src/scalar_8x32_impl.h (also also 4x64)

And I think there more in the field module... I guess I'll just go through the entire codebase. 🤷‍♂️

jonasnick

I confirmed by looking at the disassembly that both the memory barrier and volatile function pointer method do work with gcc 9.1.0.
Looking at the disassembly also confirmed that the memory barrier method lets gcc optimize the memset and the volatile function pointer method lets gcc actually call memset.
I didn't look at Windows.

I changed the memset overwrite from 0x00 to 0x42 and added a function

void secp256k1_do_nothing(void) {
    unsigned char foo[32];
    secp256k1_mem_clear(foo, sizeof(foo));
}

After compiling the tests with -fno-stack-protector (for brevity) the output of objdump -d tests is:

000000000001ac30 <secp256k1_do_nothing>:
   1ac30:	66 0f 6f 05 e8 2c 04 	movdqa 0x42ce8(%rip),%xmm0        # 5d920 <secp256k1_ge_const_g+0x80>
   1ac37:	00 
   1ac38:	48 8d 44 24 d8       	lea    -0x28(%rsp),%rax
   1ac3d:	0f 29 44 24 d8       	movaps %xmm0,-0x28(%rsp)
   1ac42:	0f 29 44 24 e8       	movaps %xmm0,-0x18(%rsp)
   1ac47:	c3                   	retq   
   1ac48:	0f 1f 84 00 00 00 00 	nopl   0x0(%rax,%rax,1)
   1ac4f:	00

This moves 16 bytes of 0x42 located at 5d920 (readelf -x .rodata tests) into xmm0 and then moves two times 16 bytes from xmm0 into the foo buffer.

With the volatile function pointer method the disassembly is as follows:

000000000001acd0 <secp256k1_do_nothing>:
   1acd0:	48 83 ec 38          	sub    $0x38,%rsp
   1acd4:	48 8b 05 ed a2 04 00 	mov    0x4a2ed(%rip),%rax        # 64fc8 <memset@GLIBC_2.2.5>
   1acdb:	ba 20 00 00 00       	mov    $0x20,%edx
   1ace0:	be 42 00 00 00       	mov    $0x42,%esi
   1ace5:	48 8d 7c 24 10       	lea    0x10(%rsp),%rdi
   1acea:	48 89 44 24 08       	mov    %rax,0x8(%rsp)
   1acef:	48 8b 44 24 08       	mov    0x8(%rsp),%rax
   1acf4:	ff d0                	callq  *%rax
   1acf6:	48 83 c4 38          	add    $0x38,%rsp
   1acfa:	c3                   	retq   
   1acfb:	0f 1f 44 00 00       	nopl   0x0(%rax,%rax,1)

This calls memset with arguments rsp + 10, 0x42 and 32 as expected (see the calling convention).

jonasnick · 2019-06-24T23:08:59Z

Results are positive and very similar to the above with clang version 8.0.0.

elichai · 2019-06-24T23:44:29Z

@jonasnick did you try to benchmark if this affects performance in a non negligible way?

jonasnick · 2019-06-25T07:50:40Z

On my system the memory barrier and volatile function pointer method have no measurable performance difference. However, using one of the methods vs. using nothing in bench_sign appears to be 1.2% slower (41.3us vs 40.8us, averaged over 6 runs).

jonasnick

approach ACK. Very cool, someone had to do it eventually.

Clearing no secrets in pippenger is fine, not sure why I added it.
Looks like fe_clear is replaced everywhere with set_int(0) and the right memsets are replaced with the corresponding _clear function

I noticed that a few field.h comments about magnitudes are wrong (independent of this PR).
For example a field element set to 0 with fe_set_int will have magnitude 0 but can be correctly compared with secp256k1_fe_equal.

Tests run fine under valgrind, also with endo and using the volatile function pointer method. I didn't test on Windows.

Going through the codebase to look for locations where a _clear is missing can happen in another PR.

jonasnick · 2019-07-29T15:06:13Z

Looks like memsetting to 0x42 instead of 0 costs about 1% :/

ecdsa_sign: min 41.3us / avg 41.5us / max 42.2us
vs.
ecdsa_sign: min 40.9us / avg 41.1us / max 41.8us

(I changed run_benchmark count argument from 10 to 500)

sipa · 2019-07-29T16:54:54Z

@jonasnick That sounds like a deal breaker, if true.

real-or-random · 2019-07-29T18:53:54Z

Note that's just signing time, not verification time.

But I agree, let's stick to 0, that's just simpler.

sipa · 2019-08-06T22:58:56Z

Concept ACK.

There are a number of data types which end up being cleared using the generic secp256k1_mem_clear() function after this PR (including at least secp256k1_sha256 and secp256k1_rfc6979_hmac_sha256). I think it would be cleaner to provide *_clear() functions for those as well, and use those where relevant.

This code is not supposed to handle secret data.

We rely on memset() and an __asm__ memory barrier where it's available or on SecureZeroMemory() on Windows. The fallback implementation uses a volatile function pointer to memset which the compiler is not clever enough to optimize.

There are two uses of the secp256k1_fe_clear() function that are now separated into these two functions in order to reflect the intent: 1) initializing the memory prior to being used -> converted to fe_set_int( . , 0 ) 2) zeroing the memory after being used such that no sensitive data remains. -> remains as fe_clear() In the latter case, 'magnitude' and 'normalized' need to be overwritten when VERIFY is enabled. Co-Authored-By: isle2983 <isle2983@yahoo.com>

Co-Authored-By: isle2983 <isle2983@yahoo.com> Co-Authored-By: Pieter Wuille <pieter.wuille@gmail.com>

All of the invocations of secp256k1mem_clear() operate on stack memory and happen after the function is done with the memory object. This commit replaces existing memset() invocations and also adds secp256k1_memclear() to code locations where clearing was missing; there is no guarantee that this commit covers all code locations where clearing is necessary. Co-Authored-By: isle2983 <isle2983@yahoo.com>

This gives the caller more control about whether the state should be cleaned (= should be considered secret), which will be useful for example for Schnorr signature verification in the future. Moreover, it gives the caller the possibility to clean a hash struct without finalizing it.

real-or-random · 2020-04-27T17:00:26Z

I rebased this and implemented @sipa's suggestion for the hash API. Now the caller is responsible to call _clean() (instead of letting _finalize) do it. I think this is actually the right thing to do.

nit: Also renamed secp256k1_mem_clear to memclear now that we have also memczero in util.h

real-or-random · 2020-04-27T17:15:29Z

Going through the codebase to look for locations where a _clear is missing can happen in another PR.

Indeed, so this is ready for review.

For looking into more places in the future, in particular within some arithmetic function, I was wondering if "overdoing" this will introduce more stores to memory, which is not only slightly worse for security but will probably also affect performace. For example, the explicit_bzero function in Glibc has this problem:

Also, explicit_bzero only operates on RAM. If a sensitive data object never needs to have its address taken other than to call explicit_bzero, it might be stored entirely in CPU registers until the call to explicit_bzero. Then it will be copied into RAM, the copy will be erased, and the original will remain intact. Data in RAM is more likely to be exposed by a bug than data in registers, so this creates a brief window where the data is at greater risk of exposure than it would have been if the program didn’t try to erase it at all.

https://www.gnu.org/software/libc/manual/html_node/Erasing-Sensitive-Data.html

AFAIU and I've seen from looking at generated ASM this should not happen with our memory barrier approach but I'm not 100% sure and any insight will be helpful.

Note to myself: Cleaning registers is out of the scope currently but today the cited USENIX paper reminded me of a simple but expensive way to do this: Just run the operation again. (For example, run another SHA256 transform on dummy midstate and dummy input.) This will reuse and hence overwrite the registers (modulo inlining etc).

real-or-random · 2020-05-15T16:28:54Z

Apparently stores of 64-byte zero-over-zero can be faster than stores of other values, see https://twitter.com/BRIAN_____/status/1260913021116993536. This would be a reason not so overwrite with zeros.

But I think we shouldn't have secrets that are all zeroes.This would be relevant for indistinguishability-based primitives with arbitrary inputs, e.g., encryption, but not for signing and key exchange.

sipa · 2023-05-08T15:16:34Z

Conceptually I think it makes sense to have separate secure-erase wiping and zeroing of values. It needs a big redo by now though; are you still interesting in working on this?

real-or-random · 2023-05-08T15:18:19Z

Conceptually I think it makes sense to have separate secure-erase wiping and zeroing of values. It needs a big redo by now though; are you still interesting in working on this?

Yeah, I certainly want to come back to this, but I don't want to commit to a specific timeline. Marking as draft for now.

…vival of #636) 765ef53 Clear _gej instances after point multiplication to avoid potential leaks (Sebastian Falbesoner) 349e6ab Introduce separate _clear functions for hash module (Tim Ruffing) 99cc9fd Don't rely on memset to set signed integers to 0 (Tim Ruffing) 97c57f4 Implement various _clear() functions with secp256k1_memclear() (Tim Ruffing) 9bb368d Use secp256k1_memclear() to clear stack memory instead of memset() (Tim Ruffing) e3497bb Separate between clearing memory and setting to zero in tests (Tim Ruffing) d79a6cc Separate secp256k1_fe_set_int( . , 0 ) from secp256k1_fe_clear() (Tim Ruffing) 1c08126 Add secp256k1_memclear() for clearing secret data (Tim Ruffing) e7d3844 Don't clear secrets in pippenger implementation (Tim Ruffing) Pull request description: This PR picks up #636 (which in turn picked up #448, so this is take number three) and is essentially a rebase on master. Some changes to the original PR: * the clearing function now has the `secp256k1_` prefix again, since the related helper `_memczero` got it as well (see PR #835 / commit e89278f) * the original commit b17a7df ("Make _set_fe_int( . , 0 ) set magnitude to 0") is not needed anymore, since it was already applied in PR #943 (commit d49011f) * clearing of stack memory with `secp256k1_memclear` is now also done on modules that have been newly introduced since then, i.e. schnorr and ellswift (of course, there is still no guarantee that all places where clearing is necessary are covered) So far I haven't looked at any disassembly and possible performance implications yet (there were some concerns expressed in #636 (comment)), happy to go deeper there if this gets Concept ACKed. The proposed method of using a memory barrier to prevent optimizating away the memset is still used in BoringSSL (where it was originally picked up from) and in the Linux Kernel, see e.g. https://github.com/google/boringssl/blob/5af122c3dfc163b5d1859f1f450756e8e320a142/crypto/mem.c#L335 and https://github.com/torvalds/linux/blob/d4560686726f7a357922f300fc81f5964be8df04/include/linux/string.h#L348 / https://github.com/torvalds/linux/blob/d4560686726f7a357922f300fc81f5964be8df04/include/linux/compiler.h#L102 Fixes #185. ACKs for top commit: sipa: reACK 765ef53 real-or-random: ACK 765ef53 Tree-SHA512: 5a034d5ad14178c06928022459f3d4f0877d06f576b24ab07b86b3608b0b3e9273217b8309a1db606f024f3032731f13013114b1e0828964b578814d1efb2959

real-or-random commented Jun 6, 2019

View reviewed changes

real-or-random force-pushed the cleanse branch from aa65cd6 to 198cf25 Compare June 12, 2019 14:37

real-or-random marked this pull request as ready for review June 12, 2019 14:38

real-or-random commented Jun 12, 2019

View reviewed changes

jonasnick reviewed Jun 24, 2019

View reviewed changes

jonasnick reviewed Jun 25, 2019

View reviewed changes

sipa mentioned this pull request Jul 23, 2019

Clear sensitive memory without getting optimized out. #448

Closed

elichai mentioned this pull request Aug 28, 2019

Questions about some clear functions usage #659

Closed

real-or-random mentioned this pull request Nov 25, 2019

Is the compiler optimizing out some of the benchmarks? #667

Closed

elichai mentioned this pull request Feb 4, 2020

secp256k1_ecdh_hash_function must return 0 or 1 rust-bitcoin/rust-secp256k1#196

Closed

elichai mentioned this pull request Mar 1, 2020

Simplify callback logic to returning raw coordinates rust-bitcoin/rust-secp256k1#201

Merged

real-or-random force-pushed the cleanse branch 2 times, most recently from 7c525da to b808227 Compare April 27, 2020 13:44

Don't clear secrets in pippenger implementation

3f96b07

This code is not supposed to handle secret data.

real-or-random force-pushed the cleanse branch from b808227 to 6dc08e7 Compare April 27, 2020 13:55

real-or-random and others added 3 commits April 27, 2020 17:38

Add secp256k1_memclear() for clearing secret data

4949658

We rely on memset() and an __asm__ memory barrier where it's available or on SecureZeroMemory() on Windows. The fallback implementation uses a volatile function pointer to memset which the compiler is not clever enough to optimize.

Make _set_fe_int( . , 0 ) set magnitude to 0

b17a7df

real-or-random and others added 5 commits April 27, 2020 17:38

Separate between clearing memory and setting to zero in tests

9006e91

Co-Authored-By: isle2983 <isle2983@yahoo.com> Co-Authored-By: Pieter Wuille <pieter.wuille@gmail.com>

Implement various _clear() functions with secp256k1_memclear()

dda8737

Don't rely on memset to set signed integers to 0

244c749

real-or-random force-pushed the cleanse branch from 6dc08e7 to 6a34c60 Compare April 27, 2020 16:58

elichai mentioned this pull request Apr 30, 2020

Add usage examples #748

Merged

real-or-random mentioned this pull request May 1, 2020

Secret key generation and cleaning #749

Open

mratsim mentioned this pull request May 15, 2020

Protection of users private keys status-im/nimbus-eth2#545

Open

real-or-random mentioned this pull request Sep 9, 2020

Cleaner infinity handling in group law and ecmult_const. #791

Closed

real-or-random mentioned this pull request Oct 5, 2020

Switch to our own memcmp function #825

Closed

elichai mentioned this pull request Oct 18, 2020

Secrets Management with Secrecy/Zeroize summa-tx/coins#15

Open

roconnor-blockstream mentioned this pull request May 13, 2021

VERIFY_CHECK precondition for secp256k1_fe_set_int. #943

Merged

real-or-random mentioned this pull request Oct 15, 2021

Verify that secp256k1_ge_set_gej_zinv does not operate on infinity. #942

Merged

real-or-random mentioned this pull request Nov 16, 2021

Add Elligator Square module #982

Closed

real-or-random mentioned this pull request Dec 28, 2021

Signed-digit multi-comb for ecmult_gen (by peterdettman) #693

Closed

real-or-random mentioned this pull request Aug 5, 2022

Add dedicated methods for verifying asset/value proofs BlockstreamResearch/secp256k1-zkp#195

Open

3 tasks

real-or-random marked this pull request as draft May 8, 2023 15:18

real-or-random added assurance side-channel labels May 8, 2023

real-or-random mentioned this pull request May 22, 2023

dead store in secp256k1_ecmult_gen BlockstreamResearch/secp256k1-zkp#234

Closed

real-or-random mentioned this pull request Aug 4, 2024

Add module "musig" that implements MuSig2 multi-signatures (BIP 327) #1479

Merged

theStack mentioned this pull request Aug 6, 2024

Clear sensitive memory without getting optimized out (revival of #636) #1579

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Clear sensitive memory without getting optimized out #636

Clear sensitive memory without getting optimized out #636

real-or-random commented Jun 6, 2019

real-or-random Jun 6, 2019

gmaxwell Jun 7, 2019

sipa Jul 23, 2019

real-or-random Jun 6, 2019

gmaxwell Jun 7, 2019

gmaxwell commented Jun 6, 2019

real-or-random commented Jun 7, 2019 •

edited

Loading

real-or-random commented Jun 12, 2019

real-or-random Jun 12, 2019

elichai Jun 12, 2019

real-or-random Jun 12, 2019

jonasnick Jun 24, 2019

sipa Jul 23, 2019

real-or-random Jun 12, 2019

sipa Jul 23, 2019

elichai commented Jun 12, 2019 •

edited

Loading

real-or-random commented Jun 12, 2019 •

edited

Loading

jonasnick left a comment

jonasnick commented Jun 24, 2019

elichai commented Jun 24, 2019

jonasnick commented Jun 25, 2019

jonasnick left a comment •

edited

Loading

jonasnick commented Jul 29, 2019

sipa commented Jul 29, 2019 •

edited

Loading

real-or-random commented Jul 29, 2019

sipa commented Aug 6, 2019

real-or-random commented Apr 27, 2020 •

edited

Loading

real-or-random commented Apr 27, 2020

real-or-random commented May 15, 2020

sipa commented May 8, 2023

real-or-random commented May 8, 2023

Clear sensitive memory without getting optimized out #636

Are you sure you want to change the base?

Clear sensitive memory without getting optimized out #636

Conversation

real-or-random commented Jun 6, 2019

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

gmaxwell commented Jun 6, 2019

real-or-random commented Jun 7, 2019 • edited Loading

real-or-random commented Jun 12, 2019

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

elichai commented Jun 12, 2019 • edited Loading

real-or-random commented Jun 12, 2019 • edited Loading

jonasnick left a comment

Choose a reason for hiding this comment

jonasnick commented Jun 24, 2019

elichai commented Jun 24, 2019

jonasnick commented Jun 25, 2019

jonasnick left a comment • edited Loading

Choose a reason for hiding this comment

jonasnick commented Jul 29, 2019

sipa commented Jul 29, 2019 • edited Loading

real-or-random commented Jul 29, 2019

sipa commented Aug 6, 2019

real-or-random commented Apr 27, 2020 • edited Loading

real-or-random commented Apr 27, 2020

real-or-random commented May 15, 2020

sipa commented May 8, 2023

real-or-random commented May 8, 2023

real-or-random commented Jun 7, 2019 •

edited

Loading

elichai commented Jun 12, 2019 •

edited

Loading

real-or-random commented Jun 12, 2019 •

edited

Loading

jonasnick left a comment •

edited

Loading

sipa commented Jul 29, 2019 •

edited

Loading

real-or-random commented Apr 27, 2020 •

edited

Loading