Fuzzer enhancement: Explicitly check output for uninitialized memory #22064

guidovranken · 2021-05-25T21:15:47Z

Is your feature request related to a problem? Please describe.

Both MemorySanitizer and Valgrind will only detect uninitialized memory if it is used for branching or IO.

E.g. the following program performs a computation using an uninitialized variable (a) but this won't trigger MSAN/Valgrind:

int main(void)
{
    int a; int b = a + 10;
    return 0;
}

Describe the solution you'd like

Call

extern "C" void __msan_check_mem_is_initialized(const volatile void *x, size_t size);

on the data to make MSAN evaluate it.

Describe alternatives you've considered

Alternative solution that also works with Valgrind: write the data to /dev/null:

#include <stdio.h>

int main(void)
{
    int a; int b = a + 10;
    FILE* fp = fopen("/dev/null", "wb");
    fwrite(&b, sizeof(b), 1, fp);
    fclose(fp);
    return 0;
}

Additional context

Proposal: Create a wrapper for __msan_check_mem_is_initialized (as a C++ method), e.g.:

void TestMsan(const void* data, const size_t size) {
   __msan_check_mem_is_initialized(x, size);
}

And use overloaded methods for special types, e.g.

void TestMsan(const std::string& s) {
   TestMsan(s.data(), s.size());
}

Then edit all fuzzer harnesses and call TestMsan with the output of each non-void method.

E.g. the parse_script harness would become:

// Copyright (c) 2009-2020 The Bitcoin Core developers
// Distributed under the MIT software license, see the accompanying
// file COPYING or http://www.opensource.org/licenses/mit-license.php.

#include <core_io.h>
#include <script/script.h>
#include <test/fuzz/fuzz.h>

FUZZ_TARGET(parse_script)
{
    const std::string script_string(buffer.begin(), buffer.end());
    try {
        TestMsan(ParseScript(script_string));
    } catch (const std::runtime_error&) {
    }
}

The same concept can be applied to the unit tests.

The text was updated successfully, but these errors were encountered:

maflcko · 2021-06-05T07:36:18Z

Concept ACK, obviously. Though, I am worried that this will make our code overly verbose, and hard to maintain. Maybe it would help if there was a compiler knob to adjust the aggressiveness of the memory sanitizers?

For "easy" memory violations, -ftrivial-auto-var-init=pattern might be enough to corrupt values and then cause a logic error down the line. At least it will detect the memory issue fixed in commit 3737126.

practicalswift · 2021-10-03T08:27:13Z

@guidovranken This technique was used in #23152 (comment). Thanks! :)

maflcko · 2023-09-27T15:23:06Z

For reference -ftrivial-auto-var-init=pattern, caught another issue (in leveldb) that both valgrind and msan missed: #28359

As mentioned previously, I don't think there is anything that can be done here, other than adding a compiler flag upstream.

In theory the wrapper code can be enforced with a clang-tidy plugin in fuzz code (cc @dergoegge), but the downsides of being incomplete and making the code overly verbose still hold.

dergoegge · 2023-09-28T13:53:49Z

In theory the wrapper code can be enforced with a clang-tidy plugin in fuzz code (cc @dergoegge), but the downsides of being incomplete and making the code overly verbose still hold.

(maybe this is what you mean by "enforce" but) My idea was to have the clang-tidy plugin auto refactor all our code to insert the wrappers prior to running the msan/valgrind job in CI. I think this should be possible and would avoid the verbosity of having the wrappers present in the actual code.

maflcko · 2023-09-28T18:05:51Z

refactor all our code

So every statement in the source code is wrapped and the memory is read by msan? Probably fine, but I'd suspect a massive slow-down.

I guess doing this once per release can't hurt.

real-or-random · 2024-04-09T13:58:12Z

Alternative solution that also works with Valgrind: write the data to /dev/null:

There's also VALGRIND_CHECK_MEM_IS_DEFINED, see also https://github.com/bitcoin-core/secp256k1/blob/master/src/checkmem.h for an abstraction layer that works with both MSan and Valgrind.

What's suggested in this issue, i.e., reporting every read of an uninitialized value, may just be too much. The Valgrind FAQ says this:

As for eager reporting of copies of uninitialised memory values, this has been suggested multiple times. Unfortunately, almost all programs legitimately copy uninitialised memory values around (because compilers pad structs to preserve alignment) and eager checking leads to hundreds of false positives. Therefore Memcheck does not support eager checking at this time.

But starting with clang 16, at least MSan gets us closer to this. Returning an uninitialized variables from a function, or passing uninitialized values to a function as a parameter is now considered a "use" of uninitialized memory, and MSan will report it by default. See the Clang 16.0.09 Release Notes:

-fsanitize-memory-param-retval is turned on by default. With -fsanitize=memory, passing uninitialized variables to functions and returning uninitialized variables from functions is more aggressively reported. -fno-sanitize-memory-param-retval restores the previous behavior.

guidovranken added the Feature label May 25, 2021

maflcko added the Brainstorming label May 26, 2021

maflcko mentioned this issue May 26, 2021

Mark CheckTxInputs [[nodiscard]]. Avoid UUM in fuzzing harness coins_view. #22065

Merged

maflcko mentioned this issue Jun 5, 2021

build: Add --with-append-cxxflags option #22159

Closed

maflcko added Tests and removed Feature labels Sep 27, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fuzzer enhancement: Explicitly check output for uninitialized memory #22064

Fuzzer enhancement: Explicitly check output for uninitialized memory #22064

guidovranken commented May 25, 2021

maflcko commented Jun 5, 2021

practicalswift commented Oct 3, 2021

maflcko commented Sep 27, 2023

dergoegge commented Sep 28, 2023

maflcko commented Sep 28, 2023

real-or-random commented Apr 9, 2024 •

edited

Fuzzer enhancement: Explicitly check output for uninitialized memory #22064

Fuzzer enhancement: Explicitly check output for uninitialized memory #22064

Comments

guidovranken commented May 25, 2021

maflcko commented Jun 5, 2021

practicalswift commented Oct 3, 2021

maflcko commented Sep 27, 2023

dergoegge commented Sep 28, 2023

maflcko commented Sep 28, 2023

real-or-random commented Apr 9, 2024 • edited

real-or-random commented Apr 9, 2024 •

edited