Add option to allow exploiting intprcast alignment #1074

Aaron1011 · 2019-11-24T19:32:53Z

Currently, we have a test that asserts that code is not allowed to rely on any 'extra' information provided by the intprcast (e.g. a [u16; 2] array that happens to have alignment '4').

However, this is a perfectly legitimate thing for code to do. I propose that we add a intprcast-alignment option which enables the following behavior:

When we check the alignment for a memory access, we see if we have a recorded base address for the allocation (i.e. if a pointer within the allocation was ever cast to an integer).

If we do not have a base address, we use the current alignment checking behavior (i.e. check the static alignment of the type). This will catch cases where the code is definitely wrong - if the pointer was never cast to an integer, the code cannot possibly know that it happened to have 'extra' alignment.

If we do have a base address, then we do the alignment check based on the actual base address. This will allow some incorrect code, like:

let mut my_arr: [u8; 100]`
my_arr.as_ptr() as usuze; // Dummy cast
unsafe { *(my_arr.as_mut_ptr() as *mut u16) = 25 }

Depending on what alignment we pick for the base address of my_addr, this may or may not work. This means that whether or not this program passes now depends on the random seed.

While this isn't ideal, I don't see a way of allowing the intptrcast_alignment_check to pass without also causing 'spurious passes' (code which really should fail, but doesn't).

The text was updated successfully, but these errors were encountered:

RalfJung · 2019-12-01T21:36:05Z

If we do have a base address, then we do the alignment check based on the actual base address.

As that test shows, we currently do not do this by design. I understand that this seems strange. See rust-lang/rust#62420 for why I think this is a good idea.

However, I am open to also offer the other mode you are asking for in Miri. What I am not sure about is the default (I lean towards being conservative and keeping the current behavior the default). If we want to change the default, IMO it is a hard requirement that we, by default, at least warn when code exploits alignment and thus could "spuriously pass". (Cc #797)

tmiasko · 2020-02-16T11:16:07Z

A correctly aligned memory access which is under aligned with respect to the
whole allocation constitutes a very weak evidence that there is a bug. Such
memory accesses are relatively common. For example when dealing with binary
data formats, or when writing a custom memory allocators.

I have repeatedly seen prompts for a code review, where person asking had
lingering doubts about code correctness after Miri reported an alignment error.
Thus far, the errors invariably turned out to be a false positives and code correct.

If the code in question does in fact miss an alignment check, the bug can be
reliably caught through randomization without producing false positives.

Furthermore, the suggested workaround using align_to, if applicable at all,
results in no test coverage. After all Miri intentionally exercises possibility
of spurious failure in align_offset, resulting in executions without those
memory accesses, quite unlike executions outside Miri.

RalfJung · 2020-02-16T11:43:27Z

Such
memory accesses are relatively common. For example when dealing with binary
data formats, or when writing a custom memory allocators.

Could you go into a bit more detail about how this is common when dealing with binary data? I would think then you rarely control the actual alignment of the buffer nor the offset of the data, so you have to use unaligned accesses?

(I also think you are biased here when you say "relatively common", because that seems to be the kind of code you are dealing with, but the vast majority of code is not like that.)

That said...

I have repeatedly seen prompts for a code review, where person asking had
lingering doubts about code correctness after Miri reported an alignment error.
Thus far, the errors invariably turned out to be a false positives and code correct.

... this is a good argument. Miri also has found a few actual alignment errors in the past; I am not sure how many of them we would have missed if full alignment was taken into account.

Anyway, the first step is to implement full alignment checks as an option, keeping the current behavior as a default. I was never opposed to that, it has just not been implemented. :)

If the code in question does in fact miss an alignment check, the bug can be
reliably caught through randomization without producing false positives.

Does this mean Miri should somehow help with that?

Furthermore, the suggested workaround using align_to, if applicable at all,
results in no test coverage.

OTOH, I am worried that that currently untested code will often use SIMD as that's a common reason to want higher alignment in the first place; making align_to always work in Miri will probably reduce the amount of code that can be run in practice.

tmiasko · 2020-02-16T13:01:16Z

Consider for example D-Bus decoding. It is a binary format where the data
stored inside is naturally aligned with respect to the beginning of the
message. If the message is stored in user controlled allocation there is no
guarantee it will be suitably aligned with respect to the address space,
additionally data might have possibly a different endianness. Yet, almost
always it will be possible to cast buffer data directly into decoded type, and
we can take advantage of that by offering an API along the lines of:

impl dbus::Decoder<'d> {
  fn from_bytes(buffer: &'d [u8]) -> Self {
    ...
  }

  fn decode_array<T: FixedSize>(&mut self) -> dbus::Result<Cow<'d, [T]>> {
    ...
  }
}

The implementation would check if data is sufficiently aligned for T and
return &d [T] whenever possible, otherwise it would make a copy and return
Vec<T>. Yet under Miri, if everything happens to align, the application is
unlucky since Miri considers this to be an error.

The bytemuck crate offers a safe API for such slice conversions (and there
are many others). Looking at dependent crates might provide further examples.

The current behaviour of checking alignment against the allocation layout
significantly reduces the amount of code that can run under Miri.

RalfJung · 2020-02-16T13:05:21Z

The implementation would check if data is sufficiently aligned for T and
return &d [T] whenever possible, otherwise it would make a copy and return
Vec.

Ah, so there is a fallback path. That is the part I was missing. Now it makes sense. :)

RalfJung · 2020-02-16T13:46:50Z

So I propose the next step is to add an option that both exploits full alignment, and makes align_to behave like it does "normally". Then we can experiment with that to see how bad the SIMD situation really is.

I won't have time to do that any time soon, but I can provide some guidance if someone else is up for the task.

@shepmaster

for alignment errors, note that there might be false positives Cc @shepmaster ``` error: Undefined Behavior: accessing memory with alignment 1, but alignment 8 is required --> tests/compile-fail/unaligned_pointers/alignment.rs:8:9 | 8 | *y_ptr = 42; | ^^^^^^^^^^^ accessing memory with alignment 1, but alignment 8 is required | = help: this usually indicates that your program performed an invalid operation and caused Undefined Behavior = help: but alignment errors can also be false positives, see #1074 ```

thomcc · 2020-04-14T07:23:58Z

Regarding align_to, it's somewhat common to just do an unaligned load (ptr::read_unaligned, _mm_loadu_blah, etc) for the first and last read (and to handle too small sequences separately). This avoids needing to have different code for all the cases, for example.

I guess it would be unfortunate for miri not to work under that circumstance...

I have repeatedly seen prompts for a code review, where person asking had
lingering doubts about code correctness after Miri reported an alignment error.
Thus far, the errors invariably turned out to be a false positives and code correct

Seconding this -- miri's alignment complaints are basically noise since they're so often false positives -- any code that manually aligns pointers, whether correctly or not, sets miri off...

RalfJung · 2020-04-14T07:47:13Z

Using align_to you need a fallback path for the prefix and suffix, right? If you use that, things should work fine in Miri because it just puts everything into the prefix.

Seconding this -- miri's alignment complaints are basically noise since they're so often false positives -- any code that manually aligns pointers, whether correctly or not, sets miri off...

Enough people spoke up by now that I am indeed convinced we should at the very least offer the option to be less strict about alignment. :) Repeating more arguments along those lines is not going to convince me even more.

What remains is finding someone to do the actual work. :D

@oli-obk

miri engine: add option to use force_int for alignment check This is needed for rust-lang/miri#1074. The Miri-side patch is at rust-lang/miri#1513. r? @oli-obk

add option to use force_int for alignment check Fixes #1074. Depends on rust-lang/rust#75592.

RalfJung · 2020-08-17T17:01:47Z

To actually get this fix into the Miri shipped with rustup, we'll need to do a submodule bump in the rustc repo.

Also the test changes in PR #1513 demonstrate quite well the problem with alignment checks that this causes -- I had to make most compile-fail alignment test repeat 10 times to make sure they actually fail. But overall that is probably still better than false positives by default.

enable align_to tests in Miri With rust-lang/miri#1074 resolved, we can enable these tests in Miri. I also tweaked the test sized to get reasonable execution times with decent test coverage.

RalfJung added A-intptrcast Area: affects int2ptr and ptr2int casts C-proposal Category: a proposal for something we might want to do, or maybe not; details still being worked out labels Nov 25, 2019

RalfJung added the E-medium label Mar 28, 2020

RalfJung mentioned this issue Mar 28, 2020

Enable Miri to emit warnings without halting execution #797

Open

RalfJung mentioned this issue Apr 12, 2020

[Rust] FlatBufferBuilder::create_vector_direct reports undefined behavior when run through Miri google/flatbuffers#5854

Closed

shepmaster mentioned this issue Apr 12, 2020

Allow ignoring alignment violations #1326

Closed

RalfJung added C-enhancement Category: a PR with an enhancement or an issue tracking an accepted enhancement and removed C-proposal Category: a proposal for something we might want to do, or maybe not; details still being worked out labels Apr 13, 2020

RalfJung mentioned this issue Apr 14, 2020

Add option to disable alignment check #1332

Merged

RalfJung added the I-false-UB Impact: makes Miri falsely report UB, i.e., a false positive (with default settings) label Apr 16, 2020

RalfJung mentioned this issue Jun 14, 2020

Alignment Woes and False Positives #1449

Closed

hsivonen mentioned this issue Aug 14, 2020

Potential Unsound: 1 out-of-bound read and 5 unaligned memory access. hsivonen/encoding_rs#52

Closed

This was referenced Aug 16, 2020

ptr::align_offset generates surprisingly bad code rust-lang/rust#75579

Closed

miri engine: add option to use force_int for alignment check rust-lang/rust#75592

Merged

add option to use force_int for alignment check #1513

Merged

bors added a commit that referenced this issue Aug 17, 2020

Auto merge of #1513 - RalfJung:int-align, r=RalfJung

01d0cf0

add option to use force_int for alignment check Fixes #1074. Depends on rust-lang/rust#75592.

bors added a commit that referenced this issue Aug 17, 2020

Auto merge of #1513 - RalfJung:int-align, r=RalfJung

62502a0

add option to use force_int for alignment check Fixes #1074. Depends on rust-lang/rust#75592.

bors added a commit that referenced this issue Aug 17, 2020

Auto merge of #1513 - RalfJung:int-align, r=RalfJung

07244c2

add option to use force_int for alignment check Fixes #1074. Depends on rust-lang/rust#75592.

bors added a commit that referenced this issue Aug 17, 2020

Auto merge of #1513 - RalfJung:int-align, r=RalfJung

834bd63

add option to use force_int for alignment check Fixes #1074. Depends on rust-lang/rust#75592.

bors added a commit that referenced this issue Aug 17, 2020

Auto merge of #1513 - RalfJung:int-align, r=RalfJung

1122caa

add option to use force_int for alignment check Fixes #1074. Depends on rust-lang/rust#75592.

bors closed this as completed in 066fa62 Aug 17, 2020

bors closed this as completed in #1513 Aug 17, 2020

RalfJung mentioned this issue Aug 19, 2020

enable align_to tests in Miri rust-lang/rust#75694

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add option to allow exploiting intprcast alignment #1074

Add option to allow exploiting intprcast alignment #1074

Aaron1011 commented Nov 24, 2019

RalfJung commented Dec 1, 2019

tmiasko commented Feb 16, 2020

RalfJung commented Feb 16, 2020 •

edited

Loading

tmiasko commented Feb 16, 2020

RalfJung commented Feb 16, 2020

RalfJung commented Feb 16, 2020

thomcc commented Apr 14, 2020 •

edited

Loading

RalfJung commented Apr 14, 2020 •

edited

Loading

RalfJung commented Aug 17, 2020

Add option to allow exploiting intprcast alignment #1074

Add option to allow exploiting intprcast alignment #1074

Comments

Aaron1011 commented Nov 24, 2019

RalfJung commented Dec 1, 2019

tmiasko commented Feb 16, 2020

RalfJung commented Feb 16, 2020 • edited Loading

tmiasko commented Feb 16, 2020

RalfJung commented Feb 16, 2020

RalfJung commented Feb 16, 2020

thomcc commented Apr 14, 2020 • edited Loading

RalfJung commented Apr 14, 2020 • edited Loading

RalfJung commented Aug 17, 2020

RalfJung commented Feb 16, 2020 •

edited

Loading

thomcc commented Apr 14, 2020 •

edited

Loading

RalfJung commented Apr 14, 2020 •

edited

Loading