"incorrect use of uninit memory" lint should fire for multi-variant enum #73608

RalfJung · 2020-06-22T09:21:27Z

I tried this code:

#[derive(Copy, Clone, Debug)]
enum Fruit {
    Apple,
    _Banana,
}

fn foo() -> Fruit {
    unsafe {
        let mut r = mem::uninitialized();
        ptr::write(&mut r as *mut Fruit, Fruit::Apple);
        r
    }
}

I expected to see this happen: It should warn about Fruit not working with mem::uninitialized.

Instead, this happened: There is no diagnostic.

The problem here is that not all enums should cause the warning. If the enum only has one variant, being uninitialized could actually be fine (but variants that are uninhabited and ZST do not count). This is easy to check at run-time where we have the full layout (so we panic), but statically we just have the type and that makes it harder.

As a start, we could count the fieldless variants; if there are at least 2 then the enum definitely needs a tag and thus may not remain uninitialized.

Cc @stepancheg

The text was updated successfully, but these errors were encountered:

lcnr · 2020-06-22T10:13:43Z

I was shortly confused why one variant enums can be okay, so here's a short example:

enum Test {
    Variant(u8),
}

Note that we should also not warn on an enum with #[repr(uN)] and 2^N variants. https://play.rust-lang.org/?version=nightly&mode=debug&edition=2018&gist=425e29d79a26d57552d35a212126552b

RalfJung · 2020-06-22T11:06:40Z

I was shortly confused why one variant enums can be okay, so here's a short example:

To be clear, this is not a guarantee. It's just how we currently represent enums. It might become UB in the future to leave such an enum uninit. Right now, it is a case of incorrectly relying on unspecified data layout.

Note that we should also not warn on an enum with #[repr(uN)] and 2^N variants.

That is not correct, we should warn for such cases. Uninitialized memory is different from memory having some arbitrary value. So when we say "the tag has to encode a valid discriminant", then even if every initialized bit pattern encodes a valid discriminant, that does not mean that the uninitialized bit pattern encodes a valid discriminant.

RalfJung · 2020-06-22T11:30:19Z

To be clear, this is not a guarantee. It's just how we currently represent enums. It might become UB in the future to leave such an enum uninit. Right now, it is a case of incorrectly relying on unspecified data layout.

In light of this, I wonder if we should just warn about all enums, to avoid relying on such unspecified details? However that would mean that we'd warn about cases that are not UB... so we should likely at least make the message different.

lcnr · 2020-06-22T11:33:38Z

Don't both cases have the same reasoning though? I don't quite see why you do not want to warn on enum Test { Variant(u8) } while warning on 2^N variants, considering that both rely on (very similar) unspecified behavior.

This is easy to check at run-time where we have the full layout (so we panic)

Afaict we check this during code-gen, not at run-time. So we could emit post monomorphization warnings here, which we probably do not want, e.g.

use std::{mem, ptr};

unsafe trait Foo {
    const IS_POD: bool;
}

unsafe impl Foo for bool {
    const IS_POD: bool = false;
}

unsafe impl Foo for u8 {
    const IS_POD: bool = true;
}

fn bar<T: Foo + Default>() -> T {
    unsafe {
        if T::IS_POD {
            let mut x = mem::uninitialized();
            ptr::write(&mut x, T::default());
            x
        } else {
            T::default()
        }
    }
}

fn main() {
    let _: bool = bar();
    let _: u8 = bar();
}

lcnr · 2020-06-22T11:42:27Z

To be clear, this is not a guarantee. It's just how we currently represent enums. It might become UB in the future to leave such an enum uninit. Right now, it is a case of incorrectly relying on unspecified data layout.

In light of this, I wonder if we should just warn about all enums, to avoid relying on such unspecified details? However that would mean that we'd warn about cases that are not UB... so we should likely at least make the message different.

Isn't enum Foo { Variant(u8) } also UB except that it currently can't be triggered?

There is obviously a somewhat important difference between theoretical and "actual" UB here, which is probably also the line we draw when deciding if the call itself should panic. IMO we should warn for anything which is "theoretically" UB while panicking in cases which are currently known to sometimes to cause problems.

RalfJung · 2020-06-22T11:50:13Z

Afaict we check this during code-gen, not at run-time. So we could emit post monomorphization warnings here, which we probably do not want, e.g.

We do, sorry for the vagueness. But indeed post-monomorphization time errors would likely not even show up in the right crate here, and they would break proper run-time tests as you noted.

Don't both cases have the same reasoning though? I don't quite see why you do not want to warn on enum Test { Variant(u8) } while warning on 2^N variants, considering that both rely on (very similar) unspecified behavior.

No, it's very different. Your single-variant enum is (under current rustc layout) equivalent to u8 itself. It is right that uninitialized u8 is currently deemed UB by the reference, but whether or not this is really what we want is discussed in rust-lang/unsafe-code-guidelines#71.

For enums with a tag, I have not seen anyone suggest to permit them being uninitialized under any circumstances. It would we an awful special case in our semantics, and I don't see any reason to do this.

IMO we should warn for anything which is "theoretically" UB

That is not currently our approach e.g. for uninitialized u8.

RalfJung added the C-bug Category: This is a bug. label Jun 22, 2020

LeSeulArtichaut added A-lint Area: Lints (warnings about flaws in source code) such as unused_mut. C-enhancement Category: An issue proposing an enhancement or a PR with one. and removed C-bug Category: This is a bug. labels Jun 22, 2020

JohnTitor added the T-compiler Relevant to the compiler team, which will review and decide on the PR/issue. label Jul 5, 2020

RalfJung mentioned this issue Jul 17, 2020

warn about uninitialized multi-variant enums (invalid_value lint) #74438

Merged

bors closed this as completed in cdedae8 Jul 18, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

"incorrect use of uninit memory" lint should fire for multi-variant enum #73608

"incorrect use of uninit memory" lint should fire for multi-variant enum #73608

RalfJung commented Jun 22, 2020 •

edited

Loading

lcnr commented Jun 22, 2020 •

edited

Loading

RalfJung commented Jun 22, 2020 •

edited

Loading

RalfJung commented Jun 22, 2020

lcnr commented Jun 22, 2020

lcnr commented Jun 22, 2020 •

edited

Loading

RalfJung commented Jun 22, 2020 •

edited

Loading

"incorrect use of uninit memory" lint should fire for multi-variant enum #73608

"incorrect use of uninit memory" lint should fire for multi-variant enum #73608

Comments

RalfJung commented Jun 22, 2020 • edited Loading

lcnr commented Jun 22, 2020 • edited Loading

RalfJung commented Jun 22, 2020 • edited Loading

RalfJung commented Jun 22, 2020

lcnr commented Jun 22, 2020

lcnr commented Jun 22, 2020 • edited Loading

RalfJung commented Jun 22, 2020 • edited Loading

RalfJung commented Jun 22, 2020 •

edited

Loading

lcnr commented Jun 22, 2020 •

edited

Loading

RalfJung commented Jun 22, 2020 •

edited

Loading

lcnr commented Jun 22, 2020 •

edited

Loading

RalfJung commented Jun 22, 2020 •

edited

Loading