running miri on rustc's test suite (run-pass) #55

oli-obk · 2016-09-15T14:06:47Z

output of

time MIRI_SYSROOT=~/.xargo/HOST MIRI_RUSTC_TEST=../rust/src/test/run-pass cargo run --release --manifest-path="rustc_tests/Cargo.toml" 2> cargo_test_output.txt > /dev/null

~~1617 success, 558 no mir, 205 crate not found, 158 failed~~
~~1630 success, 560 no mir, 205 crate not found, 143 failed~~
~~1632 success, 561 no mir, 205 crate not found, 140 failed~~
~~1632 success, 562 no mir, 205 crate not found, 139 failed~~
~~1633 success, 563 no mir, 205 crate not found, 137 failed~~
~~1638 success, 554 no mir, 205 crate not found, 141 failed~~
~~1864 success, 211 no mir, 205 crate not found, 258 failed~~
~~1867 success, 202 no mir, 205 crate not found, 264 failed~~
~~1868 success, 202 no mir, 205 crate not found, 94 failed, 43 C fn, 9 ABI, 102 unsupported, 5 intrinsic~~
~~1878 success, 201 no mir, 205 crate not found, 58 failed, 42 C fn, 9 ABI, 121 unsupported, 5 intrinsic~~
~~1883 success, 201 no mir, 205 crate not found, 55 failed, 42 C fn, 9 ABI, 121 unsupported, 6 intrinsic~~
~~1885 success, 201 no mir, 205 crate not found, 53 failed, 42 C fn, 9 ABI, 121 unsupported, 6 intrinsic~~
~~1886 success, 201 no mir, 205 crate not found, 52 failed, 42 C fn, 9 ABI, 121 unsupported, 6 intrinsic~~
~~1902 success, 202 no mir, 206 crate not found, 60 failed, 44 C fn, 0 ABI, 122 unsupported, 6 intrinsic~~
~~1901 success, 202 no mir, 206 crate not found, 54 failed, 44 C fn, 9 ABI, 122 unsupported, 6 intrinsic~~
~~1904 success, 202 no mir, 206 crate not found, 49 failed, 44 C fn, 9 ABI, 122 unsupported, 6 intrinsic~~
~~1905 success, 202 no mir, 206 crate not found, 52 failed, 44 C fn, 9 ABI, 122 unsupported, 2 intrinsic~~
~~1907 success, 202 no mir, 206 crate not found, 52 failed, 44 C fn, 9 ABI, 122 unsupported, 0 intrinsic~~
~~1913 success, 202 no mir, 206 crate not found, 50 failed, 44 C fn, 9 ABI, 122 unsupported, 0 intrinsic~~
~~1933 success, 201 no mir, 209 crate not found, 49 failed, 38 C fn, 0 ABI, 122 unsupported, 6 intrinsic~~
~~1985 success, 206 no mir, 219 crate not found, 62 failed, 40 C fn, 0 ABI, 122 unsupported, 6 intrinsic~~
~~2016 success, 208 no mir, 225 crate not found, 57 failed, 34 C fn, 0 ABI, 123 unsupported, 10 intrinsic~~
~~2022 success, 208 no mir, 225 crate not found, 51 failed, 34 C fn, 0 ABI, 123 unsupported, 10 intrinsic~~
~~2025 success, 208 no mir, 225 crate not found, 48 failed, 34 C fn, 0 ABI, 123 unsupported, 10 intrinsic~~
~~2179 success, 2 no mir, 235 crate not found, 68 failed, 145 C fn, 0 ABI, 15 unsupported, 6 intrinsic~~
~~2179 success, 3 no mir, 235 crate not found, 73 failed, 42 C fn, 0 ABI, 112 unsupported, 6 intrinsic~~
~~2188 success, 14 no mir, 236 crate not found, 79 failed, 44 C fn, 0 ABI, 113 unsupported, 6 intrinsic~~
2457 success, 5 no mir, 252 crate not found, 100 failed, 44 C fn, 0 ABI, 118 unsupported, 14 intrinsic

Tracking the state in gist from now on, so we can see diffs

https://gist.github.com/oli-obk/5a0832eef3124ad9088748fc9e759318

The text was updated successfully, but these errors were encountered:

solson · 2016-09-16T00:27:10Z

I have implementations for atomic_load, atomic_store, and atomic_xadd_relaxed which don't really do anything atomic, but it shouldn't matter when we don't have actual parallelism. A few others like the relaxed/non-relaxed variants and volatile_{load,store} should be just as easy.

I'll submit them soon and see how the test results change.

oli-obk · 2016-09-20T14:00:05Z

I implemented the missing non atomic* intrinsics. Rustc seriously needs more tests, every intrinsic should at least have a run-pass test...

oli-obk · 2016-09-21T13:21:24Z

use of fn(usize) -> Foo as fn(usize) -> u32
use of fn(&isize) -> Option<&isize> as fn(&isize) -> &isize

These are used when working with ffi... Should we support these cases?
I mean, any &isize can be transmuted to Option<&isize> and will be perfectly fine. Afaik this is guaranteed.

oli-obk · 2016-11-13T21:00:01Z

to pass more tests we need to implement panic and print, some minor bugs are left, but these make up most of our failed tests.

eddyb · 2016-11-13T21:14:19Z

@oli-obk Any chance you could gist the entire test output? Assuming it contains both names and messages, it would let people like me to idly rummage through some of the maybe-UB tests.

oli-obk · 2016-11-13T21:21:49Z

~~cargo_test_output.txt~~

see in first post for frequently updating file

oli-obk · 2016-11-13T21:34:23Z

https://github.com/rust-lang/rust/blob/master/src/test/run-pass/mir_raw_fat_ptr.rs#L129 compares pointers into different allocs ( same in raw_fat_ptr.rs and mir_raw_fat_ptr.rs)

solson · 2016-11-13T21:39:01Z

I believe we could impose an ordering between unrelated allocations, but it wouldn't really be based on anything (it would just have to be deterministic).

I've seen a C interpreter that essentially has an arbitrary deterministic ordering, if I recall correctly.

eddyb · 2016-11-13T21:44:29Z

@solson Not in CTFE mode, though, right?

solson · 2016-11-13T21:45:06Z

@eddyb No, even in CTFE mode, if we can guarantee determinism.

oli-obk · 2016-11-13T21:46:13Z

That's gonna expose some internals and differ depending on the mir optimizations that are run before evaluation.

eddyb · 2016-11-13T21:47:21Z

@solson If that determinism depends on the order things were evaluated in, that's not deterministic enough. It should be impossible for constant evaluation to observe any past interactions.
So maybe it'd make sense for "stack memory"? Not worth risking it though.

solson · 2016-11-13T21:47:24Z

It doesn't really expose anything. It's fine if it differs, so long as it runs the same given the same inputs (sources, flags, etc).

solson · 2016-11-13T21:47:56Z

@eddyb Yeah, it wouldn't work if you re-use the same uncleared Memory for different runs in some manner.

solson · 2016-11-13T21:48:33Z

I agree it sounds too risky for CTFE now.

eddyb · 2016-11-13T21:49:11Z

so long as it runs the same given the same inputs

Nope, that's not enough, the "inputs" are the MIR to evaluate and its generic parameters, not the whole compilation. I've described how getting this wrong even one bit can impact coherence and thus soundness.
Presumably there'd be some global Memory to cache results but its state should not be observable.

oli-obk · 2016-11-15T14:57:20Z

I updated the top post with a list of the missing MIRs and a list of all panics happening inside miri. I also attached an updated raw output file.

solson · 2017-02-07T09:19:33Z

@oli-obk Can you re-run and update the OP? I was having some odd problems running it, but I can investigate stuff based on your results for now.

oli-obk · 2017-02-07T09:47:30Z

sure

oli-obk · 2017-02-07T10:15:58Z

I think I broke it in 8b8c743

fixed it, report and PR coming in around an hour.

oli-obk · 2017-02-07T10:42:02Z

updated log

solson · 2017-02-07T12:01:40Z

There are a number of "invalid enum discriminant value read" results in your log that I can't reproduce. When I run those tests, they succeed.

oli-obk · 2017-02-07T12:23:09Z

our rustc versions and/or test suite may differ due to different checkouts. My checkout is from February 2nd. My rustc is rustc 1.16.0-nightly (24055d0f2 2017-01-31)

solson · 2017-02-07T12:41:34Z

rustc 1.17.0-nightly (ea7a6486a 2017-02-04) with the latest checkout here. Maybe your packed struct PR a few days ago fixed things?

oli-obk · 2017-02-07T12:44:54Z

I'll investigate

oli-obk · 2017-02-07T13:58:12Z

ok, so updating rustc and the checkout fixed these bugs, so it must be something rustc changed that fixed it and not us, because I used the same miri code base. Regenerating log...

oli-obk · 2017-02-07T16:32:55Z

updated log.

oli-obk · 2017-02-09T08:52:54Z

every NO MIR FOR ...panic... is a test failure.

     17 NO MIR FOR `std::panicking::rust_panic_with_hook`
      7 NO MIR FOR `std::rt::begin_panic_fmt`
      7 NO MIR FOR `std::panicking::panicking`

some of these are useless... like a test testing --test compileflag, which we simply ignore and try to run the main function, which simply panics.

rerunning after 1 > -1 is true again, since that fixes a few of those.

gnzlbg · 2017-02-09T15:41:00Z

Is there a plan for supporting inline assembly?

solson · 2017-02-09T15:43:39Z

@gnzlbg No.

gnzlbg · 2017-02-09T15:44:08Z

So there will never be a way to work around that?

gnzlbg · 2017-02-09T15:45:20Z

If rust code could detect that is being run at compile-time, and branch on it, it could provide a different implementation of some functionality without features that miri will probably never understand (like inline assembly, SIMD intrinsics, llvm intrinsics, etc).

solson · 2017-02-09T15:47:22Z

@gnzlbg Yeah, I refer to that idea as #[cfg(const)]. I guess bring it up again once we have RFCs for the more advanced const features Miri will enable.

gnzlbg · 2017-02-09T15:49:38Z

Cool! Something like #[cfg(const)] would be a really pragmatic hack!

solson · 2017-02-09T23:26:47Z

@oli-obk It'd be nice to split the "error" count in the OP into true errors and "unsupported", so things like lack of threading don't inflate the error count. I'd also prefer to have them in separate lists if possible.

eddyb · 2017-02-09T23:39:02Z

#[cfg(const)] is syntactical whereas this needs to decide based on what executes it

solson · 2017-02-09T23:40:24Z

Good point, it wouldn't actually use cfg.

oli-obk · 2017-03-14T09:17:53Z

fun fact: over the last month miri has gotten faster by a factor of two. At least the rustc test suite is finished twice as fast (number of tests went up! not down). This might either be due to improvements in rustc or due to improvements in miri.

oli-obk · 2017-06-21T06:25:48Z

I'm running this now with full std mir enabled. We seem to have some previously unknown issues. But at least "no mir" errors have gone down to 2.

eddyb · 2017-06-21T09:13:09Z

@oli-obk If you look at your 2 "MIR not found" errors, I'm pretty sure the :::: is from extern {...}, so these are most likely FFI.

RalfJung · 2017-07-12T06:50:14Z

Is there any way to run miri on the tests (#[test]) embedded in e.g. libstd? Most of the functional tests are actually such embedded tests, not run-pass tests.

oli-obk · 2017-07-12T06:55:37Z

hmm... I think I implemented this in cargo miri. But I'm not sure how to run that inside the rustc build system.

RalfJung · 2019-02-23T10:30:43Z

Also see https://github.com/RalfJung/miri-test-libstd/issues/1

oli-obk mentioned this issue Nov 13, 2016

ensure that integers cast to pointers will never point at a valid alloc, not even the zst alloc #81

Merged

oli-obk mentioned this issue Dec 15, 2016

tuple struct constructors and uninitialized fields #96

Closed

solson mentioned this issue Feb 7, 2017

recursive static initialization is broken #120

Closed

oli-obk mentioned this issue Feb 10, 2017

autogenerate markdown for rustc test suite result #137

Merged

oli-obk mentioned this issue Mar 14, 2017

rustup to rustc 1.17.0-nightly (60a0edc6c 2017-02-26) #147

Merged

oli-obk mentioned this issue Apr 25, 2017

Also test subdirectories of rust/src/test/run-pass #160

Merged

oli-obk added the C-project Category: a larger project is being tracked here, usually with checkmarks for individual steps label Aug 10, 2017

RalfJung added the C-enhancement Category: a PR with an enhancement or an issue tracking an accepted enhancement label Nov 17, 2018

RalfJung closed this as completed Feb 23, 2019

RalfJung reopened this Feb 23, 2019

RalfJung added the A-tests Area: affects our test suite or CI label Mar 8, 2019

RalfJung removed the C-enhancement Category: a PR with an enhancement or an issue tracking an accepted enhancement label Apr 8, 2019

RalfJung added C-proposal Category: a proposal for something we might want to do, or maybe not; details still being worked out and removed C-project Category: a larger project is being tracked here, usually with checkmarks for individual steps labels Jul 1, 2019

RalfJung mentioned this issue Apr 19, 2020

Define UB in float-to-int casts to saturate rust-lang/rust#71269

Merged

oli-obk closed this as completed Apr 23, 2023

running miri on rustc's test suite (run-pass) #55

running miri on rustc's test suite (run-pass) #55

Comments

oli-obk commented Sep 15, 2016 • edited Loading

solson commented Sep 16, 2016

oli-obk commented Sep 20, 2016

oli-obk commented Sep 21, 2016

oli-obk commented Nov 13, 2016

eddyb commented Nov 13, 2016

oli-obk commented Nov 13, 2016 • edited Loading

oli-obk commented Nov 13, 2016 • edited Loading

solson commented Nov 13, 2016 • edited Loading

eddyb commented Nov 13, 2016

solson commented Nov 13, 2016

oli-obk commented Nov 13, 2016

eddyb commented Nov 13, 2016

solson commented Nov 13, 2016

solson commented Nov 13, 2016

solson commented Nov 13, 2016

eddyb commented Nov 13, 2016

oli-obk commented Nov 15, 2016

solson commented Feb 7, 2017

oli-obk commented Feb 7, 2017

oli-obk commented Feb 7, 2017

oli-obk commented Feb 7, 2017

solson commented Feb 7, 2017

oli-obk commented Feb 7, 2017

solson commented Feb 7, 2017

oli-obk commented Feb 7, 2017

oli-obk commented Feb 7, 2017

oli-obk commented Feb 7, 2017

oli-obk commented Feb 9, 2017

gnzlbg commented Feb 9, 2017 • edited Loading

solson commented Feb 9, 2017

gnzlbg commented Feb 9, 2017

gnzlbg commented Feb 9, 2017

solson commented Feb 9, 2017

gnzlbg commented Feb 9, 2017

solson commented Feb 9, 2017

eddyb commented Feb 9, 2017

solson commented Feb 9, 2017 • edited Loading

oli-obk commented Mar 14, 2017

oli-obk commented Jun 21, 2017

eddyb commented Jun 21, 2017

RalfJung commented Jul 12, 2017

oli-obk commented Jul 12, 2017

RalfJung commented Feb 23, 2019

oli-obk commented Sep 15, 2016 •

edited

Loading

oli-obk commented Nov 13, 2016 •

edited

Loading

oli-obk commented Nov 13, 2016 •

edited

Loading

solson commented Nov 13, 2016 •

edited

Loading

gnzlbg commented Feb 9, 2017 •

edited

Loading

solson commented Feb 9, 2017 •

edited

Loading