Fixed const propagation not working in a particular scenario #71298

felix91gr · 2020-04-18T18:51:30Z

As shown here, constant propagation is missing at least the following scenario:

let x = 0;
let y = f(x);

It is currently being const propagated like this:

let x = 0;
let y = f(x);

And it should behave like this:

let x = 0;
let y = f(0);

This PR addresses this scenario, and changes the result of two other mir-opt tests. I will need help figuring out if those tests cases are being further optimized, or if I ran into a regression.

(CC @oli-obk)

rust-highfive · 2020-04-18T18:51:33Z

Thanks for the pull request, and welcome! The Rust team is excited to review your changes, and you should hear from @matthewjasper (or someone else) soon.

If any changes to this PR are deemed necessary, please add them as extra commits. This ensures that the reviewer can see what has changed since they last reviewed the code. Due to the way GitHub handles out-of-date commits, this should also make it reasonably obvious what issues have or haven't been addressed. Large or tricky changes may require several passes of review and changes.

Please see the contribution instructions for more information.

felix91gr · 2020-04-18T18:57:14Z

Some notes on this PR for whoever is reading and wants some explanations about the changes:

https://github.com/rust-lang/rust/pull/71298/files#diff-9e103702275cbef342c2decd3395bf3bR759

This was added in order to have LocalKind information outside of the context of CanConstProp::check, which was important to this commit.

https://github.com/rust-lang/rust/pull/71298/files#diff-9e103702275cbef342c2decd3395bf3bR765

I originally made LocalKind implement Clone with a derive macro, but for some reason that broke rustc. I reverted those changes, and implemented a cloning function alongside the code that needed it, and that unbroke the compiler.

No idea why the Clone derive made it break, though.

https://github.com/rust-lang/rust/pull/71298/files#diff-9e103702275cbef342c2decd3395bf3bL775

Not being able to const-prop over Vars was too much of a restriction. Lifting it was necessary to make this test case pass.

felix91gr · 2020-04-18T19:12:44Z

Sorry, I'm super rusty with rustc commands (pun not intended). Now all my changes are up.

https://github.com/rust-lang/rust/pull/71298/files#diff-1542539b0bf77ee645d84f829c7d8605L25

This test changed its output. It seems to me that it's now more performant (basically, it's being optimized better now). Please help me here though, because I'm not 100% sure.

https://github.com/rust-lang/rust/pull/71298/files#diff-c34947eb53c76084c6a717a62550884f

I definitely need help here. These MIR changes are way too complicated for my current understanding of MIR >.<

Did I cause a regression, or is this test also optimizing better now?

felix91gr · 2020-04-18T19:38:46Z

Missed rustfmt. Brb with correct format

felix91gr · 2020-04-19T00:06:56Z

I'm working on a fix for the errors shown in CI. Will be back as soon as my computer finishes running the test suite again!

wesleywiser

Thanks for the PR! I left a few comments. Let me know if you have any questions.

src/test/mir-opt/const_prop/usize_literal_propagation/rustc.main.ConstProp.diff

src/librustc_mir/transform/const_prop.rs

wesleywiser · 2020-04-19T03:16:11Z

r? @wesleywiser

felix91gr · 2020-04-19T03:51:07Z

Kay, uploaded the changes that let the old tests pass again (const-prop was now detecting UB at compile time in 3 of them <3)

felix91gr · 2020-04-19T04:42:10Z

What IS that CI failure? Its output is so weird. It somehow expects ints with 8 0's instead of 16 0's. Is this a type inference problem, or is it perhaps 32 vs 64 bit thing? 🤔

wesleywiser · 2020-04-20T00:57:51Z

Yeah, it looks like the tests are failing on 32-bit ARM. For the src/test/mir-opt/const_prop/usize_literal_propagation.rs and src/test/mir-opt/simplify-arm-identity.rs, I think you just need to add an annotation to tell the test harness to use separate outputs on 32 and 64 bit:

rust/src/test/mir-opt/const_prop/array_index.rs

Line 1 in dbf8b6b

// EMIT_MIR_FOR_EACH_BIT_WIDTH

wesleywiser · 2020-04-20T01:34:45Z

Sorry, I forgot, you'll also need to --bless the tests to get the files to be generated. You'll need to do that for both 64-bit and 32-bit architectures. Something like this should work:

For 64-bit:

./x.py test --stage 1 -i --bless src/test/mir-opt

For 32-bit:

./x.py test --stage 1 -i --bless --target i686-unknown-linux-gnu src/test/mir-opt

(Assuming you're on Linux. You may also need to install some 32-bit compatibility libraries like this mentions.)

felix91gr · 2020-04-20T01:44:40Z

Sorry, I forgot, you'll also need to --bless the tests to get the files to be generated. You'll need to do that for both 64-bit and 32-bit architectures. Something like this should work:

For 64-bit:
./x.py test --stage 1 -i --bless src/test/mir-opt
For 32-bit:
./x.py test --stage 1 -i --bless --target i686-unknown-linux-gnu src/test/mir-opt
(Assuming you're on Linux. You may also need to install some 32-bit compatibility libraries like this mentions.)

Got it! (I think, that seems fairly straightforward). And yes, I'm on Linux :3

src/librustc_mir/transform/const_prop.rs

wesleywiser · 2020-04-20T19:40:38Z

You might want to rebase when you get a chance so that CI will run and we can see if this improved the failing UI tests (it should have).

felix91gr · 2020-04-21T00:23:38Z

(I'm recompiling everthing because the repo got into some weird state and I decided to reset git's memory, in a way, by cloning again at these commits) (therefore, I'm basically offline until I manage to recompile everything :v)

felix91gr · 2020-04-21T01:38:24Z

Kay, re-blessed those two tests. I checked the output manually and whatever they're doing, I think those programs are gonna have the same output as their predecessors. Now on to checking if the rest of the test suite is a-ok

felix91gr · 2020-04-21T02:00:05Z

Oh wow, those failures are weird. What do they even mean?

felix91gr · 2020-04-21T02:09:01Z

Okay, I'll have to generate the MIR for those snippets, or I'll have no idea of what to do. They look very arcane to me (like read_unaligned and stuff, or like let a = a as *const isize as usize, what does that line even mean?)

But hey, on the good side: only 2 tests failed.

felix91gr · 2020-04-21T02:58:35Z

Oh, okay, that's not entirely accurate. Only 2 fail, but on the --stage 1 command. It seems that the full ./x.py test run has a lot more failing tests. That's great, I can probably debug some of those :D

felix91gr · 2020-04-21T03:56:10Z

Okay, wait... I don't get it. There are errors for which the test complains that the stderr changed, but it hasn't really changed? What is happening. This is so weird.

For example, the first failing test gives this output. And for good measure, these are the contents of the .stderr file that was saved after running it.

They're all the same? What's going on? 🤔

felix91gr · 2020-04-21T03:56:33Z

(@wesleywiser?)

wesleywiser · 2020-04-21T13:36:11Z

I also am very confused. What may be happening is that there's a miscompilation introduced by these changes which is affecting the test harness itself and causing strange issues like this.

wesleywiser · 2020-04-24T08:17:29Z

@felix91gr Would you mind opening another PR with just commits 1, 2 and 3 in it? We can test there if the issue is propagating vars or the function args change.

felix91gr · 2020-04-24T15:55:54Z

Sure! Lemme make a couple of new branches, will be back with two testing PRs :3

src/librustc_mir/transform/const_prop.rs

…h `locals_of_current_block`

felix91gr · 2020-04-26T02:03:56Z

The two changes are up!

What I did to dodge the aliasing of self in the call to remove_const was basically to convert remove_const into an associated function. That way, we give it only the borrows it actually needs - and can safely iterate over the BitSet without having to clone it.

wesleywiser · 2020-04-26T02:48:36Z

@bors try @rust-timer queue

rust-timer · 2020-04-26T02:48:37Z

Awaiting bors try build completion

bors · 2020-04-26T02:48:47Z

⌛ Trying commit 6e586b8 with merge 075d9ec4e3198749f4bb6e6f3bb2b29b8eceb815...

bors · 2020-04-26T05:06:47Z

☀️ Try build successful - checks-azure
Build commit: 075d9ec4e3198749f4bb6e6f3bb2b29b8eceb815 (075d9ec4e3198749f4bb6e6f3bb2b29b8eceb815)

rust-timer · 2020-04-26T05:06:48Z

Queued 075d9ec4e3198749f4bb6e6f3bb2b29b8eceb815 with parent 0862458, future comparison URL.

rust-timer · 2020-04-26T07:37:15Z

Finished benchmarking try commit 075d9ec4e3198749f4bb6e6f3bb2b29b8eceb815, comparison URL.

oli-obk · 2020-04-26T09:46:27Z

All of that regression time is spent in LTO, because we go from 16 codegen units to 44!

So technically this PR is now an improvement, because it allows rustc to split the crate into more codegen units?

Idk... is optimizing for LTO compilation speed actually something we care about?

felix91gr · 2020-04-26T15:08:13Z

All of that regression time is spent in LTO, because we go from 16 codegen units to 44!

Yay!

So technically this PR is now an improvement, because it allows rustc to split the crate into more codegen units?

Wait, I'm not sure I'm understanding you correctly. You mean that:

We are now generating sort of an excess of codegen units (which need to be re-tuned in the codegen step because it's a local maxima as Wesley pointed out),
Or that generating extra codegen units allows the compiler to optimize the code better due to LTO?

Idk... is optimizing for LTO compilation speed actually something we care about?

Probably, but not at this stage. Specially if we are to fully decouple the compiler into libraries in the future. My guess is that we want to optimize LTO compilation speed, but only starting at the codegen stage and not before.

Because MIR is at the end of the day going to have to do what MIR's going to have to do, right? Be a place for optimizations that benefit all Rust backends, a place for static analysis and a place for compile-time computation, right?

I think we are doing the right thing here :)

wesleywiser · 2020-04-26T17:30:32Z

So technically this PR is now an improvement, because it allows rustc to split the crate into more codegen units?

I think what's actually happening is that before, the patched incremental test had 16 codegen units get regenerated and now 44 are being regenerated. I believe the total number of codgen units is directly tied to the number of modules in the crate which for our perf tests, is always the same. So this is a clear regression.

I think unfortunately at this time we can't merge this exactly as is because of the large incremental regression and no correspondingly large improvements. However, this PR definitely moves us closer to our end goal of high-quality MIR optimizations, it's just currently hampered by other parts of the compiler. I'm also very sympathetic to the clippy use case the original issue there completed.

At this time, we can do this:

Merge Const-prop bugfix: only add propagation inside own block for user variables #71518 as-is after doing one last perf run to make sure it's still performance neutral.
Merge the function call propagation under mir-opt-level=2. Rust code normally compiles under mir-opt-level=1 which is the default but nightly compilers can do "unstable" MIR optimizations which are enabled at levels 2 & 3. Since this code works fine but causes performance regressions, we'll gate it under mir-opt-level=2 being requested.

Clippy unfortunately won't see the effects of point 2 because it doesn't use unstable optimizations. However, it should be relatively easy to add some logic to clippy that does a very simple version of const-prop and would resolve your issue. After point 1 is merged, I think all you will need to do is look for any locals in a Body which are assigned to once and remember the right-hand side value of their assignment. Then when inspecting function calls, substitute known values for those locals. I'm not really familiar with clippy internals so @oli-obk will have to guide you there but I think that is very do-able.

Let me know if you have any questions and I'll be happy to try to answer them 🙂

felix91gr · 2020-04-26T20:43:29Z

However, this PR definitely moves us closer to our end goal of high-quality MIR optimizations, it's just currently hampered by other parts of the compiler. I'm also very sympathetic to the clippy use case the original issue there completed.

💓

Merge the function call propagation under mir-opt-level=2. Rust code normally compiles under mir-opt-level=1 which is the default but nightly compilers can do "unstable" MIR optimizations which are enabled at levels 2 & 3. Since this code works fine but causes performance regressions, we'll gate it under mir-opt-level=2 being requested.

How does that work? I'm guessing clippy can't call upon a miri instance to run opt-level-2 optimizations either, right? Like, basically that opt. is 100% blocked from stable venues until upgraded to opt-level-1.

Another idea is that maybe this can be gated in Clippy as a nightly-only feature, as well. So that we don't write new code just because we want the feature to appear on stable right now.

But if you say it's actually easy, maybe we can give it a try. @oli-obk how straightforward do you think it'll be to implement the lint if we have (1) but not (2) on hand?

felix91gr · 2020-04-26T20:50:00Z

Let me know if you have any questions and I'll be happy to try to answer them 🙂

@wesleywiser yes!

How can I help with the codegen issue? Is there a working group or something like that?
How long do you reckon it will take us to sort this out? Like, if we're pessimistic and multiply the time x8 or so just to be sure to go over- and not under-.
Oli mentioned that there will be a meeting about this. Is it going to be public? Can I attend? Even to just listen to the conversation is enough, although I'd love to be able to ask questions if they arise.
Do we have a list of MIR-opts we want to implement, somewhere? I wanna keep working on this. Either in docs (which as I understand stuff, I plan on writing them down), in book keeping or in implementation.

wesleywiser · 2020-04-27T00:34:39Z

How can I help with the codegen issue? Is there a working group or something like that?

We're not actually sure yet. We've noticed more performance issues cropping up because of the current algorithm so it's something we need to look into soon. Our next step is to hold a planning meeting with the rest of the compiler team. I'll be sure to ping you when I fill out the meeting issue template so you know when that happens!

How long do you reckon it will take us to sort this out?

I'm not really sure. I suspect there's some improvements to the current algorithm we could find and land in the next few months. Finding a significantly better algorithm will probably take much longer.

Oli mentioned that there will be a meeting about this. Is it going to be public? Can I attend? Even to just listen to the conversation is enough, although I'd love to be able to ask questions if they arise.

Yes, absolutely! All of the compiler team triage and design meetings happen on the Rust Zulip instance. Triage meetings are Thursday mornings at 10am US Eastern time. Design meetings are Friday mornings at 10am US Eastern time. Anyone is welcome to attend either.

Do we have a list of MIR-opts we want to implement, somewhere? I wanna keep working on this. Either in docs (which as I understand stuff, I plan on writing them down), in book keeping or in implementation.

There's a few places I'd look:

The A-mir-opt issue label on GitHub: https://github.com/rust-lang/rust/issues?q=is%3Aopen+is%3Aissue+label%3AA-mir-opt
Any FIXME or TODO comments in src/librustc_mir/transforms
I'd also suggest hanging out in the wg-mir-opt stream on Zulip: https://rust-lang.zulipchat.com/#narrow/stream/189540-t-compiler.2Fwg-mir-opt

…k_prop, r=wesleywiser Const-prop bugfix: only add propagation inside own block for user variables A testing spinoff of rust-lang#71298. This one only adds the const-prop for locals that are user variables.

felix91gr · 2020-04-30T17:28:42Z

Closed since #71697 takes the baton on the conversation and remaining points to implement ✨

rust-highfive assigned matthewjasper Apr 18, 2020

rust-highfive added the S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. label Apr 18, 2020

This comment has been minimized.

Sign in to view

wesleywiser reviewed Apr 19, 2020

View reviewed changes

src/test/mir-opt/const_prop/usize_literal_propagation/rustc.main.ConstProp.diff Outdated Show resolved Hide resolved

src/librustc_mir/transform/const_prop.rs Outdated Show resolved Hide resolved

src/librustc_mir/transform/const_prop.rs Outdated Show resolved Hide resolved

rust-highfive assigned wesleywiser and unassigned matthewjasper Apr 19, 2020

This comment has been minimized.

Sign in to view

wesleywiser reviewed Apr 20, 2020

View reviewed changes

This comment has been minimized.

Sign in to view

This was referenced Apr 24, 2020

Const-prop bugfix: only add propagation inside own block for user variables #71518

Merged

Const-prop bugfix: only add propagation into function call arguments #71522

Closed

felix91gr commented Apr 25, 2020

View reviewed changes

src/librustc_mir/transform/const_prop.rs Show resolved Hide resolved

wesleywiser reviewed Apr 25, 2020

View reviewed changes

src/librustc_mir/transform/const_prop.rs Outdated Show resolved Hide resolved

felix91gr added 2 commits April 25, 2020 21:48

Remove unnecessary .clone(), by making remove_const not alias wit…

8a8c8c9

…h `locals_of_current_block`

Simplify fn call const propagation (by wesleywiser)

6e586b8

felix91gr closed this Apr 30, 2020

felix91gr deleted the const_prop_bugfix branch April 30, 2020 17:28

Fixed const propagation not working in a particular scenario #71298

Fixed const propagation not working in a particular scenario #71298

Conversation

felix91gr commented Apr 18, 2020

rust-highfive commented Apr 18, 2020

felix91gr commented Apr 18, 2020 • edited Loading

felix91gr commented Apr 18, 2020

felix91gr commented Apr 18, 2020

This comment has been minimized.

felix91gr commented Apr 19, 2020

wesleywiser left a comment

Choose a reason for hiding this comment

wesleywiser commented Apr 19, 2020

felix91gr commented Apr 19, 2020

felix91gr commented Apr 19, 2020

This comment has been minimized.

wesleywiser commented Apr 20, 2020

wesleywiser commented Apr 20, 2020

felix91gr commented Apr 20, 2020

This comment has been minimized.

wesleywiser commented Apr 20, 2020

This comment has been minimized.

felix91gr commented Apr 21, 2020 • edited Loading

felix91gr commented Apr 21, 2020

felix91gr commented Apr 21, 2020

felix91gr commented Apr 21, 2020

felix91gr commented Apr 21, 2020

felix91gr commented Apr 21, 2020

felix91gr commented Apr 21, 2020

wesleywiser commented Apr 21, 2020

wesleywiser commented Apr 24, 2020

felix91gr commented Apr 24, 2020

felix91gr commented Apr 26, 2020

wesleywiser commented Apr 26, 2020

rust-timer commented Apr 26, 2020

bors commented Apr 26, 2020

bors commented Apr 26, 2020

rust-timer commented Apr 26, 2020

rust-timer commented Apr 26, 2020

oli-obk commented Apr 26, 2020

felix91gr commented Apr 26, 2020 • edited Loading

wesleywiser commented Apr 26, 2020

felix91gr commented Apr 26, 2020

felix91gr commented Apr 26, 2020 • edited Loading

wesleywiser commented Apr 27, 2020

felix91gr commented Apr 30, 2020

felix91gr commented Apr 18, 2020 •

edited

Loading

felix91gr commented Apr 21, 2020 •

edited

Loading

felix91gr commented Apr 26, 2020 •

edited

Loading

felix91gr commented Apr 26, 2020 •

edited

Loading