Perf regression in nightly when compiling massive matches #29227

swgillespie · 2015-10-22T05:03:56Z

The file located here: https://gist.github.com/swgillespie/37b32f7b09ae536df8dc when compiled using rustc rustc_abuse.rs -o rustc_abuse -Z time-passes takes approximately 9 seconds and 40MB of memory to compile on stable and beta, while taking almost a minute and 4GB of memory on nightly.

The Bad Thing being done here is that there are several massive matches. As expected, match checking takes a few seconds, but the real culprit here seems to be "MIR dump", which is where the memory usage peaks. The memory usage is high enough to get my Travis CI build killed that has some code similar to this, but not as extreme.

These numbers were gathered with:

rustc 1.5.0-nightly (4826f9625 2015-10-21)
rustc 1.4.0-beta.3 (20eba406f 2015-10-16)
rustc 1.3.0 (9a92aaf19 2015-09-15).
Mac OS X 10.10.5

The text was updated successfully, but these errors were encountered:

alexcrichton · 2015-10-22T07:42:56Z

cc @rust-lang/compiler, seems like some low hanging fruit perhaps?

nikomatsakis · 2015-10-22T12:21:38Z

Perhaps, yeah. Thanks for the report!

nikomatsakis · 2015-10-22T12:23:02Z

Well, it probably requires tweaking MIR somewhat to have a more general "switch" style statement, as @Aatch suggested at some point.

eddyb · 2015-10-23T07:21:23Z

@nikomatsakis I believe I've seen MIR building turn constant patterns not into plain x == constant but PartialEq::eq(&x, &constant) - wouldn't that be a bug, given that patterns need to be pure?

nikomatsakis · 2015-10-23T15:30:25Z

@eddyb ah yes, I meant to writeup an RFC on this point. I believe the
current pattern behavior (not what MIR does) is incorrect (or at least
undesirable). Certainly it's not something that I fully understood we were
doing. In particular, I think that

match v { CONSTANT => true, _ => false }

and

v == CONSTANT

should always be equivalent, if both of them compile. (I could imagine
that the match version might not always compile). The current behavior of
converting constants into patterns obviously doesn't preserve this
equivalence, which seems pretty wrong to me, but it also doesn't work very
well with things like associated constants.

On Fri, Oct 23, 2015 at 3:21 AM, Eduard-Mihai Burtescu <
notifications@github.com> wrote:

@nikomatsakis https://github.com/nikomatsakis I believe I've seen MIR
building turn constant patterns not into plain x == constant but PartialEq::eq(&x,
&constant) - wouldn't that be a bug, given that patterns need to be pure?

—
Reply to this email directly or view it on GitHub
#29227 (comment).

eddyb · 2015-10-23T16:41:29Z

I don't think there's a way to change that and not break backwards compat, as currently the purity of PartialEq impls is not publically reflected in APIs, and matching always works, even without a PartialEq impl or an impure one.

OTOH, expanding associated constants may require delaying match MIR generation post-monomorphization, at least for the associated constant pattern, not necessarily the whole match.

nikomatsakis · 2015-10-23T19:17:09Z

@eddyb I consider the current behavior a bug, that it is accepting more code than it ought to. But I would want to evaluate the impact of any change, to be sure. Note: I also don't consider purity in matches to be a requirement. After all, I'd like to permit `box` patterns, which do a deref. That is also note pure. We simply cannot expand associated constants, I don't think. I mean technically yes we could wait till after monomorphization, but we'd also have to avoid doing the exhaustiveness checks until that point as well, since they are influenced by the constant-to-pattern expansion.

arielb1 · 2015-10-23T20:00:09Z

@nikomatsakis

I am not sure we want to support non-resolvable associated constants in match, exactly because of this unclarity. However, I am not sure match x { C => {}, _ => {}} into match x { a if a == C => {}, _ => {} } is so terrible.

eddyb · 2015-10-24T07:49:52Z

@nikomatsakis Exhaustivity-wise, we can just be conservative, which might indeed be too much of a restriction for associated constants to be useful at all in matches.

As for the box patterns, we need restrictions there for soundness, and I believe the best way to go about it is to reuse the DST support: the CoerceUnsized<Rc<U>> for Rc<T> impl lets the compiler know where the actual pointer is.
Coupled with a Deref<Target=T> for Rc<T> impl, the compiler can assume...

Actually, scratch that, it was for a different feature (self: Rc<T>) - the pointers the compiler knows about is to an RcBox<T>, not as useful here.
To get box patterns to work, we can just require an unsafe impl<T> PureDeref for Rc<T> {} and be done with it.
Although I wouldn't stabilize that because it could interfere with a potential const impl<T> Deref<Target=T> for Rc<T> {...}.

nikomatsakis · 2015-10-26T19:36:12Z

This is pretty off topic for this issue. I plan to open a discussion thread
in the next day or so and we can hash it out there. :) Alternatively,
#20489 has related discussion.

On Sat, Oct 24, 2015 at 3:50 AM, Eduard-Mihai Burtescu <
notifications@github.com> wrote:

@nikomatsakis https://github.com/nikomatsakis Exhaustivity-wise, we can
just be conservative, which might indeed be too much of a restriction for
associated constants to be useful at all in matches.

As for the box patterns, we need restrictions there for soundness, and I
believe the best way to go about it is to reuse the DST support: the CoerceUnsized<Rc>
for Rc impl lets the compiler know where the actual pointer is.
Coupled with a Deref<Target=T> for Rc impl, the compiler can assume...

Actually, scratch that, it was for a different feature (self: Rc) -
the pointers the compiler knows about is to an RcBox, not as useful
here.
To get box patterns to work, we can just require an unsafe impl
PureDeref for Rc {} and be done with it.
Although I wouldn't stabilize that because it could interfere with a
potential const impl Deref<Target=T> for Rc {...}.

—
Reply to this email directly or view it on GitHub
#29227 (comment).

nikomatsakis · 2015-10-26T19:37:00Z

This was accidentally closed. The issue is not solved, though #29384 is a temporary workaround to prevent regressing the stable release.

nikomatsakis · 2015-10-26T19:37:14Z

(I did start a branch that I THINK will resolve this issue, however.)

Introduce a `SwitchInt` and restructure pattern matching to collect integers and characters into one master switch. This is aimed at #29227, but is not a complete fix. Whereas before we generated an if-else-if chain and, at least on my machine, just failed to compile, we now spend ~9sec compiling `rustc_abuse`. AFAICT this is basically just due to a need for more micro-optimization of the matching process: perf shows a fair amount of time just spent iterating over the candidate list. Still, it seemed worth opening a PR with this step alone, since it's a big step forward.

@Aatch

The older algorithm was pretty inefficient for big matches. Fixes #29227. (On my computer, MIR construction on this test case goes from 9.9s to 0.025s.) Whereas before we had a loop like: - for all outcomes of the test we are performing - for all candidates - check whether candidate is relevant to outcome We now do: - for all candidates - determine which outcomes the candidate is relevant to Since the number of outcomes in this case is proportional to the number of candidates, the original algorithm turned out to be O(n^2), and the newer one is just O(n). This PR also does some minor speedups by eagerly mirroring all patterns, so that we can just pass around `&Pattern<'tcx>`, which makes cloning cheaper. We could probably go a bit further in this direction. r? @Aatch

sfackler added the T-compiler Relevant to the compiler team, which will review and decide on the PR/issue. label Oct 22, 2015

nikomatsakis mentioned this issue Oct 22, 2015

Tracking issue for MIR (RFC #1211) #27840

Closed

16 tasks

nikomatsakis closed this as completed in 2d5b8b0 Oct 26, 2015

nikomatsakis reopened this Oct 26, 2015

nikomatsakis mentioned this issue Nov 4, 2015

Introduce a SwitchInt construct to MIR #29588

Merged

nikomatsakis mentioned this issue Nov 10, 2015

OOM building Servo after Rustup - deep recursion in mir #29740

Closed

nikomatsakis added a commit to nikomatsakis/rust that referenced this issue Nov 11, 2015

Add regression test for rust-lang#29227.

f027612

nikomatsakis mentioned this issue Nov 11, 2015

Change match desugaring in MIR to be O(n) instead of O(n^2) #29763

Merged

bors closed this as completed in #29763 Nov 11, 2015

CAD97 mentioned this issue Aug 13, 2022

Confusing error message in the presence of unicode combining characters #100388

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Perf regression in nightly when compiling massive matches #29227

Perf regression in nightly when compiling massive matches #29227

swgillespie commented Oct 22, 2015

alexcrichton commented Oct 22, 2015

nikomatsakis commented Oct 22, 2015

nikomatsakis commented Oct 22, 2015

eddyb commented Oct 23, 2015

nikomatsakis commented Oct 23, 2015

eddyb commented Oct 23, 2015

nikomatsakis commented Oct 23, 2015 via email

arielb1 commented Oct 23, 2015

eddyb commented Oct 24, 2015

nikomatsakis commented Oct 26, 2015

nikomatsakis commented Oct 26, 2015

nikomatsakis commented Oct 26, 2015

Perf regression in nightly when compiling massive matches #29227

Perf regression in nightly when compiling massive matches #29227

Comments

swgillespie commented Oct 22, 2015

alexcrichton commented Oct 22, 2015

nikomatsakis commented Oct 22, 2015

nikomatsakis commented Oct 22, 2015

eddyb commented Oct 23, 2015

nikomatsakis commented Oct 23, 2015

eddyb commented Oct 23, 2015

nikomatsakis commented Oct 23, 2015 via email

arielb1 commented Oct 23, 2015

eddyb commented Oct 24, 2015

nikomatsakis commented Oct 26, 2015

nikomatsakis commented Oct 26, 2015

nikomatsakis commented Oct 26, 2015