experiment with "inverting" liveness to improve perf of html5ever #52460

nikomatsakis · 2018-07-17T15:52:49Z

html5ever profiles are dominated by code that "applies" the result of computing liveness. In thinking over how this code works, it seems like it might be better to "invert". Currently what happens is this:

The util/liveness code computes, for each point in the CFG, the set of live variables at each point
- but we want to distinguish two kinds of liveness: live because it will be used, and live because it will be dropped
- so actually we run the analysis twice
- these results are stored as a bitset of variables per program point
Then the NLL typeck code goes over each program point. For each point, it looks at each live variable at that point, and identifies the regions within that variable's type. Those regions are set to be live at that point.
- these results are stored as a bitset of program points per region

You can see here the mismatch: in the first case, we store a bitset of variables per point, and in the later, a bitset of points per region. This results in a lot of "conversion" instructions.

I was thinking that it might be more efficient to do the liveness in an inverted sense. Instead of re-using the util/liveness code, we would do the following:

Do one pass over the CFG to create an index mapping each variable to the points where it is referenced (this ought to be roughly an O(n) walk).
For each variable V whose type T contains regions:
- We would walk backward from each use to find the defs that dominate them.
- Each block that we visit is a place where V is live, so we can accumulate those blocks into a sparse bit set.
  - Depending on how we organize the index of uses above, we can make this particularly efficient I imagine, since we should be able to cheaply answer the question "is V defined anywhere in this basic block".
  - We can also handle the drop-live vs use-live distinction as part of this walk in a relatively straightforward fashion I imagine. When we are walking backwards from a drop, we mark points as drop-live and stop at any use or definition (since, from that point back, the value is either use-live or not live at all).
Once we are done we should have a bit set of points where V is live and a bit set of drop-live points.
- We then iterate over region R in the type of T and merge that live-set into its value.
- Similarly for the drop-live points, but only over the relevant regions.

This isn't fundamentally different but it has a few potential advantages:

We enumerate the regions in the type of each variable ONCE -- and when we do it, we have the full set of live points in hand
We don't have to do two liveness computations or deal with the somewhat messy job of stitching them back together

One complication I remember now is that for drop-live we also do need the results of initialization at each point. That could be a bit of a pain to reconcile, have to think about it.

All in all, not an easy change, but maybe a good one.

The text was updated successfully, but these errors were encountered:

nikomatsakis · 2018-07-17T19:59:26Z

Decided to push this back to Release Candidate milestone:

This is mostly targeting the outlier case of html5ever, though it may have broader benefits
It's a "medium size" PR, not an easy win
The focus for EP2 is on "must fix" items

@pnkfelix

[wip] NLL: experiment with inverting liveness I got inspired to see what would happen here. cc #52460 r? @pnkfelix

@pnkfelix

NLL: experiment with inverting liveness I got inspired to see what would happen here. cc #52460 r? @pnkfelix

@pnkfelix

NLL: experiment with inverting liveness I got inspired to see what would happen here. Fixes #52460 r? @pnkfelix

nikomatsakis added WG-compiler-nll NLL-performant Working towards the "performance is good" goal labels Jul 17, 2018

nikomatsakis added this to the Rust 2018 Preview 2 milestone Jul 17, 2018

nikomatsakis added I-nominated A-NLL Area: Non-lexical lifetimes (NLL) labels Jul 17, 2018

nikomatsakis modified the milestones: Rust 2018 Preview 2, Rust 2018 Release Candidate Jul 17, 2018

lqd mentioned this issue Jul 25, 2018

html5ever in the rustc-perf repository is memory-intensive #52028

Closed

nikomatsakis mentioned this issue Aug 13, 2018

NLL: experiment with inverting liveness #53314

Merged

bors added a commit that referenced this issue Aug 13, 2018

Auto merge of #53314 - nikomatsakis:nll-invert-liveness, r=<try>

0d2e1bf

[wip] NLL: experiment with inverting liveness I got inspired to see what would happen here. cc #52460 r? @pnkfelix

bors added a commit that referenced this issue Aug 22, 2018

Auto merge of #53314 - nikomatsakis:nll-invert-liveness, r=<try>

c658645

NLL: experiment with inverting liveness I got inspired to see what would happen here. cc #52460 r? @pnkfelix

nikomatsakis removed the WG-compiler-nll label Aug 27, 2018

nikomatsakis self-assigned this Aug 27, 2018

nikomatsakis removed the I-nominated label Aug 27, 2018

bors added a commit that referenced this issue Aug 28, 2018

Auto merge of #53314 - nikomatsakis:nll-invert-liveness, r=pnkfelix

83ddc33

NLL: experiment with inverting liveness I got inspired to see what would happen here. Fixes #52460 r? @pnkfelix

bors closed this as completed in #53314 Aug 28, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

experiment with "inverting" liveness to improve perf of html5ever #52460

experiment with "inverting" liveness to improve perf of html5ever #52460

nikomatsakis commented Jul 17, 2018

nikomatsakis commented Jul 17, 2018

experiment with "inverting" liveness to improve perf of html5ever #52460

experiment with "inverting" liveness to improve perf of html5ever #52460

Comments

nikomatsakis commented Jul 17, 2018

nikomatsakis commented Jul 17, 2018