
Fix performance regression in debuginfo file_metadata. #70803

Merged
merged 1 commit into rust-lang:master on Apr 5, 2020

Conversation


@arlosi commented Apr 5, 2020

Fixes performance regression caused by #69718.

Finding the `SourceFile` associated with a `FileName` required calling `get_source_file` on the `SourceMap`, which does a linear search through all files in the `SourceMap`.

This resolves the issue by passing the `SourceFile` in from the caller (which already had it available) instead of the `FileName`.

Fixes #70785.

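For illustration, here is a minimal sketch of the shape of the change, using simplified stand-in types rather than the actual rustc definitions: `file_metadata` now receives the `SourceFile` the caller already holds, instead of a `FileName` that has to be looked up again.

```rust
// Sketch only: `FileName`, `SourceFile`, and the functions below are
// simplified stand-ins, not the real rustc items touched by this PR.
struct FileName(String);
struct SourceFile {
    name: FileName,
    src: String,
}

// Before: the function received only the name and had to find the
// `SourceFile` again via `SourceMap::get_source_file`, a linear search.
fn file_metadata_old(file_name: &FileName) {
    // let source_file = source_map.get_source_file(file_name); // O(n) lookup
    let _ = file_name;
}

// After (this PR): the caller already holds the `SourceFile`, so it is
// passed in directly and no lookup is needed.
fn file_metadata(source_file: &SourceFile) {
    let _ = (&source_file.name, &source_file.src); // build debuginfo directly
}
```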
@rust-highfive

r? @matthewjasper

(rust_highfive has picked a reviewer for you, use r? to override)

@rust-highfive added the S-waiting-on-review label (Status: Awaiting review from the assignee but also interested parties) on Apr 5, 2020

arlosi commented Apr 5, 2020

r? @eddyb

Should we do a perf run to see if this fully resolves the regression? It seemed to resolve it in a few benchmarks I ran locally.

@rust-highfive assigned eddyb and unassigned matthewjasper on Apr 5, 2020

let source_file = cx.sess().source_map().get_source_file(file_name);

Ahh, I missed this during review! This is indeed unnecessarily slow.
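For context, `get_source_file` walks the `SourceMap`'s file list comparing names. A simplified sketch of that kind of lookup (not the exact rustc code; the real `SourceMap` stores reference-counted `SourceFile`s and richer `FileName`s) shows why calling it once per `file_metadata` invocation is costly:

```rust
// Simplified sketch of a linear name lookup like `get_source_file`.
struct SourceFile {
    name: String,
}

struct SourceMap {
    files: Vec<SourceFile>,
}

impl SourceMap {
    fn get_source_file(&self, name: &str) -> Option<&SourceFile> {
        // O(number of files) on every call; doing this for every
        // debuginfo file lookup is what caused the regression.
        self.files.iter().find(|f| f.name == name)
    }
}
```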

file_name: &FileName,
source_file: &SourceFile,

Heh, I think I did this (plus removing defining_crate and relying on the more accurate source_file.is_imported()) in one of my PRs that has been sitting around.
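As a rough illustration of that alternative (hypothetical simplified types; the actual change lives in a separate PR): instead of threading a `defining_crate` number through and comparing it against the local crate, the check can ask the `SourceFile` itself whether it was imported from another crate's metadata.

```rust
// Hypothetical sketch of the idea only; these are not the real rustc types.
const LOCAL_CRATE: u32 = 0;

struct SourceFile {
    // Set when the file's contents were loaded from another crate's metadata.
    imported: bool,
}

impl SourceFile {
    fn is_imported(&self) -> bool {
        self.imported
    }
}

// Before: callers passed a `defining_crate` number and compared it.
fn is_foreign_old(defining_crate: u32) -> bool {
    defining_crate != LOCAL_CRATE
}

// Alternative mentioned above: ask the file itself.
fn is_foreign(source_file: &SourceFile) -> bool {
    source_file.is_imported()
}
```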


eddyb commented Apr 5, 2020

Let's land it and see the results there, to reduce the time master spends regressed.

@bors r+ rollup=never p=10


bors commented Apr 5, 2020

📌 Commit 4cdceda has been approved by eddyb

@bors added the S-waiting-on-bors label (Status: Waiting on bors to run and complete tests; Bors will change the label on completion) and removed the S-waiting-on-review label on Apr 5, 2020

bors commented Apr 5, 2020

⌛ Testing commit 4cdceda with merge 607b858...


bors commented Apr 5, 2020

☀️ Test successful - checks-azure
Approved by: eddyb
Pushing 607b858 to master...

@bors added the merged-by-bors label (This PR was explicitly merged by bors) on Apr 5, 2020
@bors merged commit 607b858 into rust-lang:master on Apr 5, 2020
@nnethercote

@arlosi: This change fixes most of the regressions, thanks.

But I notice that the "clap-rs-debug patched incremental: println" benchmark regression is only partly fixed. Any ideas what happened there?


mati865 commented Apr 8, 2020

@nnethercote

@mati865: is that the right perf link? Those perf results look like not much changed.


mati865 commented Apr 8, 2020

@nnethercote it's the difference between right before the original PR and right after this fix, showing there is no big performance hit.
For clap, see these results (it's a different PR): https://perf.rust-lang.org/compare.html?start=74bd074eefcf4915c73d1ab91bc90859664729e6&end=9e55101bb681010c82c3c827305e2665fc8f2aa0&stat=instructions:u and the comment I linked above for the explanation.

@nnethercote

I see now that this caused a large regression in memory usage.

@arlosi: was that expected? Any thoughts on how it could be improved?


arlosi commented May 7, 2020

@nnethercote It appears that the change that caused the CPU regression also caused an rss improvement, and that fixing the CPU regression removed that rss improvement.

The original change that contained the CPU regression went in on 2020-04-03, and the fix went in on 2020-04-05. Looking at the graph for cargo-debug max-rss over that time period shows an improvement in memory usage, then a matching regression:
https://perf.rust-lang.org/index.html?start=2020-04-01&end=2020-04-08&absolute=true&stat=max-rss

Here's the run that introduced the CPU regression (and rss improvement):
https://perf.rust-lang.org/compare.html?start=9e55101bb681010c82c3c827305e2665fc8f2aa0&end=6050e523bae6de61de4e060facc43dc512adaccd&stat=max-rss

The key difference that caused the CPU regression was adding this line:
let source_file = cx.sess().source_map().get_source_file(file_name);
However, I can't explain how that could have reduced memory usage.

@nnethercote

I investigated the max RSS change. As far as I can tell, the number of allocations, and their sizes, are basically identical in the two versions. But the distribution is different, due to the significant timing differences. My theory is that because the back-end is multi-threaded, the way the memory peaks of the individual threads overlap has a big effect on how the global peak manifests.
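A toy illustration of that theory (made-up numbers, not measurements from this PR): two threads that allocate the same total amount produce a much higher process-wide peak when their individual peaks coincide than when they are staggered.

```rust
// Toy model (not rustc code): total allocation is identical under both
// schedules, but the global peak depends on whether per-thread peaks overlap.
fn peak_live_bytes(events: &[i64]) -> i64 {
    // Each event is a signed change in live bytes (positive = allocation,
    // negative = free), in the order events happen across all threads.
    let mut live = 0;
    let mut peak = 0;
    for &delta in events {
        live += delta;
        peak = peak.max(live);
    }
    peak
}

fn main() {
    // Two threads, each allocating 500 units and later freeing them.
    let overlapping = [500, 500, -500, -500]; // both peaks coincide
    let staggered = [500, -500, 500, -500]; // one finishes before the other starts
    assert_eq!(peak_live_bytes(&overlapping), 1000);
    assert_eq!(peak_live_bytes(&staggered), 500);
    println!(
        "overlapping peak = {}, staggered peak = {}",
        peak_live_bytes(&overlapping),
        peak_live_bytes(&staggered)
    );
}
```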

Here is what DHAT tells me about memory usage when the speed regression was in place:

    Total:     22,161,188,077 bytes (100%, 71,870.02/Minstr) in 52,214,866 blocks (100%, 169.34/Minstr), avg size 424.42 bytes, avg lifetime 6,620,711,098.71 instrs (2.15% of program duration)
    At t-gmax: 662,297,094 bytes (100%) in 2,100,503 blocks (100%), avg size 315.3 bytes

and here's what it looks like now:

    Total:     22,161,559,231 bytes (100%, 128,445.93/Minstr) in 52,214,794 blocks (100%, 302.63/Minstr), avg size 424.43 bytes, avg lifetime 7,079,327,664.2 instrs (4.1% of program duration)
    At t-gmax: 1,018,333,644 bytes (100%) in 4,138,546 blocks (100%), avg size 246.06 bytes

The "Total" lines describe the allocations done over the entire life of the program, and they are the same. The "At t-gmax" lines describe the live allocations that exist at the global memory peak, and they are very different.

Here is the with-regression memory usage shown by Massif:
[Massif graph: memory usage with the regression in place]
There are lots of local maxima, spread out evenly, that are close to the global maximum.

And here is the same thing for the current code:
[Massif graph: memory usage of the current code]
The global maximum is larger and more pronounced relative to the rest of the curve.

In short, I don't think there's really anything to be done about this. At least it's good to know that nothing silly is happening.
