
report: sort performance audits based on impact #15445

Merged: 39 commits merged into main from metric-savings-score on Oct 3, 2023
Conversation

@adrianaixba (Collaborator) commented Sep 8, 2023

This

  • removes the Opportunities section and moves its audits into Diagnostics
  • in the renderer, calculates an audit's impact to sort using the guidance level as a tie breaker

In a follow-up (#15447), the score will be modified to use impact, and it will then also be used as a sorting variable.

https://lighthouse-git-metric-savings-score-googlechrome.vercel.app/sample-reports/english/
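To make the sorting idea above concrete, here is a rough sketch of such a comparator. This is illustrative only, not the merged renderer code; guidanceLevel is an assumed field name for the audit's guidance level.

// Sketch only: sort diagnostics by estimated score impact, falling back to
// linear impact and then the guidance level as a tie breaker.
interface SortableAudit {
  overallImpact: number;        // estimated effect on the performance score
  overallLinearImpact: number;  // linear fallback for audits whose score impact rounds to 0
  guidanceLevel: number;        // hypothetical: manually assigned priority for the audit
}

function compareAudits(a: SortableAudit, b: SortableAudit): number {
  if (a.overallImpact !== b.overallImpact) return b.overallImpact - a.overallImpact;
  if (a.overallLinearImpact !== b.overallLinearImpact) {
    return b.overallLinearImpact - a.overallLinearImpact;
  }
  return b.guidanceLevel - a.guidanceLevel;
}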

@brendankenny (Member)

  • in the renderer, calculates an audit's impact to sort using the guidance level as a tie breaker

    Would it be possible for this (the overallImpact() calc part) to happen in core? It would probably take a little rearranging (AFAIK audit results aren't currently altered after coming out of Audit.generateAuditResult(audit, product), but they would need to be for the perf category to do this scoring based on the values of the metrics), but for JSON consumers it would be far more valuable to get the processed impact-on-score numbers than curve control points they have to compute from manually.

  • What's the kind of case the linear impact fallback helps with? Is it for cases like 20s LCP, 5s improvement won't help the score much but is still a significant win? Does it end up mattering much in practice since (I'm assuming) overallImpact differences will typically dominate the sort?

  • Some comments for the two impact numbers and a quick high-level explanation comment on the goal of the sort would be greatly appreciated in there.

@adrianaixba (Collaborator, Author)

@adamraine feel free to chime in!

Would it be possible for this (the overallImpact() calc part) to happen in core?

My understanding is that it's difficult to do this, and maybe not immediately worth it, because the audits would require a reference to other audits and the metric scores that you mentioned. We could potentially try for this in a future iteration?

What's the kind of case the linear impact fallback helps with?

The linear impact here is mostly for the bottom section of the audits (the less prioritized ones). You're right that overallImpact will dominate the sort. The linear impact mostly helps us prioritize audits when comparing those that have no overallImpact savings.
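As an illustration of the two numbers being discussed, here is a minimal sketch under assumptions (the renderer's actual formulas differ): overallImpact weights each metric's estimated score change, while overallLinearImpact weights the raw savings so that audits whose score impact rounds to 0 can still be ordered against each other.

// Minimal sketch, assuming per-metric savings, weights, and scores are available.
interface MetricInfo {
  acronym: string;            // e.g. 'LCP'
  weight: number;             // metric's weight in the performance score
  score: number;              // current metric score (0..1)
  scoreAfterSavings: number;  // hypothetical score if this audit's savings were applied
}

function estimateImpacts(metricSavings: Partial<Record<string, number>>, metrics: MetricInfo[]) {
  let overallImpact = 0;
  let overallLinearImpact = 0;
  for (const metric of metrics) {
    const savings = metricSavings[metric.acronym];
    if (savings === undefined) continue;
    // Score-based impact: how much the performance score would improve.
    overallImpact += (metric.scoreAfterSavings - metric.score) * metric.weight;
    // Linear fallback: weight the raw savings, so huge-but-flat-curve wins still rank.
    overallLinearImpact += savings * metric.weight;
  }
  return {overallImpact, overallLinearImpact};
}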

@adamraine (Member) commented Sep 27, 2023

Would it be possible for this (the overallImpact() calc part) to happen in core?

My understanding is that it's difficult to do this, and maybe not immediately worth it, because the audits would require a reference to other audits and the metric scores that you mentioned. We could potentially try for this in a future iteration?

Yeah, this creates a situation where the result of one audit is dependent on the result of another audit. What happens when someone does skipAudits: ['largest-contentful-paint']? FWIW we considered baking the overall impact into audit score for #15447 but decided to avoid it for this reason.

Is it for cases like 20s LCP, 5s improvement won't help the score much but is still a significant win?

Pretty much this. It's a way to compare the impact of audits whose log-normal impact gets rounded down to 0 because the actual metric value is so large.
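For concreteness, a small illustration of why that happens (a sketch, not the actual Lighthouse scoring code; the LCP-like control points of 2500 ms / 4000 ms are an assumption): on a log-normal curve the tail is nearly flat, so 20 s and 15 s both score approximately 0 even though 5 s of savings is a big win.

// Illustration only: a log-normal score built from two control points, roughly
// mirroring how metric scores behave (score(p10) = 0.9, score(median) = 0.5).
function erfc(x: number): number {
  // Abramowitz & Stegun approximation of erf, used to get erfc = 1 - erf.
  const sign = x < 0 ? -1 : 1;
  const ax = Math.abs(x);
  const t = 1 / (1 + 0.3275911 * ax);
  const erf = 1 - ((((1.061405429 * t - 1.453152027) * t + 1.421413741) * t -
    0.284496736) * t + 0.254829592) * t * Math.exp(-ax * ax);
  return 1 - sign * erf;
}

function logNormalScore(value: number, p10: number, median: number): number {
  // Shape chosen so that score(p10) = 0.9 and score(median) = 0.5.
  const sigma = Math.log(median / p10) / (Math.SQRT2 * 0.9061938024368232);
  return 0.5 * erfc(Math.log(value / median) / (Math.SQRT2 * sigma));
}

// Assumed LCP-like control points: p10 = 2500 ms, median = 4000 ms.
console.log(logNormalScore(20000, 2500, 4000)); // ~0.000006, rounds to 0
console.log(logNormalScore(15000, 2500, 4000)); // ~0.0002, also rounds to 0
// The 5 s improvement changes the score by ~0, but a linear impact still sees 5000 ms of savings.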

Some comments for the two impact numbers and a quick high-level explanation comment on the goal of the sort would be greatly appreciated in there.

+1 to comments

@brendankenny (Member)

Would it be possible for this (the overallImpact() calc part) to happen in core?

My understanding is that it's difficult to do this, and maybe not immediately worth it, because the audits would require a reference to other audits and the metric scores that you mentioned. We could potentially try for this in a future iteration?

Yeah, this creates a situation where the result of one audit is dependent on the result of another audit. What happens when someone does skipAudits: ['largest-contentful-paint']? FWIW we considered baking the overall impact into audit score for #15447 but decided to avoid it for this reason.

Isn't the available information the same if doing it in runner vs in the report renderer? The values would just have to be optional, like how they're already treated in overallImpact?

This is 30 seconds of thinking, so there's probably a better way to do it and definitely better naming possible, but MetricSavings could become:

interface MetricSavings {
  LCP?: {
    value?: number;
    score?: number;
  };
  FCP?: {
    value?: number;
    score?: number;
  };
  CLS?: {
    value?: number;
    score?: number;
  };
  TBT?: {
    value?: number;
    score?: number;
  };
  INP?: {
    value?: number;
    score?: number;
  };
}

Scores (or impacts, I don't know) are left empty during audit processing, filled in with a function call like computeRelativeMetricScores(perfCategory) in the runner, and overallImpact gets stuck somewhere convenient. Maybe overallLinearImpact too, but I don't know about that one's name :)
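A minimal sketch of what that could look like in the runner. This is hypothetical: computeRelativeMetricScores and the metricSavings shape above are suggestions from this thread, not the implemented API, and scoreForMetric stands in for the existing scoring curve.

// Hypothetical sketch: after all audits have run, fill in per-metric score
// deltas so JSON consumers get processed impact numbers rather than having to
// recompute them from curve control points.
type MetricSavingsEntry = {value?: number; score?: number};
type AuditWithSavings = {metricSavings?: Record<string, MetricSavingsEntry>};

function computeRelativeMetricScores(
  audits: AuditWithSavings[],
  metricValues: Record<string, number | undefined>,
  scoreForMetric: (acronym: string, value: number) => number,
): void {
  for (const audit of audits) {
    if (!audit.metricSavings) continue;
    for (const [acronym, entry] of Object.entries(audit.metricSavings)) {
      const currentValue = metricValues[acronym];
      // Metric may be skipped (e.g. skipAudits) or savings may be absent; leave score empty.
      if (currentValue === undefined || entry.value === undefined) continue;
      const before = scoreForMetric(acronym, currentValue);
      const after = scoreForMetric(acronym, Math.max(0, currentValue - entry.value));
      entry.score = after - before; // estimated score improvement from this audit's savings
    }
  }
}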

@adamraine (Member) commented Sep 28, 2023

Isn't the available information the same if doing it in runner vs in the report renderer? The values would just have to be optional, like how they're already treated in overallImpact?

The difference is that overallImpact (in its current iteration) is an internal variable. If largest-contentful-paint is missing the sort order will change but the JSON will remain the same.

In general, I'm unsure if we should expose overallImpact or individual metric impacts to users at all. I think it's better to keep the impact as an internal ranking heuristic and not an authoritative data point.

@brendankenny (Member)

FWIW:

  • the ordering isn't really internal, it's very visible in the HTML report :) I think it's ok to make it clear that heuristics are subject to change from version to version as this is iterated on.

  • Also, while I think it would be valuable to take a look at the relative ranking of audits before and after this change to judge its impact (using e.g. HTTP Archive data for a broad sampling), my suggestion here is less about the ranking order and more about the much more concrete scoring impact, which is a straightforward application of the existing scoring system to the metric savings we're already exposing.

  • If largest-contentful-paint is missing the sort order will change but the JSON will remain the same.

    This is a good point for consistency (though possibly an argument for dropping the scoring-level dependency between opportunities and metrics and making the dependency explicit), but for the 99% of the time or whatever when a metric isn't dropped, it would be useful to have the score impact, so maybe it's worth the tradeoff?

@adamraine (Member)

the ordering isn't really internal, it's very visible in the HTML report :) I think it's ok to make it clear that heuristics are subject to change from version to version as this is iterated on.

I wasn't suggesting that the sorting is internal, but I don't think we are obligated to expose in the JSON the exact order audits should appear in.

Also, while I think it would be valuable to take a look at the relative ranking of audits before and after this change to judge its impact (using e.g. HTTP Archive data for a broad sampling)

You might find this challenging to do because the report before this change has two different groups and sorts them using heuristics that are internal to the report renderer (overallSavingsMs in opportunities; score, with informative audits dropped to the bottom, in diagnostics). Any comparison in HTTPA will require you to replicate the sorting logic from before this patch as well.
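For reference, those pre-change heuristics are roughly the following. This is a paraphrase of the renderer behavior described above under assumed audit shapes, not the exact old renderer code.

// Rough paraphrase of the old report-renderer ordering:
// opportunities by overallSavingsMs (largest first); diagnostics by score
// (worst first), with informative audits dropped to the bottom.
type OldAuditRef = {
  result: {
    score: number | null;
    scoreDisplayMode: string;
    details?: {overallSavingsMs?: number};
  };
};

const byWastedMs = (a: OldAuditRef, b: OldAuditRef) =>
  (b.result.details?.overallSavingsMs ?? 0) - (a.result.details?.overallSavingsMs ?? 0);

const byScoreThenInformative = (a: OldAuditRef, b: OldAuditRef) => {
  const aInformative = a.result.scoreDisplayMode === 'informative';
  const bInformative = b.result.scoreDisplayMode === 'informative';
  if (aInformative !== bInformative) return aInformative ? 1 : -1; // informative last
  return (a.result.score ?? 0) - (b.result.score ?? 0);            // worst score first
};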

my suggestion here is less about the ranking order and more about the much more concrete scoring impact, which is a straightforward application of the existing scoring system to the metric savings we're already exposing.

It will still be possible to compute the scoring impact in HTTPA using the metric scoring options and the metricSavings provided on the LHR.

This is a good point for consistency (though possibly an argument for dropping the scoring-level dependency between opportunities and metrics and making the dependency explicit), but the 99% of the time or whatever when a metric isn't dropped it would be useful to have the score impact, so maybe it's worth the tradeoff?

I think our options for this are:

  1. Create a proper audit dependency system.
    • The work required here doesn't seem worth the tradeoff to me
  2. Create some bespoke logic for the performance category when we create the lhr JSON. We currently treat all categories the same when constructing the JSON.
    • It's hard to imagine a solution here that won't introduce some technical debt

@brendankenny (Member)

Create some bespoke logic for the performance category when we create the lhr JSON. We currently treat all categories the same when constructing the JSON.

  • It's hard to imagine a solution here that won't introduce some technical debt

I don't believe that's the case. Because almost all the new perf audit infrastructure is optional (metricSavings, scoringOptions, even acronym, etc.), overallImpact() would work essentially unchanged in runner.js over all categories, because it already has to handle when those properties are missing (like long-tasks and other still-informative audits). Or, equivalently, there is bespoke logic, but it's already implicitly encoded in all the special things perf audits get that the other categories' audits don't, so there's no new special handling needed.

In any case, it does feel like the JSON consumer story is being treated as an intermediate HTML-report-generation step here rather than an endpoint of its own. BUT y'all have done a lot of good work and thinking on this and I don't want to block the effort. Carry on! ✌️

const {
  overallImpact: aOverallImpact,
  overallLinearImpact: aOverallLinearImpact,
} = this.overallImpact(a, metricAudits);

right now we recompute each audit's overallImpact each time this comparator is run.

it ends up not being super costly, but... it just seems like good practice to avoid recomputing the same thing. can we move that higher?
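One way to do that (a sketch only; computeOverallImpact stands in for the renderer's overallImpact helper, and the audit shape is left generic): compute each audit's impact once up front and sort on the cached values.

// Sketch: precompute impacts once instead of inside the comparator.
interface Impact {overallImpact: number; overallLinearImpact: number}

function sortByImpact<T>(audits: T[], computeOverallImpact: (audit: T) => Impact): T[] {
  const cache = new Map<T, Impact>();
  for (const audit of audits) cache.set(audit, computeOverallImpact(audit));
  return audits.slice().sort((a, b) => {
    const ia = cache.get(a)!;
    const ib = cache.get(b)!;
    return (ib.overallImpact - ia.overallImpact) ||
           (ib.overallLinearImpact - ia.overallLinearImpact);
  });
}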

.filter(audit => this._classifyPerformanceAudit(audit) === 'load-opportunity')
.filter(audit => !ReportUtils.showAsPassed(audit.result))
.sort((auditA, auditB) => this._getWastedMs(auditB) - this._getWastedMs(auditA));

const filterableMetrics = metricAudits.filter(a => !!a.relevantAudits);

what about the metric filter? it seems like we have all the info we need to resort when we filter to just FCP, etc.


Good point. That might be best left as a follow-up; the old metric filter is adequate for now. After this PR but before the release?
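A rough sketch of how that follow-up could look (hypothetical, not part of this PR): when the view is filtered to a single metric, re-sort by that metric's savings alone.

// Hypothetical follow-up sketch: re-sort by one metric's savings when filtered.
function sortForMetricFilter<T extends {metricSavings?: Record<string, number>}>(
  audits: T[],
  acronym: string,  // e.g. 'FCP'
): T[] {
  return audits
    .slice()
    .sort((a, b) => (b.metricSavings?.[acronym] ?? 0) - (a.metricSavings?.[acronym] ?? 0));
}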


Sure. Clearly something about the score/icons will also have to land post-this-PR, pre-release.
sg


@adamraine merged commit 896399b into main on Oct 3, 2023. 30 checks passed.
@adamraine deleted the metric-savings-score branch on October 3, 2023 at 18:06.