Add telemetry probe for dex load time #8803

ecsmyth · 2019-12-17T01:14:26Z

edit: this issue was repurposed to only instrument dex load time instead of startup time but the description for startup probes remain below.

Why/User Benefit/User Problem

Startup time is an oft cited reason for friction and churn. We need to understand how long it takes Fenix to startup for our users so that we can prioritize work to address regressions and outliers that result in decreased engagement or churn. Local and CI-based testing provides an incomplete picture of how Fenix performs at startup.

Impact

Improved understanding of how Fenix performs on startup for our users.

Acceptance Criteria (how do I know when I’m done?)

We should make a best-effort to incrementally instrument as much of startup as possible - until we hit diminishing returns - because it's impossible to do it all. Unfortunately, it will be difficult to define what this looks like in advance so take this as a guiding principle. Msg mcomella if this does not make sense.

Some incremental steps we should try to implement for GA:

Add a metric that tracks the start time of GV
Add a metric that tracks the duration of FenixApplication.onCreate (this may or may not be helpful but we want to add it to see if it is in practice)
Add a metric that tracks FenixApplication.onCreate until first frame drawn
Distinguish between first run and other cold starts (don't let this impact performance!)
(is this reasonable?) Distinguish between which page was shown, i.e. which startup path was used
Time from first frame drawn to reportFullyDrawn? Essentially, capture loading top sites & open tabs because this is dependent on content quantity

Some incremental steps we probably can't implement by GA but we should verify:

Add a probe that tracks FenixApplication.onCreate until visual completeness
Add warm/hot start times, distinguish from cold start
Distinguish between cold starts after upgrading and other cold starts

Here is ecsmyth's ideal startup time probe that inspired these incremental steps:

Glean telemetry probe landed in Fenix release that measures time, as closely as is practical, from when user initiates startup until the app is started and the initial screen is visually complete. The probe should differentiate between hot, cold, and warm startup scenarios and account for differences in the first page rendered (e.g., how desktop measures first paint and first paint of about:home)

┆Issue is synchronized with this Jira Task

The text was updated successfully, but these errors were encountered:

mcomella · 2019-12-17T21:49:58Z

It may be valuable to revisit the cold/warm/hot startup definition doc here.

mcomella · 2020-02-05T22:02:23Z

Alessio suggested we can use Glean performance metrics to capture these effectively.

…Create.

… completeness.

During glean review, keeping as a separate commit to easily see the diff.

mcomella · 2020-03-25T21:05:46Z

Waiting on ecsmyth to determine if time for GeckoRuntime.init (i.e. the time Gecko starts on the main thread before continuing work on a background thread) is a valuable probe.

This wraps a Glean TimespanMetricType to make it safer to measure duration.

…start metrics.

We need to access the data in stat to get the process start time, so we can calculate the time from process start until application.init for the frameworkStart probe.

This class controls the central logic around the metrics we want to record.

We primarily want to determine if this is a problem area for us to investigate rather than a long term measurement to keep so we should set the expiration date accordingly. Furthermore, this code executes before crash reporting is init so it's ideal to remove it sooner rather than later.

…t capture methods.

…start metrics.

We need to access the data in stat to get the process start time, so we can calculate the time from process start until application.init for the frameworkStart probe.

This class controls the central logic around the metrics we want to record.

We primarily want to determine if this is a problem area for us to investigate rather than a long term measurement to keep so we should set the expiration date accordingly. Furthermore, this code executes before crash reporting is init so it's ideal to remove it sooner rather than later.

…t capture methods.

We need to access the data in stat to get the process start time, so we can calculate the time from process start until application.init for the frameworkStart probe.

This class controls the central logic around the metrics we want to record.

We primarily want to determine if this is a problem area for us to investigate rather than a long term measurement to keep so we should set the expiration date accordingly. Furthermore, this code executes before crash reporting is init so it's ideal to remove it sooner rather than later.

…ods.

mcomella · 2020-04-20T21:06:02Z

I added the "dex launch time" probe: the time between process start and Application.<init>. I'll leave this issue open to remember to investigate the results and potentially file a follow-up for a more involved analysis.

mcomella · 2020-04-23T18:57:24Z

I'm going to close as fixed and create a new issue for the analysis.

mcomella · 2020-04-23T19:08:02Z

Filed #10161 for the analysis.

mcomella transferred this issue from mozilla-mobile/perf-frontend-issues Feb 27, 2020

mcomella added the performance Possible performance wins label Feb 27, 2020

mcomella mentioned this issue Feb 27, 2020

Add telemetry to determine ratio of cold/warm/hot starts #5912

Closed

github-actions bot added the needs:triage Issue needs triage label Feb 27, 2020

mcomella self-assigned this Feb 27, 2020

mcomella moved this from Needs prioritization to In progress in Performance, front-end roadmap Feb 27, 2020

mcomella added a commit to mcomella/fenix that referenced this issue Feb 28, 2020

For mozilla-mobile#8803: add StartupTimelineMeasurements.geckoInit.

a9e487d

mcomella added a commit to mcomella/fenix that referenced this issue Feb 28, 2020

For mozilla-mobile#8803: add StartupTimelineMeasurements.geckoRuntime…

74f13db

…Create.

mcomella added a commit to mcomella/fenix that referenced this issue Mar 13, 2020

For mozilla-mobile#8803: add DurationContainer + tests.

dd657a5

mcomella added a commit to mcomella/fenix that referenced this issue Mar 13, 2020

For mozilla-mobile#8803: add StartupTimelineMeasurements.

fc462d2

mcomella added a commit to mcomella/fenix that referenced this issue Mar 13, 2020

For mozilla-mobile#8803: measure duration of GeckoRuntime.create.

fcfb536

mcomella added a commit to mcomella/fenix that referenced this issue Mar 13, 2020

For mozilla-mobile#8803: log StartupTimelineMeasurements after visual…

f4bc341

… completeness.

mcomella added a commit to mcomella/fenix that referenced this issue Mar 13, 2020

For mozilla-mobile#8803: record duration of geckoRuntimeCreate in Glean.

689f351

mcomella added a commit to mcomella/fenix that referenced this issue Mar 18, 2020

For mozilla-mobile#8803: add DurationContainer + tests.

ed7c095

mcomella added a commit to mcomella/fenix that referenced this issue Mar 18, 2020

For mozilla-mobile#8803: add StartupTimelineMeasurements.

5bff242

mcomella added a commit to mcomella/fenix that referenced this issue Mar 18, 2020

For mozilla-mobile#8803: measure duration of GeckoRuntime.create.

0d04de8

mcomella added a commit to mcomella/fenix that referenced this issue Mar 18, 2020

For mozilla-mobile#8803: log StartupTimelineMeasurements after visual…

5f58098

… completeness.

mcomella added a commit to mcomella/fenix that referenced this issue Mar 18, 2020

For mozilla-mobile#8803: record duration of geckoRuntimeCreate in Glean.

4fc6e07

mcomella added a commit to mcomella/fenix that referenced this issue Mar 19, 2020

For mozilla-mobile#8803: FOLD ME. change to recording in Glean.

07a1897

During glean review, keeping as a separate commit to easily see the diff.

mcomella added a commit to mcomella/fenix that referenced this issue Mar 19, 2020

For mozilla-mobile#8803: FOLD ME. change to recording in Glean.

25ab69f

During glean review, keeping as a separate commit to easily see the diff.

mcomella added a commit to mcomella/fenix that referenced this issue Mar 19, 2020

For mozilla-mobile#8803: FOLD ME. change to recording in Glean.

b716bf0

During glean review, keeping as a separate commit to easily see the diff.

mcomella mentioned this issue Mar 25, 2020

Add StartupTimeline class to better divide & instrument startup #7822

Closed

mcomella moved this from In progress to Waiting in Performance, front-end roadmap Mar 25, 2020

mcomella assigned ecsmyth Mar 25, 2020

mcomella mentioned this issue Mar 25, 2020

Gather telemetry on representative user experiences for performance testing #9069

Closed

mcomella added a commit to mcomella/fenix that referenced this issue Mar 31, 2020

For mozilla-mobile#8803: add StartupTimeline.measure.

6e86de8

This wraps a Glean TimespanMetricType to make it safer to measure duration.

mcomella added a commit to mcomella/fenix that referenced this issue Mar 31, 2020

For mozilla-mobile#8803: add StartupTimeline ping type and gecko metric.

47e7057

mcomella added a commit to mcomella/fenix that referenced this issue Apr 9, 2020

For mozilla-mobile#8803: add StartupTimeline ping type and framework_…

84848f4

…start metrics.

mcomella added a commit to mcomella/fenix that referenced this issue Apr 9, 2020

For mozilla-mobile#8803: add StartupFrameworkStartMeasurement.

034cdbe

This class controls the central logic around the metrics we want to record.

mcomella added a commit to mcomella/fenix that referenced this issue Apr 9, 2020

For mozilla-mobile#8803: hook up frameworkStart metric.

a7aaf76

mcomella added a commit to mcomella/fenix that referenced this issue Apr 15, 2020

For mozilla-mobile#8803 - review: Add clarifying comments to onAppIni…

15cb5b1

…t capture methods.

mcomella added a commit to mcomella/fenix that referenced this issue Apr 16, 2020

For mozilla-mobile#8803: add StartupTimeline ping type and framework_…

17d1ef3

…start metrics.

mcomella added a commit to mcomella/fenix that referenced this issue Apr 16, 2020

For mozilla-mobile#8803: add StartupFrameworkStartMeasurement.

ac5f393

This class controls the central logic around the metrics we want to record.

mcomella added a commit to mcomella/fenix that referenced this issue Apr 16, 2020

For mozilla-mobile#8803: hook up frameworkStart metric.

5f103a7

mcomella added a commit to mcomella/fenix that referenced this issue Apr 16, 2020

For mozilla-mobile#8803 - review: Add clarifying comments to onAppIni…

a36eb94

…t capture methods.

mcomella added a commit to mcomella/fenix that referenced this issue Apr 16, 2020

For mozilla-mobile#8803 - post: update metrics & pings data review URL.

b51198c

mcomella added a commit that referenced this issue Apr 17, 2020

For #8803: add StartupTimeline ping type and framework_start metrics.

a0c4b33

mcomella added a commit that referenced this issue Apr 17, 2020

For #8803: add Stat and test.

7f618a6

We need to access the data in stat to get the process start time, so we can calculate the time from process start until application.init for the frameworkStart probe.

mcomella added a commit that referenced this issue Apr 17, 2020

For #8803: add StartupFrameworkStartMeasurement.

dbf733d

This class controls the central logic around the metrics we want to record.

mcomella added a commit that referenced this issue Apr 17, 2020

For #8803: hook up frameworkStart metric.

f49fc6d

mcomella added a commit that referenced this issue Apr 17, 2020

For #8803 - review: Add clarifying comments to onAppInit capture meth…

f3ed207

…ods.

mcomella added a commit that referenced this issue Apr 17, 2020

For #8803 - post: update metrics & pings data review URL.

909ee73

mcomella moved this from In progress to Waiting in Performance, front-end roadmap Apr 20, 2020

mcomella changed the title ~~Add telemetry probe for startup time~~ Add telemetry probe for dex load time Apr 20, 2020

mcomella mentioned this issue Apr 20, 2020

Add telemetry probe for startup time #10069

Closed

mcomella moved this from Waiting to Done in Performance, front-end roadmap Apr 23, 2020

mcomella closed this as completed Apr 23, 2020

mcomella mentioned this issue Apr 23, 2020

Analyze dex load time telemetry #10161

Closed

liuche mentioned this issue Apr 28, 2020

Releng 5.0 #10205

Closed

32 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add telemetry probe for dex load time #8803

Add telemetry probe for dex load time #8803

ecsmyth commented Dec 17, 2019 •

edited by data-sync-user

mcomella commented Dec 17, 2019

mcomella commented Feb 5, 2020

mcomella commented Mar 25, 2020

mcomella commented Apr 20, 2020

mcomella commented Apr 23, 2020

mcomella commented Apr 23, 2020

Add telemetry probe for dex load time #8803

Add telemetry probe for dex load time #8803

Comments

ecsmyth commented Dec 17, 2019 • edited by data-sync-user

Why/User Benefit/User Problem

Impact

Acceptance Criteria (how do I know when I’m done?)

mcomella commented Dec 17, 2019

mcomella commented Feb 5, 2020

mcomella commented Mar 25, 2020

mcomella commented Apr 20, 2020

mcomella commented Apr 23, 2020

mcomella commented Apr 23, 2020

ecsmyth commented Dec 17, 2019 •

edited by data-sync-user