metric: Time to Interactive #27

paulirish · 2016-03-18T22:13:26Z

Time 'To Interactive (or TTI) is a classic success metric across the web, games and application development. We're attempting to provide a robust version of it for the web platform.

Key goals for the metric:

User should feel like the page is done enough to start interacting -- they may scroll, tap on notification icons, start reading and expect the page to be not move around any longer.
The page should be ready for the user. It's not enough to present the appearance of done-ness. The user shouldn't have problems once they start engaging with the page, because it's either still busy or uninitialized. In other words, the page is not ready if main thread contention is preventing user interaction from being successful.

A working definition for Time To Interactive

Updated: April 6, 2016

There are two major timestamps, composed of smaller ones:

Timestamp: Visually ready for interaction

Visually ready is the Math.max() of these timestamps:

domContentLoadedEventEnd has fired
- Definition: HTML parsing has finished, all DOMContentLoaded handlers have run
User-agent loading indicator is done
- Current definition: Top frame and all iframes have fired window load event
- Proposed definition (from cl/1860743002): top frame only: DCL ended and all layout-blocking resources, plus images that begun their request before DCL ended have finished.
- Alternative definition (from Chrome on Android Progress Bar Enhancements - google-only, sry): top frame's DOMContentLoaded + top frame's images (who started before DCL) are loaded (IMO, we should use this one)
  - Perhaps only include we only include above-the-fold-ish images.
Above-the-fold layout has stabilized
- Could use WebMeaningfulLayout.h
- Could use ksakamoto's "largest layout"?
What about fonts?
- They are not necessarily addressed by above items.
- Do we wait for inflight fonts requests? Only if the text remains invisible (if the user allows a fallback font, then we don't wait for inflight)
- Can we ignore icon-fonts like font-awesome?
- We should at least collect the font-painted timestamps for all webfonts
- The Time to First Meaningful Paint: a layout-based approach algorithm considers in-flight font requests that layout more than 200 characters.
Viewport and above-the-fold
- None of the above considers what is in the viewport, and waits for resources that may be offscreen or 10,000px away.
- Should we only consider images that are above-the-fold?

Timestamp: Main thread ready for interaction

Once visually ready has hit, we begin considering thread availability.

Main thread is available
- Definition TBD.
- Approximate: 90% percentile main thread task duration ~ under 300ms. Maximum main thread task duration is < 400ms
- Perhaps: System Health "input readiness" Metric by @benshayden
- Perhaps: Ask requestIdleCallback for deadlines
  - "You call rIC until you get at least 20ms. Treat it as a proxy for page being settled. Seemed to correlate pretty well" ~@paullewis & jochen

@skyostil added: "it's super hard to congest the compositor, but a fair indicator will be availability of time on main thread because a) a lot of work will be main-thread bound and b) CPU contention (he thinks) can sometimes block the compositor from allowing scroll."

@paullewis added: "some of this may be in our wording where we explain that we're treating main thread as a proxy for interactivity"

Main-thread dependent input events

Question: Do we treat pages with slow-scrolling-regions (global touchstart, etc) the same as ones without?
Verdict: yes, until there is a CSP-like policy that makes it impossible to add these handlers

Key Data, Goals, and Scoring

milliseconds: navigationStart ─⇥ visually ready
milliseconds: navigationStart ─⇥ thread ready
milliseconds: visually ready ─⇥ thread ready
- if the delta between "looks done" and "is ready" is bigger and bigger.. then you're tanking the UX and the score should represent that.

- Goal: `max(visReady,threadReady)`should be < 5000ms - Scoring TBD

The text was updated successfully, but these errors were encountered:

danbeam · 2016-03-29T18:31:31Z

Main thread is ~available

Calculation TBD.

Maybe this is when there's enough yielding that the UI can change or interact or show progress every 16.7ms (i.e. 60 FPS)?

5. Any other ideas?

Maybe mention that deferred work may be scheduled and run while not affecting TTI (as long as it's not clogging things up)?

/cc @natduca @esprehn

The current SH metrics, Responsiveness and Efficiency, only measure aspects of the user experience that actually happened. Was this interaction actually slow? However, we know of certain types of behaviors that sites can engage in that are risky, and might have impacted responsiveness if the user had behaved differently. For example, if sites spend longer than 50ms on a single given javascript task, then that would impact responsiveness if the user attempts to interact with the page during it. It's like speeding: it's hazardous even if it doesn't cause a crash. We have identified other types of hazards, but we're starting with long idle tasks. This CL produces a single 'hazard' metric for a given trace, with diagnostic values for each IdleExpectation. This CL scores the risk of each long idle task as if it had impacted an ideal Scroll Response that had been minding its own business. Longer idle tasks are given worse (higher) hazard scores than relatively shorter idle tasks. Extending the analogy, the faster you speed, the higher your ticket fine. The per-UE hazard scores are perceptually blended like responsiveness scores, but require a different weighting function since they are smallerIsBetter. test_data/trace_amp-obama-2.json is a good manual test case. This metric may eventually be renamed to something biggerIsBetter like "readiness" or "availability" per GoogleChrome/lighthouse#27 BUG=catapult:#1242 Review URL: https://codereview.chromium.org/1823323002

paulirish · 2016-04-06T21:59:16Z

I've updated the definition of the TTI above metric based on latest discussions with @benshayden, @skyostil, japhet@, @pbakaus and others.

bryanmcquade · 2016-05-04T17:06:32Z

Are we sure TTI should block on domContentLoadedEventEnd? I find that many pages start to become interactable before parsing is complete. If for instance there's a script in the footer inserted via doc.write, this can delay DCL firing when the CPU will be idle and the page interactive.

That said, I think it's also reasonable to discourage pages from running the parser a long time after their content is usable - we can message to devs that they shouldn't do this and if they do so, our loading indicators may stay visible longer than they would like. But that also means that some existing content will continue to show a loading indicator / not be considered interactive until after it has reached the interactive state.

paulirish · 2016-05-04T22:48:54Z

Are we sure TTI should block on domContentLoadedEventEnd?

Hah. Pat pretty much the same comment yesterday on the Progressive page Load Metrics doc yesterday(https://docs.google.com/document/d/1oiy0_ych1v2ADhyG_QW7Ps4BNER2ShlJjx2zCbVzVyY/edit#)

Quoting him, then me..

This isn't necessarily tied to anything visual. A site can have a lot of js at the end of the body that needs to get loaded/parsed/executed that will delay DCL but may not cause any visual change to the page, though by-definition if you are waiting for the progress indicator this may be included anyway as part of that logic.

Fair. This isn't directly a visual dependency, so I'll adjust that.

I agree that DCL be a direct dependency here as its not visual. Main thread availability will tackle CPU contention, and layout stability for the rest of the blocking script impact.

paulirish added the metric label Mar 18, 2016

paulirish modified the milestone: Lighthouse MVP Mar 30, 2016

paulirish assigned paullewis Apr 4, 2016

benshayden mentioned this issue Apr 6, 2016

Invert hazardMetric to inputReadinessMetric catapult-project/catapult#2224

Closed

paulirish changed the title ~~metric: Ready for interaction~~ metric: Time to Interactive Apr 6, 2016

paulirish mentioned this issue Apr 7, 2016

metric: Thread availability #28

Closed

paulirish modified the milestone: Lighthouse MVP Apr 8, 2016

paulirish added this to the Lighthouse 1.0 milestone May 6, 2016

paulirish mentioned this issue Jun 27, 2016

WIP: Time to Interactive metric #450

Closed

paulirish closed this as completed in d0d3829 Jul 22, 2016

pmeenan mentioned this issue Dec 13, 2016

Add a metric for interactivity catchpoint/WebPageTest#781

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

metric: Time to Interactive #27

metric: Time to Interactive #27

paulirish commented Mar 18, 2016 •

edited

Loading

danbeam commented Mar 29, 2016

paulirish commented Apr 6, 2016

bryanmcquade commented May 4, 2016

paulirish commented May 4, 2016

metric: Time to Interactive #27

metric: Time to Interactive #27

Comments

paulirish commented Mar 18, 2016 • edited Loading

Key goals for the metric:

A working definition for Time To Interactive

Timestamp: Visually ready for interaction

Timestamp: Main thread ready for interaction

Main-thread dependent input events

Key Data, Goals, and Scoring

danbeam commented Mar 29, 2016

paulirish commented Apr 6, 2016

bryanmcquade commented May 4, 2016

paulirish commented May 4, 2016

paulirish commented Mar 18, 2016 •

edited

Loading