Reflect preload information (in ResourceTiming?) #303

Open
noamr opened this issue Oct 26, 2021 · 23 comments

@noamr (Contributor) commented Oct 26, 2021

Right now it's impossible to know whether a resource was used as part of a preload.
Preloading the resource would generate the RT entry, and in some cases it would be reused without a new fetch (e.g. if it's in the image cache), and in some cases there would be a new fetch that would go to cache (with transferSize 0).

I suggest being explicit about it, and saying for a subsequent fetch whether it was served from the network, a service worker, the HTTP cache, or a preload.
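
(For illustration, a rough sketch of the guesswork this currently requires: pairing RT entries by URL and inferring cache use from transferSize, which cannot tell the HTTP cache, a service worker, or a consumed preload apart.)

```js
// Heuristic available today: pair a preload's RT entry with a later entry for
// the same URL and guess how it was served from transferSize.
const entries = performance.getEntriesByType('resource');
const preloads = entries.filter((e) => e.initiatorType === 'link');

for (const preload of preloads) {
  const reuse = entries.find((e) => e !== preload && e.name === preload.name);
  if (!reuse) {
    // Either consumed without a second fetch (e.g. from the image cache) or
    // never used at all - indistinguishable today.
    continue;
  }
  // transferSize 0 usually means "served from some cache", but it doesn't say
  // whether that was the HTTP cache, a service worker, or the preload itself.
  console.log(preload.name, reuse.transferSize === 0 ? 'cache-ish' : 'network');
}
```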

@yoavweiss (Contributor):

The downside of an explicit entry for the preload-reuse resource is that it's a change from what happens today, so it might be worthwhile to run it by RUM folks and see how much confusion it's likely to create.

/cc @nicjansma @andydavies @cliffcrocker

@noamr (Contributor, Author) commented Oct 26, 2021

> The downside of an explicit entry for the preload-reuse resource is that it's a change from what happens today, so it might be worthwhile to run it by RUM folks and see how much confusion it's likely to create.
>
> /cc @nicjansma @andydavies @cliffcrocker

It won't always be different from how it is today. It depends on the status of the cache at that moment. And in any case it would be a near-zero-time entry.

@noamr (Contributor, Author) commented Oct 26, 2021

Also, "what happens today" is different across browsers :)

If we go with a clear definition of preload (like here) and a clearer definition of what an RT entry represents (we said in the F2F that it should represent a fetch), then a preload and a preload-consume are two fetches, the second one reusing a cloned response from the first one, which would generate two resource timing entries (one with initiatorType "link" and one with initiatorType "img" or whichever resource), the second of which would be zero-ish.
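
(For illustration, a sketch of how the two-entry model described above might look to script; the near-zero "consume" entry is hypothetical until this is specified.)

```js
// Group RT entries by URL and pair a preload ("link") entry with the entry of
// the fetch that consumed it, under the hypothetical two-entry model.
const byUrl = new Map();
for (const e of performance.getEntriesByType('resource')) {
  if (!byUrl.has(e.name)) byUrl.set(e.name, []);
  byUrl.get(e.name).push(e);
}

for (const [url, group] of byUrl) {
  const preload = group.find((e) => e.initiatorType === 'link');
  const consume = group.find((e) => e.initiatorType !== 'link');
  if (preload && consume) {
    // The consuming entry's duration would be zero-ish, since it reuses a
    // cloned response from the preload.
    console.log(url, 'consumed', consume.startTime - preload.responseEnd, 'ms after the preload finished');
  }
}
```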

@andydavies:

> If we go with a clear definition of preload (like here) and a clearer definition of what an RT entry represents (we said in the F2F that it should represent a fetch), then a preload and a preload-consume are two fetches, the second one reusing a cloned response from the first one, which would generate two resource timing entries (one with initiatorType "link" and one with initiatorType "img" or whichever resource), the second of which would be zero-ish.

From an RT perspective I think preload should behave as fetches from the pre-parser do, as it's essentially the same behaviour - a resource is fetched ahead of time and used later - one is browser-initiated, and the other author-initiated.

So if preload results in two RT entries then everything that's fetched by the pre-parser should also result in two entries - which I think would be very weird.

I think the RT entry for preload should match the network or HTTP cache fetch, as it does for other resources, and the Reporting API should be used to flag unused preloads.

That won't provide information via RT that something was used, but it will flag up unused ones.
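
(For illustration, a sketch of how such a Reporting API hook might be consumed; the "preload-unused" report type is hypothetical and does not exist today.)

```js
// Hypothetical: if unused preloads were surfaced through the Reporting API,
// a RUM library could collect them like this.
const observer = new ReportingObserver(
  (reports) => {
    for (const report of reports) {
      // report.body shape is illustrative only.
      console.log('unused preload:', report.body && report.body.url);
    }
  },
  { types: ['preload-unused'], buffered: true } // hypothetical report type
);
observer.observe();
```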

@noamr (Contributor, Author) commented Oct 27, 2021

> > If we go with a clear definition of preload (like here) and a clearer definition of what an RT entry represents (we said in the F2F that it should represent a fetch), then a preload and a preload-consume are two fetches, the second one reusing a cloned response from the first one, which would generate two resource timing entries (one with initiatorType "link" and one with initiatorType "img" or whichever resource), the second of which would be zero-ish.
>
> From an RT perspective I think preload should behave as fetches from the pre-parser do, as it's essentially the same behaviour - a resource is fetched ahead of time and used later - one is browser-initiated, and the other author-initiated.

But the pre-parser case has a substantial difference: you know the element that's going to use the resource, and that early fetch is not entirely observable - e.g. it has the same initiatorType - information that is not available to a preload.

> So if preload results in two RT entries then everything that's fetched by the pre-parser should also result in two entries - which I think would be very weird.

I don't think having two entries for preload should necessarily mean two entries for the pre-parser.

> I think the RT entry for preload should match the network or HTTP cache fetch, as it does for other resources, and the Reporting API should be used to flag unused preloads.
>
> That won't provide information via RT that something was used, but it will flag up unused ones.

Sure, that's also an option.

@noamr (Contributor, Author) commented Oct 30, 2021

Actually, based on the newer developments in the preload PR, I tend to agree with having some "unused preload" flag (maybe something like readyState on the link element?) and/or a preloadConsumedTime attribute on the RT entry.

@noamr (Contributor, Author) commented Nov 2, 2021

Proposal:

  • A preload link, once loaded, creates an RT entry immediately, but doesn't queue it to the PerformanceObservers.
  • Once the preload is consumed, it marks a preloadConsumedTime timestamp on the same entry, and the entry gets queued (maybe it somehow registers the consumer's initiatorType).
  • The duration would be based on responseEnd like today, and the preloadConsumedTime would most likely be after that, though in some cases it can be before (if the resource is used before it's fully loaded).

This is in line with navigation timing, where the entry is created when we know about it, but only gets queued when its final attribute is set.

It also allows us to report unused preloads in RUM without adding a timeout - a RUM library or a document can check on its own whether we have preloaded links that don't match RT entries with corresponding preloadConsumedTime.
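
(For illustration, a minimal sketch of that check under the proposal; preloadConsumedTime is the proposed attribute, not something browsers expose today.)

```js
// Preload entries exist in the RT buffer under the proposal, but only get a
// preloadConsumedTime (and get queued to observers) once they are consumed.
const unusedPreloads = performance
  .getEntriesByType('resource')
  .filter((e) => e.initiatorType === 'link' && !e.preloadConsumedTime);

console.log('preloads not (yet) consumed:', unusedPreloads.map((e) => e.name));
```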

WDYT, @andydavies @yoavweiss

@yoavweiss (Contributor):

I like it!
/cc @npm1 @philipwalton @pmeenan for their opinions.

@npm1 (Contributor) commented Nov 10, 2021

The proposal seems reasonable to me, but it's unclear from the discussion in the thread what the current behavior is and why we need to change it. Can this be documented here?

@noamr (Contributor, Author) commented Nov 10, 2021

> The proposal seems reasonable to me, but it's unclear from the discussion in the thread what the current behavior is and why we need to change it. Can this be documented here?

The current behavior doesn't mark the time the preload was consumed, and you'd need to guess whether it was consumed based on the existence of another RT entry with the same URL. The idea was to give an extra indication about the use of preloads, which is a request that came up in the preload discussion at TPAC.

Note that this changes the current behavior only in a minor way - the attributes of the preload entry stay the same and a new one is added - the only actual change in behavior is that the preload entry is not queued until consumption.

@nicjansma (Contributor):

@noamr I like your proposal. Two potential challenges though:

  • For HTTP Header-based Preloads, RUM doesn't know what those headers were (or if there were any). So RUM wouldn't be able to cross-check with the RT entries we saw to know which header-based Preloads weren't used.
  • If we're not going to queue them to the PerformanceObservers until they're consumed, would unused Preloads never have an RT entry? This would further "hide" data (and the bytes/time they took) from ResourceTiming, which I'd like to avoid.

I generally don't like tying things to the load event, but maybe we could use that here? If a Preload doesn't get consumed by load, we can queue that entry right then, and maybe mark preloadConsumedTime=-1 or something to indicate that it was a Preload but not used? I expect most Preloads to be used by load, and most RUM to "consume" RT data after load.

@noamr (Contributor, Author) commented Apr 7, 2022

> @noamr I like your proposal. Two potential challenges though:
>
>   • For HTTP Header-based Preloads, RUM doesn't know what those headers were (or if there were any). So RUM wouldn't be able to cross-check with the RT entries we saw to know which header-based Preloads weren't used.

I'm suggesting that the initiator type of link headers would be link-header, the same way early hints get an early-hint initiator type.

>   • If we're not going to queue them to the PerformanceObservers until they're consumed, would unused Preloads never have an RT entry? This would further "hide" data (and the bytes/time they took) from ResourceTiming, which I'd like to avoid.

They will have an RT entry, but it wouldn't be queued to PerformanceObservers.

@noamr (Contributor, Author) commented Apr 7, 2022

> I generally don't like tying things to the load event, but maybe we could use that here? If a Preload doesn't get consumed by load, we can queue that entry right then, and maybe mark preloadConsumedTime=-1 or something to indicate that it was a Preload but not used? I expect most Preloads to be used by load, and most RUM to "consume" RT data after load.

If the RT entries are there but not queued to observers, is this needed? Simply check performance.getEntries*** in your onload to see which preloads were not consumed.

@noamr (Contributor, Author) commented Apr 7, 2022

I think the original idea of having an RT entry for consuming a resource, and differentiating it from the preload entry somehow would be more straightforward - e.g. having a delivery or cacheState attribute that exposes network / http-cache / service-worker / consume-preload or so.
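
(For illustration, a sketch of how that might read from script; the delivery / cacheState attribute and its values are the hypothetical ones from this comment.)

```js
// Hypothetical: an enum on each RT entry saying where the response came from.
for (const e of performance.getEntriesByType('resource')) {
  switch (e.delivery /* or e.cacheState */) {
    case 'network':
    case 'http-cache':
    case 'service-worker':
    case 'consume-preload':
      console.log(e.name, 'served via', e.delivery);
      break;
    default:
      // Not populated by any browser today.
      break;
  }
}
```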

@nicjansma (Contributor):

> I'm suggesting that the initiator type of link headers would be link-header, the same way early hints get an early-hint initiator type.

There was the suggestion that RUM providers could cross-check their HTML-based <link rel="preload"> vs. the observed ResourceTiming entries, to understand Preloads that didn't get used. That wouldn't work for Headers-based Preloads, as we don't know what HTTP headers there were.

> They will have an RT entry, but it wouldn't be queued to PerformanceObservers.

Hm, I don't think we have any precedent for PerformanceObservers "seeing" a different view than performance.getEntries() though -- they've always been viewing the same list.

@noamr (Contributor, Author) commented Apr 7, 2022

> > I'm suggesting that the initiator type of link headers would be link-header, the same way early hints get an early-hint initiator type.
>
> There was the suggestion that RUM providers could cross-check their HTML-based <link rel="preload"> vs. the observed ResourceTiming entries, to understand Preloads that didn't get used. That wouldn't work for Headers-based Preloads, as we don't know what HTTP headers there were.

Right. I'm suggesting that they could cross-check against preload entries in the RT buffer instead.

> > They will have an RT entry, but it wouldn't be queued to PerformanceObservers.
>
> Hm, I don't think we have any precedent for PerformanceObservers "seeing" a different view than performance.getEntries() though -- they've always been viewing the same list.

They see the same view, but don't necessarily get triggered for each entry.
The precedent is the navigation timing entry - it's available from the start, but only queued to the observers after the load event.
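
(For reference, a small sketch of that navigation timing precedent; both calls below are existing APIs.)

```js
// The navigation entry can be read from the timeline early (its fields fill in
// over time) ...
const nav = performance.getEntriesByType('navigation')[0];
console.log('readable immediately, loadEventEnd so far:', nav && nav.loadEventEnd);

// ... while a PerformanceObserver is only notified once the entry is queued,
// around the load event.
new PerformanceObserver((list) => {
  for (const entry of list.getEntries()) {
    console.log('observer sees the entry once it is queued:', entry.loadEventEnd);
  }
}).observe({ type: 'navigation', buffered: true });
```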

@nicjansma (Contributor):

> If the RT entries are there but not queued to observers, is this needed? Simply check performance.getEntries*** in your onload to see which preloads were not consumed.

Ah yep, agreed that would work.

> Right. I'm suggesting that they could cross-check against preload entries in the RT buffer instead.

I see, you're suggesting RUM would:

  • Have a PerformanceObserver active, putting entries into poBuffer
  • At the time of beaconing, also get PerformanceTimeline buffer ptBuffer = performance.getEntriesByType("resource")
  • Compare the list. For anything in the ptBuffer that was .initiatorType="link" (or something new?), see if it's in the poBuffer. If yes, that Preload was consumed. If not, that Preload was wasted.
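
(For illustration, a sketch of the cross-check in the steps above; poBuffer and ptBuffer are just the names used there, and matching is done by URL.)

```js
// Collect what the observer has delivered.
const poBuffer = [];
new PerformanceObserver((list) => poBuffer.push(...list.getEntries()))
  .observe({ type: 'resource', buffered: true });

// At beaconing time, compare against the PerformanceTimeline buffer.
function findWastedPreloads() {
  const ptBuffer = performance.getEntriesByType('resource');
  return ptBuffer
    .filter((e) => e.initiatorType === 'link')
    // Under the proposal, a preload entry would only be queued to observers
    // once consumed, so "not seen by the observer" would mean "wasted".
    .filter((preload) => !poBuffer.some((e) => e.name === preload.name));
}
```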

I can see how that would work, but thinking through a few small gotchas:

  • Small amount of work required to compare the two arrays
  • We're trying to move folks away from using the PT buffer toward only using POs, and this would redirect developers (at least RUM that cares about Preloads) back to using PT
  • Slight risk of the getEntries*() buffer having been filled or cleared, so it's not always going to line up.
  • I'm just a little hesitant to have the PT and PO buffers have mismatched numbers of entries (unless it's well-documented)
  • A RUM provider that only uses POs (which is our recommendation) and doesn't make any changes for this could see fewer PO entries (and would miss expensive unused Preloads)

> The precedent is the navigation timing entry - it's available from the start, but only queued to the observers after the load event.

That's fair. It's helpful for NT that it's the "only" entry in the NT PO buffer (for now), so I think it's a bit easier for developers to know that caveat.

I'm concerned that developers would get confused in general that the PO and PT buffers may be slightly different for Preload reasons. If we did this, we may want some additional guidance/notes in the spec.

Will ponder this a bit more though!

@andydavies:

@noamr @nicjansma

Has there been any discussion of using the Reporting API to tackle this, e.g. report unused preloads to an endpoint rather than having two performance entries? (I did wonder whether bfcache reasons should adopt the same thing.)

(Before I test this) do you know how the browser having to go to cache twice for the same resource is recorded in RT, e.g. an image gets evicted from memory on a low-powered device and the browser has to refetch it from the cache when the visitor scrolls, or such like?

@noamr (Contributor, Author) commented Apr 7, 2022

> @noamr @nicjansma
>
> Has there been any discussion of using the Reporting API to tackle this, e.g. report unused preloads to an endpoint rather than having two performance entries? (I did wonder whether bfcache reasons should adopt the same thing.)

This is not just about unused preloads; it's also for preloads that are used a long time after they're loaded, to help optimize... maybe you didn't need to preload it because it's used very late anyway?

> (Before I test this) do you know how the browser having to go to cache twice for the same resource is recorded in RT, e.g. an image gets evicted from memory on a low-powered device and the browser has to refetch it from the cache when the visitor scrolls, or such like?

Yes, retrieving from the HTTP cache is reflected, with transferSize 0. I suggested exposing an enum of where the resource is retrieved from in this comment, which would allow us to differentiate.

@noamr (Contributor, Author) commented Apr 7, 2022

> > If the RT entries are there but not queued to observers, is this needed? Simply check performance.getEntries*** in your onload to see which preloads were not consumed.
>
> Ah yep, agreed that would work.
>
> > Right. I'm suggesting that they could cross-check against preload entries in the RT buffer instead.
>
> I see, you're suggesting RUM would:
>
>   • Have a PerformanceObserver active, putting entries into poBuffer
>   • At the time of beaconing, also get PerformanceTimeline buffer ptBuffer = performance.getEntriesByType("resource")
>   • Compare the list. For anything in the ptBuffer that was .initiatorType="link" (or something new?), see if it's in the poBuffer. If yes, that Preload was consumed. If not, that Preload was wasted.
>
> I can see how that would work, but thinking through a few small gotchas:
>
>   • Small amount of work required to compare the two arrays
>   • We're trying to move folks away from using the PT buffer toward only using POs, and this would redirect developers (at least RUM that cares about Preloads) back to using PT
>   • Slight risk of the getEntries*() buffer having been filled or cleared, so it's not always going to line up.
>   • I'm just a little hesitant to have the PT and PO buffers have mismatched numbers of entries (unless it's well-documented)
>   • A RUM provider that only uses POs (which is our recommendation) and doesn't make any changes for this could see fewer PO entries (and would miss expensive unused Preloads)
>
> > The precedent is the navigation timing entry - it's available from the start, but only queued to the observers after the load event.
>
> That's fair. It's helpful for NT that it's the "only" entry in the NT PO buffer (for now), so I think it's a bit easier for developers to know that caveat.
>
> I'm concerned that developers would get confused in general that the PO and PT buffers may be slightly different for Preload reasons. If we did this, we may want some additional guidance/notes in the spec.
>
> Will ponder this a bit more though!

I understand the concerns... the alternative would be to have an additional RT entry for the consume. I will present the options in one of the upcoming WG meetings; it would be good to hear more thoughts here in the meantime.

@andydavies:

> > (Before I test this) do you know how the browser having to go to cache twice for the same resource is recorded in RT, e.g. an image gets evicted from memory on a low-powered device and the browser has to refetch it from the cache when the visitor scrolls, or such like?
>
> Yes, retrieving from the HTTP cache is reflected, with transferSize 0. I suggested exposing an enum of where the resource is retrieved from in #303 (comment), which would allow us to differentiate.

So if I load a page that has an image at the top, scroll to the bottom and the image gets evicted from the memory cache, scroll back to the top and the browser reloads the image from cache, I'll get two RT entries - one potentially with a transferSize and one without?

@noamr (Contributor, Author) commented Apr 7, 2022

> > (Before I test this) do you know how the browser having to go to cache twice for the same resource is recorded in RT, e.g. an image gets evicted from memory on a low-powered device and the browser has to refetch it from the cache when the visitor scrolls, or such like?
>
> > Yes, retrieving from the HTTP cache is reflected, with transferSize 0. I suggested exposing an enum of where the resource is retrieved from in #303 (comment), which would allow us to differentiate.
>
> So if I load a page that has an image at the top, scroll to the bottom and the image gets evicted from the memory cache, scroll back to the top and the browser reloads the image from cache, I'll get two RT entries - one potentially with a transferSize and one without?

That is correct.

@noamr (Contributor, Author) commented Apr 14, 2022

Two alternatives: (feel free to propose more)

Separate Entry

  • Add a timing entry to the timeline when the resource is consumed
  • Very flexible
  • Works the same as entries representing resources fetched from the HTTP cache
  • Perhaps differentiate with a delivery property (from cache | network | service-worker | preload)

Additional Property

  • Add a preloadConsume timestamp to the existing preload entry
  • Less disruptive to current timelines
  • An entry might change after its creation, similar to how navigation timing entries may receive activationStart / loadEventStart after the entry is already available.
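
(Illustrative only: how each alternative might surface to script. Neither a delivery value of "preload" nor a preloadConsume timestamp exists today; both names come from the bullets above.)

```js
const entries = performance.getEntriesByType('resource');

// Separate Entry: the consuming fetch gets its own entry, distinguished by a
// hypothetical delivery property.
const consumeEntries = entries.filter((e) => e.delivery === 'preload');

// Additional Property: a hypothetical timestamp is added to the existing
// preload entry once it is consumed.
const consumedPreloads = entries.filter(
  (e) => e.initiatorType === 'link' && e.preloadConsume > 0
);

console.log({ consumeEntries: consumeEntries.length, consumedPreloads: consumedPreloads.length });
```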
