Editorial: refactor link headers to be declared per-type #7866

noamr · 2022-04-28T08:17:30Z

At least two implementers are interested (and none opposed):
- N/A: editorial
Tests are written and can be reviewed and commented upon at:
- N/A: editorial
Implementation bugs are filed:
- N/A: editorial

(See WHATWG Working Mode: Changes for more details.)

/browsing-the-web.html ( diff )
/images.html ( diff )
/index.html ( diff )
/links.html ( diff )
/semantics.html ( diff )

domenic

I started doing more tactical review but then realized we should probably go further and change up our strategy. This seems directionally right (more polymorphism/abstraction) but I think we need a bit more of a holistic revamp instead of just adding abstractions over time.

Thoughts very welcome.

domenic · 2022-04-29T19:27:51Z

source

-  particular their influence on a <code>Document</code>'s <span>script-blocking style sheet
-  counter</span>, is not defined. See <a href="https://github.com/whatwg/html/issues/4224">issue
-  #4224</a> for discussion on integrating this into the spec.</p>
+  <p>Link types that support loading via a `<code data-x="http-link">Link</code>` should supply a


"should" seems inappropriate here. I think just removing the word or replacing it with "will" is better.

Actually, maybe I'm just confused. Now each link type can have up to three algorithms?

"process a link from options"

"process the linked resource from options"

"fetch and process the linked resource"

(also there's "appropriate times"...)

This seems confusing, partially because the names, but partially because the setup.

Can we think harder about a clean setup here? It might expand this PR a good bit but I think it's worthwhile before we go too far.

One idea, not sure if it works:

Every link type can define:

"Appropriate times to process the link element"

"Process the link element steps". This takes a link processing options.

"Process the link header steps". This takes a link processing options.

We have a section of helpers. This could include things like:

"default process the link element steps" and maybe "default process the link header steps". (No longer sure if this is a good idea... who is using the default directly, these days?)

"create a link element request"

...more...

Importantly, none of these are actually hooked up by default, like the current "default fetch and process the linked resource" steps are.

We define somewhere that for every link element, when "the appropriate times to process the link element" come to pass, we 1) gather link processing options; 2) pass them to "process the link header steps" for each rel="" token.

Similarly, we change the "process link headers" algorithm to dispatch to the appropriate "process the link header steps" for each rel="", after collecting link processing options.

Now, we go through every link relation type and define its "appropriate times", "process the link element", "process the link header".

These definitions can often be "do nothing", e.g. "process the link header" for stylesheet.

These definitions can use shared helpers from the section mentioned above.

These definitions can define type-specific helpers (e.g. "preconnect") and share them between "process the link element" and "process the link header".

Yea I was thinking that we can go even further - to add an optional link element to link processing options, and to have one processing function per link instead of 2/3. If that processing function doesn't do anything special with pre-document options ("early hints"), it simply waits until either the element is resolved, or the document is resolved (and then creates a dummy element).

This would make link header / early hints work by default for any type of link, with special early processing when we see fit. Of course we can still limit it as an application decision.

@domenic I did a big refactor, I think it's much cleaner now. WATTSI is down though so no previews yet :)

Hmm, I'm unsure if the refactor is complete or what the intention is. I see now preload + prefetch both have "fetch and process the linked resource" and "process a link from options", but the former calls the latter which is a bit confusing. And the "default fetch and process the linked resource" still exists. And "process link headers" is no longer rel-restricted; it just always calls "process a link from options" which will crash if rel is not preload or preconnect?

What did you think about my above sketch? Maybe we should work together on a proposal for what the whole spec should look like before doing another draft?

Hmm, I'm unsure if the refactor is complete or what the intention is. I see now preload + prefetch both have "fetch and process the linked resource" and "process a link from options", but the former calls the latter which is a bit confusing.

Yes, the idea is that the options are created from either the link or the header.

And the "default fetch and process the linked resource" still exists. And "process link headers" is no longer rel
restricted; it just always calls "process a link from options" which will crash if rel is not preload or preconnect?

"process a link from options" is only called if it's defined for that rel (I'll double check that I didn't delete that part by mistake)

What did you think about my above sketch? Maybe we should work together on a proposal for what the whole spec should look like before doing another draft?

The PR is loosely based on it but I'll reply to it to make things clearer. But I accept the offer to work on it together rather than shoot more PRs. :)

Actually, maybe I'm just confused. Now each link type can have up to three algorithms?

"process a link from options"

"process the linked resource from options"

"fetch and process the linked resource"

The first two were supposed to be the same one, there was a leftover...

(also there's "appropriate times"...)

That didn't really change though

This seems confusing, partially because the names, but partially because the setup.

Can we think harder about a clean setup here? It might expand this PR a good bit but I think it's worthwhile before we go too far.

One idea, not sure if it works:

Every link type can define:

"Appropriate times to process the link element"

"Process the link element steps". This takes a link processing options.

"Process the link header steps". This takes a link processing options.

We have a section of helpers. This could include things like:

"default process the link element steps" and maybe "default process the link header steps". (No longer sure if this is a good idea... who is using the default directly, these days?)

"create a link element request"

...more...

Importantly, none of these are actually hooked up by default, like the current "default fetch and process the linked resource" steps are.

We define somewhere that for every link element, when "the appropriate times to process the link element" come to pass, we 1) gather link processing options; 2) pass them to "process the link header steps" for each rel="" token.

You mean, to refactor all the link types that don't have headers? I think that would be great but it's a lot of work and the main benefit of this refactor is to align link headers/elements, which are currently only used by preload/preconnect (and later modulepreload). Is it worth it to refactor all the links?

Similarly, we change the "process link headers" algorithm to dispatch to the appropriate "process the link header steps" for each rel="", after collecting link processing options.

Now, we go through every link relation type and define its "appropriate times", "process the link element", "process the link header".

These definitions can often be "do nothing", e.g. "process the link header" for stylesheet.

These definitions can use shared helpers from the section mentioned above.

These definitions can define type-specific helpers (e.g. "preconnect") and share them between "process the link element" and "process the link header".

I went here in a direction where both header/element create "options" and process that... it maps to implementations pretty nicely and does not require additional helpers. But I'm also Ok with this proposal of having the helpers named after the action they perform, I can see how it might be more readable then creating those options structs.

You mean, to refactor all the link types that don't have headers? I think that would be great but it's a lot of work and the main benefit of this refactor is to align link headers/elements, which are currently only used by preload/preconnect (and later modulepreload). Is it worth it to refactor all the links?

That was what I'm suggesting, mainly because I think it's just too confusing what supports headers/early hints and what doesn't, otherwise. It was relatively clear before this PR when early hints/link headers were centralized; each algorithm said "bail out if not rel=X or Y". But if we introduce polymorphism then it's much less clear.

I went here in a direction where both header/element create "options" and process that... it maps to implementations pretty nicely and does not require additional helpers. But I'm also Ok with this proposal of having the helpers named after the action they perform, I can see how it might be more readable then creating those options structs.

I like the options structs! I just think we need clearly-named algorithms, or some other indication of what is supported by each rel. (E.g., maybe "process the options" + "supported link modes: early hints/headers/element".)

source

noamr · 2022-05-04T11:48:00Z

@domenic: Refactored again based on our conversations (above and on Matrix).

domenic

OK, I like this. It doesn't go quite as far as I was suggesting (which also involved revamping the non-header parts of links to use a similar structure, e.g. based on the options and getting rid of the defaults-and-overrides structure). But that's fine for now.

source

domenic · 2022-05-04T21:16:44Z

I pushed a commit with editorial fixups.

I remain concerned about the increased reliance on spinning the event loop. In particular, when processing early hints, I don't think there is any event loop; the Window hasn't been created yet, and as such neither has the agent and its event loop. So the definition of "spin the event loop", which relies on being called from the main thread (i.e. in a task), doesn't seem applicable.

noamr · 2022-05-05T04:47:09Z

I pushed a commit with editorial fixups.

I remain concerned about the increased reliance on spinning the event loop. In particular, when processing early hints, I don't think there is any event loop; the Window hasn't been created yet, and as such neither has the agent and its event loop. So the definition of "spin the event loop", which relies on being called from the main thread (i.e. in a task), doesn't seem applicable.

Good point, I'll bring back the callback instead. I wish there was a way to write code that feels more like promises, having callback everywhere feels like an inflation of indentation sometimes.

domenic

Focusing on trying to get the options structure right...

source

noamr · 2022-05-05T15:47:41Z

Focusing on trying to get the options structure right...

Fixed, I hope :)

source

noamr · 2022-05-16T08:15:20Z

Anything left @domenic? Thanks!

source

domenic · 2022-05-17T00:01:17Z

Sorry about the conflict with #7935, but that might help simplify this a bit, as I think now "preload" doesn't need a return value.

noamr · 2022-05-17T07:29:50Z

Sorry about the conflict with #7935, but that might help simplify this a bit, as I think now "preload" doesn't need a return value.

Now worries, I initiated the removing of blocking in the first place so no apologies needed :)

domenic

Also needs a rebase because of the integrity checks that just merged :)

source

noamr · 2022-05-19T08:22:42Z

Also needs a rebase because of the integrity checks that just merged :)

Rebased and fixed remaining stuff.

domenic

Down to only a few minor questions left. Sorry for the larger delay; as sometimes happens with large reviews, it gets scarier in my head the longer I delay... but I think this should be the last round.

source

domenic · 2022-05-24T18:53:40Z

source

-        data-x="concept-response">response</span> <var>response</var>:</p>
+     <li><p>If <var>options</var>'s <span data-x="link options href">href</span> is the empty string
+     and <var>options</var>'s <span data-x="link options source set">source set</span> is null,
+     then return.</p></li>


So if I am reading this correctly, "process link headers" accepts empty string/no-srcset links (by parsing the empty string relative to the base URL), but "process early hints" rejects them. Is that right?

Also, "return" seems unlikely to be correct here; you probably want "continue"?

I'll remove this line, and each rel will deal with this individually. Only preload accepts source-sets.

Can you add tests for the empty string behavior? (Does not block merging this.)

source

noamr · 2022-05-25T19:10:13Z

Down to only a few minor questions left. Sorry for the larger delay; as sometimes happens with large reviews, it gets scarier in my head the longer I delay... but I think this should be the last round.

Rebased and clarified/fixed remaining bits. I hope we're near completion :)

This was referenced Apr 28, 2022

modulepreload in Link header & early hints #7862

Open

Process subresource link headers whatwg/fetch#1409

Closed

domenic reviewed Apr 29, 2022

View reviewed changes

noamr force-pushed the link-header-editorial branch 2 times, most recently from 72fefe9 to 42147e4 Compare May 4, 2022 11:03

noamr added the topic: resource hints (inc. preload) label May 4, 2022

domenic mentioned this pull request May 4, 2022

Rearrange link processing model stuff #7890

Open

domenic reviewed May 4, 2022

View reviewed changes

domenic reviewed May 5, 2022

View reviewed changes

source Show resolved Hide resolved

source Show resolved Hide resolved

source Show resolved Hide resolved

source Show resolved Hide resolved

source Show resolved Hide resolved

source Show resolved Hide resolved

domenic reviewed May 6, 2022

View reviewed changes

noamr mentioned this pull request May 8, 2022

WIP: Subresource link header #7904

Closed

3 tasks

domenic reviewed May 16, 2022

View reviewed changes

source Show resolved Hide resolved

source Show resolved Hide resolved

source Show resolved Hide resolved

source Show resolved Hide resolved

source Show resolved Hide resolved

source Show resolved Hide resolved

source Show resolved Hide resolved

source Show resolved Hide resolved

noamr force-pushed the link-header-editorial branch from e1ec88d to 65db91c Compare May 17, 2022 07:59

domenic reviewed May 19, 2022

View reviewed changes

noamr added 2 commits May 19, 2022 08:06

Editorial: refactor link headers to be declared per-type

ce1b5d1

Rebase and fix some nits

29b2676

noamr force-pushed the link-header-editorial branch from 65db91c to 29b2676 Compare May 19, 2022 08:22

Nit fixes

49d74e9

domenic reviewed May 24, 2022

View reviewed changes

noamr added 2 commits May 25, 2022 22:36

More nits

9d2fc68

leftover

e91c47e

noamr and others added 3 commits May 25, 2022 22:39

leftover

d066692

Fixes

b3cc045

Some rewrapping

ab280a2

domenic approved these changes May 26, 2022

View reviewed changes

domenic merged commit 6712444 into whatwg:main May 26, 2022

noamr deleted the link-header-editorial branch May 26, 2022 17:03

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Editorial: refactor link headers to be declared per-type #7866

Editorial: refactor link headers to be declared per-type #7866

noamr commented Apr 28, 2022 •

edited by pr-preview bot

domenic left a comment

domenic Apr 29, 2022

domenic Apr 29, 2022

noamr Apr 30, 2022

noamr May 2, 2022

domenic May 3, 2022

noamr May 3, 2022 •

edited

noamr May 3, 2022

domenic May 3, 2022

noamr commented May 4, 2022

domenic left a comment

domenic commented May 4, 2022

noamr commented May 5, 2022

domenic left a comment

noamr commented May 5, 2022

noamr commented May 16, 2022

domenic commented May 17, 2022

noamr commented May 17, 2022

domenic left a comment

noamr commented May 19, 2022

domenic left a comment

domenic May 24, 2022

noamr May 25, 2022

domenic May 26, 2022

noamr commented May 25, 2022

Editorial: refactor link headers to be declared per-type #7866

Editorial: refactor link headers to be declared per-type #7866

Conversation

noamr commented Apr 28, 2022 • edited by pr-preview bot

domenic left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

noamr May 3, 2022 • edited

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

noamr commented May 4, 2022

domenic left a comment

Choose a reason for hiding this comment

domenic commented May 4, 2022

noamr commented May 5, 2022

domenic left a comment

Choose a reason for hiding this comment

noamr commented May 5, 2022

noamr commented May 16, 2022

domenic commented May 17, 2022

noamr commented May 17, 2022

domenic left a comment

Choose a reason for hiding this comment

noamr commented May 19, 2022

domenic left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

noamr commented May 25, 2022

noamr commented Apr 28, 2022 •

edited by pr-preview bot

noamr May 3, 2022 •

edited