Take encoding into account when parsing link headers & early hints #9715

noamr · 2023-09-10T03:51:44Z

Usually we use the document's encoding when parsing URLs in link headers, but no document encoding exists yet when early hints and Link headers are processed, so we need to use something, probably the charset param of the document's Content-Type header. /cc @bashi
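
To make the stakes concrete, here is a minimal sketch of how the chosen encoding changes the bytes that end up in a URL's query. It uses Python's `urllib.parse.quote` as a stand-in for the URL Standard's percent-encoding, which is an approximation rather than the spec algorithm, and the helper name `encode_query_value` is hypothetical.

```python
from urllib.parse import quote


def encode_query_value(value: str, encoding: str) -> str:
    """Percent-encode a query value using the given character encoding.

    Hypothetical helper: it only approximates what a URL parser does
    with non-ASCII characters in the query component.
    """
    return quote(value, safe="", encoding=encoding)


# The same character yields different bytes, so the server sees
# a different URL depending on which encoding the parser assumed.
utf8_form = encode_query_value("é", "utf-8")
legacy_form = encode_query_value("é", "windows-1252")

print(utf8_form)    # %C3%A9
print(legacy_form)  # %E9
```

A preload hinted via a Link header would therefore only match the later request if both were parsed with the same encoding.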

domenic · 2023-09-10T06:54:36Z

I think using UTF-8 would be better, ignoring the Content-Type. Especially because Content-Type might not arrive by early hints time, right?

noamr · 2023-09-10T07:00:20Z

> I think using UTF-8 would be better, ignoring the Content-Type. Especially because Content-Type might not arrive by early hints time, right?

Right. I think it's a matter of calling steps 3-6 of https://html.spec.whatwg.org/#parse-a-url instead of running the whole algorithm.

bashi · 2023-09-13T05:17:20Z

+1 to using UTF-8. Early hints were introduced recently, so I guess it's not so harmful to assume that servers that speak early hints use UTF-8.

domenic · 2023-09-13T05:52:15Z

It would be good to write tests to see what browsers do for non-early Link headers. Do they use Content-Type, or do they always use UTF-8?

I hope that at least some browsers always use UTF-8, and so we can have the simple rule "if it's a Link header, we use UTF-8; if it's <link>, we use the document's encoding".
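
The simple rule proposed above could be sketched roughly as follows. This is an illustration under stated assumptions, not spec text or any browser's implementation: `LinkSource` and `url_parsing_encoding` are hypothetical names, and treating early hints the same way as Link headers follows the suggestion made earlier in this thread.

```python
from enum import Enum
from typing import Optional


class LinkSource(Enum):
    """Where a link came from (hypothetical classification)."""
    ELEMENT = "element"        # a <link> element in a parsed document
    HEADER = "header"          # a Link header on the final response
    EARLY_HINT = "early-hint"  # a Link header on a 103 Early Hints response


def url_parsing_encoding(source: LinkSource,
                         document_encoding: Optional[str] = None) -> str:
    """Pick the encoding to use when parsing the link's URL."""
    if source is LinkSource.ELEMENT:
        # A <link> element always lives in a document, so its encoding is known.
        if document_encoding is None:
            raise ValueError("a <link> element requires a document encoding")
        return document_encoding
    # Headers and early hints: ignore Content-Type and always use UTF-8.
    return "utf-8"
```

For example, `url_parsing_encoding(LinkSource.EARLY_HINT)` returns `"utf-8"` regardless of any later Content-Type, while `url_parsing_encoding(LinkSource.ELEMENT, "windows-1252")` returns the document's own encoding.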

noamr mentioned this issue Sep 10, 2023

Editorial: refactor parse a url #9709

Merged

noamr added a commit to noamr/html that referenced this issue Sep 20, 2023

Specify URL encoding for links

ec7151c

- Use the document encoding for link elements
- Always use UTF-8 for link headers/early hints

Closes whatwg#9715

noamr linked a pull request Sep 20, 2023 that will close this issue

Specify URL encoding for links (header+element) #9764

Open

4 tasks

shhnjk mentioned this issue Feb 3, 2024

Implement dangling markup injection mitigation #10022

Open

5 tasks
