Clarify the fake token step in fragment parsing #9430

zcorpan · 2023-06-15T13:51:00Z

💥 Error: Wattsi server error 💥

PR Preview failed to build. (Last tried on Jun 15, 2023, 2:18 PM UTC).

More

PR Preview relies on a number of web services to run. There seems to be an issue with the following one:

🚨 Wattsi Server - Wattsi Server is the web service used to build the WHATWG HTML spec.

If you don't have enough information above to solve the error by yourself (or to understand to which web service the error is related to, if any), please file an issue.

Fixes #9428

annevk · 2023-06-15T14:10:01Z

source


    <p>Let this start tag token be the start tag token of the <var
-    data-x="concept-frag-parse-context">context</var> node, e.g. for the purposes of determining
+    data-x="concept-frag-parse-context">context</var> node. This token is only used for determining
    if it is an <span>HTML integration point</span>.</p>


Could you turn this into a single sentence along the lines I suggested? Using "Let" to set an internal member of the context element (or node, we should be consistent) is weird.

annevk

Thanks!

I vaguely wonder if we should run this by at least one other parser expert, e.g., Henri, but if you feel confident I think that ought to suffice.

hsivonen · 2023-07-04T13:57:46Z

We currently have interop on this point: The attributes of the context node are ignored: https://software.hixie.ch/utilities/js/live-dom-viewer/saved/11842

It seems like a bug, but given that it's interoperable, do we want to keep it that way?

In any case, implementation-wise, I don't like writing the spec in the way that copies all the attributes if we only look at one (or currently, as implemented, none). I'd much prefer taking the local name for the context node, the namespace for the context node, and whatever flags actually need to be inspected. Currently in Gecko, there's a flag for quirks mode. If we decide we want to change the browser behavior here, I'd prefer to have a flag that true iff the context node is annotation-xml in the MathML namespace and it has the attribute encoding whose value is an ASCII-case-insensitive match for either "text/html" or "application/xhtml+xml".

That way, it would be clear that the fragment parsing algorithm doesn't actually need a copy of all the attributes.

annevk · 2023-07-04T16:06:26Z

That's true, but then we'd have to change a whole lot more, to pass through that flag and make it do the correct thing. We could make it more precise though and only copy that attribute and only if it has one of those values. That seems like a more minimal fix that still has the desired effect.

And it does seem bad that this would not work as expected, so I'd recommend we all fix it.

Also, I'm guessing we can't turn annotation-xml into a general HTML sink (i.e., ignore the encoding attribute and assume it contains HTML), though maybe @bkardell or @fred-wang could say more about that.

cc @mfreed7

Clarify the fake token step in fragment parsing

a321df0

Fixes #9428

annevk reviewed Jun 15, 2023

View reviewed changes

Fixup

d671455

annevk approved these changes Jun 15, 2023

View reviewed changes

zcorpan requested a review from hsivonen June 15, 2023 15:04

annevk added clarification Standard could be clearer topic: parser labels Jul 4, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Clarify the fake token step in fragment parsing #9430

Clarify the fake token step in fragment parsing #9430

zcorpan commented Jun 15, 2023 •

edited by pr-preview bot

annevk Jun 15, 2023

zcorpan Jun 15, 2023

annevk left a comment

hsivonen commented Jul 4, 2023

annevk commented Jul 4, 2023

Clarify the fake token step in fragment parsing #9430

Are you sure you want to change the base?

Clarify the fake token step in fragment parsing #9430

Conversation

zcorpan commented Jun 15, 2023 • edited by pr-preview bot

💥 Error: Wattsi server error 💥

annevk Jun 15, 2023

Choose a reason for hiding this comment

zcorpan Jun 15, 2023

Choose a reason for hiding this comment

annevk left a comment

Choose a reason for hiding this comment

hsivonen commented Jul 4, 2023

annevk commented Jul 4, 2023

zcorpan commented Jun 15, 2023 •

edited by pr-preview bot