fix(getSelector): Ensure nodename is eescaped #566

WilcoFiers · 2017-10-10T11:03:42Z

Closes #563

rdeltour · 2017-10-11T15:11:34Z

Thanks @WilcoFiers!
Would it be possible however to use the local name instead of the escaped node name in the selector? Or with an "any namespace" type selector? On an XHTML document, the selector m\\:ci wouldn’t match whereas ci or *|ci would.

WilcoFiers · 2017-10-11T17:51:08Z

@rdeltour What's the the use case for that? When would the same system want to change the namespace, even though the element would be the same. I think it's a little clearer what element you're supposed to look at with the namespace specified, no?

rdeltour · 2017-10-12T08:44:39Z

What's the the use case for that?

The use case is to make selectors match the element in an XHTML context, such as in EPUB.

Let me try to clarify. This has to deal with whether the element has a namespace or not in the DOM, which depends on how the document was parsed (for instance, as XML or as HTML).

Consider this example HTML document:

<!doctype html>
<title>Example</title>
<meta charset="utf-8" />
<h1>Hi</h1>
<math xlink:href="#foo"></math>
<div xlink:href="#foo"></div>
<foo:bar></foo:bar>

It's HTML, and parsed as such by a browser. On this document:

document.querySelectorAll('[xlink\\:href]') will return the div element
document.querySelectorAll('[*|href]') will return the math element
document.querySelectorAll('foo\\:bar') will return the foo:bar element

It's all in the rules defined by the HTML parsing algorithm. For instance, when the parser encounters an xlink:href attribute on an SVG or MathML element, it adjusts it to put it in the XLink Namespace.

Now, consider this XHTML document:

<html xmlns="http://www.w3.org/1999/xhtml" xmlns:m="http://www.w3.org/1998/Math/MathML" xml:lang="en" lang="en">
<head>
  <meta charset="utf-8" />
  <title>Example</title>
</head>
<body>
<h1>Hi</h1>
<m:math></m:math>
</body>
</html>

If parsed as XHTML (for instance, put it in a file with the '.xhtml' extension and open it in a browser, or serve it with the application/xhtml+xml media type, then:

document.querySelectorAll('m\\:math) is empty
document.querySelectorAll('math) contains the m:math element
document.querySelectorAll('*|math) contains the m:math element

Again, the reason lies in the parsing algorithm. The MathML namespace is known and defined, so the local name is just "math" and the namespace is "http://www.w3.org/1998/Math/MathML".

So, what should aXe do (ideally)?

The easy approach would be to create *|name selectors for prefixed element names. That would probably work in most of the cases, since prefixed foreign names are very rarely found in HTML, and it would work in an XHTML context.
This is assuming that you're only creating type selectors (i.e. for elements). If you're also creating attribute selectors, you'd have to make cases depending on foreign attribute adjustments.

A more correct approach would be to define the selector based on the value of document.contentType. If it is text/html, the selector would be prefix\\:localname; if it is application/xhtml+xml, the selector would be *|localname.

Voilà. I hope it makes sense :-)
I'm happy to help with the PR if needed (btw, is there a way to run a single test in axe without having to run everything with grunt test? I couldn't find it at first sight, and it's not in the docs either).

WilcoFiers · 2017-10-12T12:08:52Z

Makes sense, and I learned something new today :D. Thanks Romain. @dylanb What do you think?

Also, yes you can run specific tests by doing --grep="test name".

dylanb · 2017-10-18T12:35:18Z

I think we should implement option 2 because it will only change selectors for XHTML documents which is less disruptive to our users

WilcoFiers · 2017-10-18T14:48:56Z

Agreed. Any idea how to detect what parser was used on a page? I want to avoid changing the selectors as much as possible. If we can make it so that localName is used in xml documents, and nodeName in all other cases, that'd be ideal. Do you want to take a stab at updating this PR @rdeltour ?

rdeltour · 2017-10-18T15:02:37Z

Agreed. Any idea how to detect what parser was used on a page?

I think we can look at document.contentType

Do you want to take a stab at updating this PR @rdeltour ?

Sure, I can try in the coming days (end of next week at the latest).

- by default, ensure the nodename is escaped - for XHTML documents, only use the local name Replaces dequelabs#566 Closes dequelabs#563

marcysutton · 2017-10-25T00:59:28Z

I saw a real-world case of invalid/namespaced HTML tags rendered from the server today, and they took down axe-core and our extensions. Here's an example:

<search:facet-list></search:facet-list>

I have a question out as to whether this was an XHTML or HTML5 doctype, but I'd be curious to see if namespaced HTML tags in a regular document like this cause any problems for assistive technologies. If they do cause problems, it might be a good thing for axe-core to flag.

rdeltour · 2017-10-25T01:06:31Z

I'd be curious to see if namespaced HTML tags in a regular document like this cause any problems for assistive technologies. If they do cause problems, it might be a good thing for axe-core to flag.

Good question. To be honest I have no idea, but I suspect most ATs would just ignore it (as HTML parsers do).
If they do have a problem to process them, yes it should certainly be flagged! (using a specific rule rather than the current syntax error of course).

WilcoFiers · 2017-11-14T12:34:11Z

PR superseded by #582

- by default, ensure the nodename is escaped - for XHTML documents, only use the local name Replaces dequelabs#566 Closes dequelabs#563

) * feat(utils): add function `isXHTML` to test if a document node is XHTML * test(utils): add a test for `axe.utils.isXHTML` on an XHTML document * fix(getSelector): improve selectors for namespaced elements - by default, ensure the nodename is escaped - for XHTML documents, only use the local name Replaces #566 Closes #563 * test(getSelector): add a test for `axe.utils.getSelector` on a namespaced XHTML element

Co-authored-by: Steven Lambert <2433219+straker@users.noreply.github.com>

fix(getSelector): Ensure nodename is eescaped

e10fab9

WilcoFiers requested review from marcysutton and isner October 10, 2017 12:00

rdeltour mentioned this pull request Oct 24, 2017

Fix getSelector for namespaced elements + introduce XHTML fixtures #582

Merged

marcysutton mentioned this pull request Nov 3, 2017

Failed to execute querySelectorAll #599

Closed

WilcoFiers closed this Nov 14, 2017

WilcoFiers deleted the selector-escape branch November 14, 2017 12:34

mrtnvh pushed a commit to mrtnvh/axe-core that referenced this pull request Nov 24, 2023

fix: only set allowedOrigin when needed (dequelabs#566)

a83907b

Co-authored-by: Steven Lambert <2433219+straker@users.noreply.github.com>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix(getSelector): Ensure nodename is eescaped #566

fix(getSelector): Ensure nodename is eescaped #566

WilcoFiers commented Oct 10, 2017

rdeltour commented Oct 11, 2017

WilcoFiers commented Oct 11, 2017

rdeltour commented Oct 12, 2017

WilcoFiers commented Oct 12, 2017

dylanb commented Oct 18, 2017

WilcoFiers commented Oct 18, 2017

rdeltour commented Oct 18, 2017

marcysutton commented Oct 25, 2017

rdeltour commented Oct 25, 2017 •

edited

WilcoFiers commented Nov 14, 2017

fix(getSelector): Ensure nodename is eescaped #566

fix(getSelector): Ensure nodename is eescaped #566

Conversation

WilcoFiers commented Oct 10, 2017

rdeltour commented Oct 11, 2017

WilcoFiers commented Oct 11, 2017

rdeltour commented Oct 12, 2017

WilcoFiers commented Oct 12, 2017

dylanb commented Oct 18, 2017

WilcoFiers commented Oct 18, 2017

rdeltour commented Oct 18, 2017

marcysutton commented Oct 25, 2017

rdeltour commented Oct 25, 2017 • edited

WilcoFiers commented Nov 14, 2017

rdeltour commented Oct 25, 2017 •

edited