Specify attribute term delimiter as post-normalized space #191
Comments
So, the facts are as follows:
In conclusion, we need to retain the current definition of and not refer to XML normalized space. Therefore, no action is required on this issue, so closing.
@skynavga this is a bit surprising. Firstly, we do require attribute value normalisation when constructing the XML infoset, independently of the concrete representation of the document instance, and secondly you seem not to have addressed the third proposal at the end of #191 (comment):
This would ensure that implementations always get a consistent single
@nigelmegitt your proposal contradicts the algorithm specified in https://www.w3.org/TR/REC-xml/#AVNormalize. Furthermore, implementations do not assume that normalization applies to character references, which allow non-normalized whitespace to be inserted in attribute values. For example, TTV tests for the presence of whitespace padding around an attribute value and reports an error if it appears. Testing this verification process requires the ability to insert non-normalized whitespace in this context, which is done using character references. With your proposal, the expansion of character references would be followed by a second pass of normalization, which would prevent testing the padding detection. I would suggest we limit changes to adding a note under B.3.
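To make the padding point concrete, here is a minimal Python sketch of a padding check. This is not TTV's code; the helper name `padded_attributes` is hypothetical, and the check is only an assumption about what "whitespace padding" detection might look like. It relies on the standard library's `xml.etree.ElementTree`, which applies XML attribute-value normalization while parsing.

```python
import xml.etree.ElementTree as ET

# The four XML white space characters: space, tab, line feed, carriage return.
XML_WHITESPACE = " \t\n\r"


def padded_attributes(document):
    """Return the names of attributes whose parsed value starts or ends
    with white space (hypothetical helper, for illustration only)."""
    root = ET.fromstring(document)
    names = []
    for elem in root.iter():
        for name, value in elem.attrib.items():
            if value and (value[0] in XML_WHITESPACE or value[-1] in XML_WHITESPACE):
                names.append(name)
    return names


# A tab written as a character reference survives normalization as a tab,
# so the padding is visible to the check.
print(padded_attributes('<p a="&#x9;x"/>'))

# A value with no leading or trailing white space is not reported.
print(padded_attributes('<p a="x y"/>'))
```

A second normalization pass over the expanded character references would turn the tab into ordinary space handling and make a case like the first one impossible to construct as a test input.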
OK, I do not understand which part of a pre-processing algorithm can be contradictory to a step that comes after it, but on further reflection, most of what I am proposing here is about implementation optimisation, an area we don't need to define normatively. Adding a note under B.3 as you suggest seems like the best way to go. I'll prepare a pull request.
Clarify that unnormalized whitespace may appear in [normalized value] (#191).
Merging this editorial-only PR.
See also #185 and #170 for background: the current use of <lwsp> permits white space even though XML attribute normalization would remove leading and trailing white space and replace intermediate strings of white space with a single #x20 character. My proposal for this was to replace <lwsp> with <nsp>, where:
<nsp>: #x20 after applying the normalization rules in [1]
[1] https://www.w3.org/TR/REC-xml/#AVNormalize
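As a rough illustration of the normalization rules in [1], here is a minimal Python sketch. The function name `normalize_attribute_value` and the `cdata` flag are hypothetical, and the sketch assumes the value contains only literal characters and numeric character references (general entity references are not handled). Note that, per [1], trimming and collapsing of #x20 applies only when the attribute type is not CDATA.

```python
import re

# Matches a hex character reference, a decimal character reference,
# or any single character (including a bare "&", which this sketch
# simply passes through).
_TOKEN = re.compile(r"&#x([0-9A-Fa-f]+);|&#([0-9]+);|(.)", re.DOTALL)


def normalize_attribute_value(raw, cdata=True):
    """Sketch of XML 1.0 attribute-value normalization (assumption:
    no general entity references appear in `raw`)."""
    # Line ends are normalized to #xA before anything else.
    raw = raw.replace("\r\n", "\n").replace("\r", "\n")
    out = []
    for match in _TOKEN.finditer(raw):
        hex_ref, dec_ref, ch = match.groups()
        if hex_ref is not None:
            # Character reference: append the referenced character as-is.
            out.append(chr(int(hex_ref, 16)))
        elif dec_ref is not None:
            out.append(chr(int(dec_ref)))
        elif ch in " \t\n\r":
            # Literal white space: append a single #x20.
            out.append(" ")
        else:
            out.append(ch)
    value = "".join(out)
    if not cdata:
        # Non-CDATA types only: strip leading/trailing #x20 and
        # collapse runs of #x20 to one.
        value = re.sub(" +", " ", value.strip(" "))
    return value


print(repr(normalize_attribute_value("a \t\n b")))
print(repr(normalize_attribute_value(" a \t b ", cdata=False)))
print(repr(normalize_attribute_value("&#x9;a&#x20;", cdata=False)))
```

In this sketch a tab supplied as `&#x9;` is appended unchanged and is untouched by the later #x20-only trimming, which is how unnormalized white space can still appear in a [normalized value].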
Right now, traversing all the links from https://w3c.github.io/ttml2/spec/ttml2.html#reduced-infoset-attribute through the term definition and the reference into https://www.w3.org/TR/2004/REC-xml-infoset-20040204/#infoitem.attribute, we already specify attribute values in terms of normalized values in the reduced infoset, so the use of <lwsp> is actually rather difficult to achieve: anything other than a single #x20 character would have to be escaped. However, it is possible to escape those characters. I do not know why that would be useful.
Some (non-mutually-exclusive) proposals to allow for simpler implementations:
#x20 character between terms.