-
Notifications
You must be signed in to change notification settings - Fork 16
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Augment use of lwsp in ttp:{content,processor}Profiles (#170). #185
Conversation
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Looks good to me.
One slight oddity is that this definition of <lwsp>
permits newlines and carriage returns: I would be very surprised to see this in attribute values of this kind, though I don't have a particular reason not to allow it.
See [1] regarding rules for whitespace normalization in attribute values. |
Thanks, very helpful. Why do we not specify the post-normalization syntax rather than the pre-normalized syntax? That would allow us to replace |
Worth considering. |
Not only that but it may be reasonable for some interchange contexts to constrain document delivery to have pre-normalized attribute values. |
I've just dug into this a bit further and it turns out that traversing all the links from https://w3c.github.io/ttml2/spec/ttml2.html#reduced-infoset-attribute through the term definition and the reference into https://www.w3.org/TR/2004/REC-xml-infoset-20040204/#infoitem.attribute , we already specify attribute values in terms of normalized values in the reduced infoset, so the use of |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
On further inspection, permitting general <lwsp>
is providing for escaped white space characters that would otherwise be normalized according to attribute information item
. I do not think we actually want to permit this; presumably the intent behind proposing to allow white space is merely to indicate that post-normalization, a single #x20
character separates items within attribute values; an explanatory Note could be added to alert the more casual reader to the fact that other white space characters can be included, also leading and trailing ones, but that the syntax representation is for these having been normalized out.
@skynavga pointed out in today's TTWG meeting that in fact TTML1 already allows for general LWSP as delimiters between attribute value terms so the proposal to adopt the post-normalization form only could be a breaking change. On that basis we can go ahead and merge this and I will raise a new issue proposing post-normalized single white space. It could be that the best resolution to this would be to add an informative note, which I think is unfortunate but perhaps unavoidable. |
See issue #170.