definition of NCName too strict #540
Comments
is well-formed, but Xerces holds on to the original XML 1.0 definition of NCName, which was quite quickly superseded in XML 1.0 by the definition made in XML 1.1. Since eXist uses Xerces, this means that xml:id’s beginning with digits are held to be malformed in eXist, though they are well-formed according to XML 1.0.
@jensopetersen I wonder if the Xerces property See:
In the same vein, see the discussion at relaxng/jing-trang#188. I encountered the issue when trying to use some polytonic Greek characters, such as ͷ (U+0377 GREEK SMALL LETTER PAMPHYLIAN DIGAMMA), which are perfectly kosher NameChars under the XML spec, fifth edition (https://www.w3.org/TR/REC-xml/#d0e804), but alas not a Letter according to the fourth (https://www.w3.org/TR/2006/REC-xml-20060816/#NT-Letter).
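For anyone wanting to check a candidate xml:id by hand, here is a minimal sketch of the fifth-edition rule in Python. It is not code from Xerces, Jing or eXist. The ranges are transcribed from the NameStartChar and NameChar productions of XML 1.0 (Fifth Edition), with the colon left out because an NCName may not contain one. The names `is_ncname_5e`, `NAME_START_RANGES` and `NAME_EXTRA_RANGES` are made up for this illustration.

```python
# Sketch only: NCName check per XML 1.0 (Fifth Edition) name productions,
# minus ":" (excluded from NCName by Namespaces in XML). Hypothetical helper
# names; this is not how Xerces or eXist implement the check.

# NameStartChar ranges, without ":".
NAME_START_RANGES = [
    (0x41, 0x5A), (0x5F, 0x5F), (0x61, 0x7A),
    (0xC0, 0xD6), (0xD8, 0xF6), (0xF8, 0x2FF),
    (0x370, 0x37D), (0x37F, 0x1FFF), (0x200C, 0x200D),
    (0x2070, 0x218F), (0x2C00, 0x2FEF), (0x3001, 0xD7FF),
    (0xF900, 0xFDCF), (0xFDF0, 0xFFFD), (0x10000, 0xEFFFF),
]

# Additional characters allowed after the first position:
# "-", ".", digits, U+00B7, combining marks, U+203F-U+2040.
NAME_EXTRA_RANGES = [
    (0x2D, 0x2E), (0x30, 0x39), (0xB7, 0xB7),
    (0x300, 0x36F), (0x203F, 0x2040),
]


def _in_ranges(code_point, ranges):
    """Return True if code_point falls inside any (low, high) pair."""
    return any(low <= code_point <= high for low, high in ranges)


def is_ncname_5e(value):
    """Return True if value is an NCName under the fifth-edition rules."""
    if not value:
        return False
    if not _in_ranges(ord(value[0]), NAME_START_RANGES):
        return False
    return all(
        _in_ranges(ord(ch), NAME_START_RANGES)
        or _in_ranges(ord(ch), NAME_EXTRA_RANGES)
        for ch in value[1:]
    )


if __name__ == "__main__":
    # U+0377 (Pamphylian digamma) and U+20000 (a CJK Extension B ideograph)
    # are the kinds of characters discussed in this thread.
    for candidate in ["\u0377", "\U00020000", "a:b"]:
        print(repr(candidate), is_ncname_5e(candidate))
```

Under these ranges U+0377 falls in #x370-#x37D and anything outside the Basic Multilingual Plane up to U+EFFFF falls in #x10000-#xEFFFF, so both kinds of identifier pass, which is why a parser that still applies the fourth-edition Letter tables disagrees.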
This seems to have been fixed, going by the OP's examples in 4.5.0.
I would normally use ASCII characters only for xml:id's and suchlike, but for a specific project it makes sense to use Chinese characters as xml:id's. Here eXist appears to me to be too strict. While it accepts characters in the Basic Multilingual Plane (the second one below), it refuses later additions to Unicode.