-
Notifications
You must be signed in to change notification settings - Fork 67
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
UTF-8 and CJK error #26
Comments
same |
similar, utf-8 with or without BOM fails on some chars. allowed linebreaks throw error too. error message claim no valid utf-8 is submitted, but chars and line breaks are allowed, so this is buggy |
Similar for Polish characters. Simplified test case: <?xml version="1.0" encoding="UTF-8"?>
<czytelnicy xsi:noNamespaceSchemaLocation="ImpCz.xsd" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance">
<czytelnicy>Ząb</czytelnicy>
</czytelnicy> XSD:
Error shown on demo page:
|
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Hi Kripken,
I have used your library in my project and see some issue but didn't know this issue belong to your lib or mine. So I log this issue there:
When I validate my xml file using xsd schema with format (utf-8). In xml file I have use some CJK characters and then the result was failed. I research some way to resolve but have no ideas. This is my schema and xml file:
`
<xs:schema xmlns:xs="http://www.w3.org/2001/XMLSchema"
xmlns:usdm="http://usdm.asia/usdm">
</xs:schema>
`
xml :
<?xml version="1.0" encoding="utf-8"?> <usdm version="0.0.0" xmlns:usdm="http://usdm.asia/usdm"> <group id="0" sort="0" NO="ROOT.0"> <richcontent>を</richcontent> </group> </usdm>
I realize from this page https://www.utf8-chartable.de/unicode-utf8-table.pl?start=12288&number=512&names=- that the characters begin
U+3081 | め | e3 82 81
to the end is failed with utf-8
Thank for your attention!
The text was updated successfully, but these errors were encountered: