
Multi-byte characters sometimes get screwed up #9

Closed
pludov opened this issue Sep 7, 2018 · 1 comment


pludov commented Sep 7, 2018

Parsing an XML file that contains multi-byte characters sometimes results in a character being replaced with two invalid characters in text nodes (and potentially elsewhere).

The cause is probably the chunk boundaries when reading the file: a boundary can fall in the middle of a character encoded as two (or more) bytes, splitting it across two chunks.
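To illustrate the suspected cause, here is a minimal sketch in Python. It is not this project's code: the function names and the sample input are made up for illustration, and it assumes the input is UTF-8. Decoding each chunk on its own turns a split character into two replacement characters, while an incremental decoder holds the incomplete bytes until the next chunk arrives.

```python
import codecs


def decode_chunks_naive(chunks):
    # Hypothetical helper: decode every chunk independently.
    # A character split across two chunks becomes two U+FFFD characters.
    return "".join(chunk.decode("utf-8", errors="replace") for chunk in chunks)


def decode_chunks_incremental(chunks):
    # Hypothetical helper: the incremental decoder buffers a trailing
    # partial byte sequence and completes it with the next chunk.
    decoder = codecs.getincrementaldecoder("utf-8")(errors="replace")
    parts = [decoder.decode(chunk) for chunk in chunks]
    parts.append(decoder.decode(b"", final=True))  # flush any leftover bytes
    return "".join(parts)


# Sample input (made up): "é" is encoded as the two bytes 0xC3 0xA9.
data = "<a>é</a>".encode("utf-8")

# Simulate a chunk boundary that falls between the two bytes of "é".
split_at = data.index(b"\xc3") + 1
chunks = [data[:split_at], data[split_at:]]

print(repr(decode_chunks_naive(chunks)))        # '<a>\ufffd\ufffd</a>'
print(repr(decode_chunks_incremental(chunks)))  # '<a>é</a>'
```

The same idea applies in any language: either buffer raw bytes until a full character is available, or use a decoder that keeps state between chunks.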

Sai1919 (Owner) commented Jun 2, 2019

Fixed by PR #10. Thanks for the contribution, @pludov.

Sai1919 closed this as completed Jun 2, 2019