Issue #120: Byteorder mark removal in UncSource removes first left angle bracket '<' of xml document #123

Merged 2 commits on Mar 24, 2018

@sonntam sonntam commented Feb 27, 2018

This pull request fixes the failure to read XML files caused by faulty BOM removal, as described in issue #120. I added several test cases for reading files in different encodings:

  • UTF-8
  • UTF-8 with BOM
  • UTF-16 Big Endian
  • UTF-16 Little Endian
  • ISO-8859-1
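
For illustration only, here is a minimal sketch of BOM-aware decoding for the BOM-prefixed cases in the list above. It is written in Python rather than the project's own language, it is not the code from this pull request, and the helper name `decode_feed` is hypothetical. The point it shows is that the BOM has to be matched as a whole byte sequence and stripped by its exact length, so the leading '<' of the document is left alone.

```python
import codecs

# BOM byte sequences paired with the codec used for the remaining bytes.
_BOMS = [
    (codecs.BOM_UTF8, "utf-8"),
    (codecs.BOM_UTF16_BE, "utf-16-be"),
    (codecs.BOM_UTF16_LE, "utf-16-le"),
]


def decode_feed(raw: bytes, default: str = "utf-8") -> str:
    """Hypothetical helper: strip a leading BOM (if any) and decode the rest."""
    for bom, encoding in _BOMS:
        if raw.startswith(bom):
            # Drop exactly len(bom) bytes, never more, so the first
            # character of the XML document survives.
            return raw[len(bom):].decode(encoding)
    # No BOM found: fall back to the assumed default encoding.
    return raw.decode(default)


xml = '<?xml version="1.0"?><Feed/>'
with_bom = codecs.BOM_UTF8 + xml.encode("utf-8")
first_char = decode_feed(with_bom)[0]  # the '<' is preserved
```

Files without a BOM fall through to the default codec, which is why ISO-8859-1 input is a separate problem.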

Currently, "foreign" (non-ASCII) characters such as äöü are not supported directly if the file is encoded in an ANSI code page such as ISO-8859-1. I added a test case for that as well, which currently fails.
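
As a rough illustration of why such files are a problem (again in Python, not the project's code), the bytes that ISO-8859-1 uses for äöü are not valid UTF-8, so decoding them with a UTF-8 default cannot recover the text. Python's strict decoder raises an error here; other runtimes may substitute replacement characters instead.

```python
text = "äöü"
raw = text.encode("iso-8859-1")  # one byte per character: E4 F6 FC

try:
    raw.decode("utf-8")
    decoded_as_utf8 = True
except UnicodeDecodeError:
    # 0xE4 announces a multi-byte UTF-8 sequence, but 0xF6 is not a
    # valid continuation byte, so strict decoding fails.
    decoded_as_utf8 = False

# Decoding with the codec the file was actually written in works.
round_trip = raw.decode("iso-8859-1")
```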

To support ANSI-encoded files, some changes have to be made to the framework; for example, the IUpdateSource::GetUpdatesFeed() method would have to return a Stream instead of a string. This would allow the XML reader to check the XML declaration and its encoding attribute. I might propose a fix for that as well some time in the future while I'm at it.
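
The idea of handing the parser raw bytes instead of an already decoded string can be sketched as follows. This is a Python analogy under stated assumptions, not a proposal for the actual IUpdateSource change; `read_feed` and the sample `Feed` element are hypothetical. When the parser receives a byte stream, it reads the encoding attribute of the XML declaration itself and decodes accordingly.

```python
import io
import xml.etree.ElementTree as ET


def read_feed(stream):
    """Hypothetical helper: parse a feed from a binary stream.

    The parser sees the raw bytes, so it can honor the encoding
    named in the XML declaration.
    """
    return ET.parse(stream).getroot()


# Sample document, encoded the way its declaration says it is.
raw = '<?xml version="1.0" encoding="ISO-8859-1"?><Feed title="äöü"/>'.encode(
    "iso-8859-1"
)
root = read_feed(io.BytesIO(raw))
title = root.get("title")
```

With a string-returning API, the decoding decision has already been made before the parser ever sees the declaration, which is the limitation described above.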

@FlorianAsal

I hope this fix gets integrated ASAP, as the issue has been well known for more than 5 years now:

https://groups.google.com/forum/#!topic/nappupdate/Irr2cIch5bQ

@synhershko
Owner

Thank you @sonntam! Looks good, merging now!

@synhershko synhershko merged commit e00b096 into synhershko:master Mar 24, 2018