Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add support for 4 byte UTF-8 characters and stricter character checking #27

Merged
merged 2 commits into from Nov 25, 2014

Conversation

Projects
None yet
2 participants
@klondi
Copy link
Contributor

commented Nov 24, 2014

With amongst other things the emoticon parts of the new versions of UTF-8 the lack of proper checking of the strings as defined in RFC 3629 has become apparent. More info in this issue can be found at http://forum.dcbase.org/viewtopic.php?f=18&t=956

This branch also adds stricter checking of utf-8 strings to detect things like control characters encoded using 2 bytes amongst other things.

Finally tests for these features have also been added.

janvidar added a commit that referenced this pull request Nov 25, 2014

Merge pull request #27 from klondi/utf-8_fixes
Add support for 4 byte UTF-8 characters and stricter character checking

@janvidar janvidar merged commit e32bb3f into janvidar:master Nov 25, 2014

1 check passed

continuous-integration/travis-ci The Travis CI build passed
Details
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
You can’t perform that action at this time.