Serialized String comparison, Unicode support #801

Closed
wants to merge 1 commit into from

@zentol zentol commented May 13, 2014

The StringComparator now compares strings directly in their serialized form.

To this end, new string read/write/copy/compare methods were introduced, which use a variable-length encoding for the characters.

Key points:

  • The most significant bits are written/read first.
  • The first 2 bits of each encoded character store its size in bytes.
  • An encoded character occupies at most 3 bytes.
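
To make the key points concrete, here is a minimal sketch of one encoding that satisfies them. It is not the code from this pull request: the class `VarLenChars`, its method names, and the exact prefix values (00 for 1 byte, 01 for 2 bytes, 10 for 3 bytes) are assumptions chosen for illustration. Under these assumptions, an unsigned byte-wise comparison of two serialized strings gives the same result as comparing their code points one by one, so no deserialization is needed.

```java
import java.io.ByteArrayOutputStream;

// Hypothetical helper for illustration only; the layout below is an assumption.
final class VarLenChars {

    // Assumed size prefix in the 2 most significant bits of the first byte:
    // 00 -> 1 byte (6 payload bits), 01 -> 2 bytes (14 bits), 10 -> 3 bytes (22 bits).
    static void writeCodePoint(int cp, ByteArrayOutputStream out) {
        if (cp < 0 || cp >= (1 << 22)) {
            throw new IllegalArgumentException("value does not fit in 22 bits: " + cp);
        }
        if (cp < (1 << 6)) {
            out.write(cp);
        } else if (cp < (1 << 14)) {
            out.write(0x40 | (cp >>> 8));   // prefix 01 + top 6 payload bits
            out.write(cp & 0xFF);
        } else {
            out.write(0x80 | (cp >>> 16));  // prefix 10 + top 6 payload bits
            out.write((cp >>> 8) & 0xFF);
            out.write(cp & 0xFF);
        }
    }

    // Encodes every code point of the string, most significant bits first.
    static byte[] encode(String s) {
        ByteArrayOutputStream out = new ByteArrayOutputStream();
        s.codePoints().forEach(cp -> writeCodePoint(cp, out));
        return out.toByteArray();
    }

    static String decode(byte[] data) {
        StringBuilder sb = new StringBuilder();
        int i = 0;
        while (i < data.length) {
            int first = data[i++] & 0xFF;
            int extra = first >>> 6;        // number of bytes that follow: 0, 1 or 2
            if (extra == 3) {
                throw new IllegalArgumentException("unused size prefix 11");
            }
            int cp = first & 0x3F;
            for (int k = 0; k < extra; k++) {
                cp = (cp << 8) | (data[i++] & 0xFF);
            }
            sb.appendCodePoint(cp);
        }
        return sb.toString();
    }

    // Unsigned lexicographic comparison of the raw bytes. With the layout above
    // this orders strings by code point, which can differ from String.compareTo
    // (UTF-16 code unit order) for characters outside the Basic Multilingual Plane.
    static int compareSerialized(byte[] a, byte[] b) {
        int n = Math.min(a.length, b.length);
        for (int i = 0; i < n; i++) {
            int d = (a[i] & 0xFF) - (b[i] & 0xFF);
            if (d != 0) {
                return d;
            }
        }
        return a.length - b.length;
    }
}
```

For example, `VarLenChars.compareSerialized(VarLenChars.encode("abc"), VarLenChars.encode("abd"))` is negative, and `VarLenChars.decode(VarLenChars.encode(s))` returns `s` again. The comparison works on raw bytes because a longer encoding always carries a larger size prefix, so the first byte already separates characters of different sizes.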

Additionally, the StringSerializer now has full Unicode support. I couldn't find a Unicode character that uses more than 22 bits, so 3 bytes (24 bits, minus the 2 size bits) should be sufficient.
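
The 22-bit claim can be checked against the JDK's own constant for the largest code point. The class and method names below are made up for this sketch; only `Character.MAX_CODE_POINT` and `Integer.numberOfLeadingZeros` come from the standard library.

```java
// Hypothetical check, not part of this pull request.
final class CodePointBits {

    // Number of bits needed to represent a non-negative value.
    static int bitsNeeded(int value) {
        return 32 - Integer.numberOfLeadingZeros(value);
    }

    public static void main(String[] args) {
        int maxCodePoint = Character.MAX_CODE_POINT;  // 0x10FFFF
        int payloadBits = 3 * 8 - 2;                  // 3 bytes minus the 2 size bits

        System.out.println("bits for largest code point: " + bitsNeeded(maxCodePoint));
        System.out.println("payload bits in 3 bytes:     " + payloadBits);
        System.out.println("fits: " + (bitsNeeded(maxCodePoint) <= payloadBits));
    }
}
```

The largest code point, U+10FFFF, needs 21 bits, so every Unicode character fits into the 22 payload bits with one bit to spare.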

@zentol zentol closed this Jun 10, 2014

2 participants