Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and
privacy statement. We’ll occasionally send you account related emails.
Already on GitHub?
to your account
Migrated from rt.perl.org#127048 (status was 'resolved')
Searchable as RT127048$
The text was updated successfully, but these errors were encountered:
This is a continuation of
“The U+200D ZERO WIDTH JOINER (ZWJ) can be used between the elements of a
sequence of characters to indicate that a single glyph should be presented
“So to the user, these would behave like single emoji characters, even
though internally they are sequences.”
It sounds like we shouldn't cut these sequences in half when doing .substr
(which in turn means that these should be treated as one grapheme).
There is a chart of possible combinations here
http://www.unicode.org/emoji/charts/emoji-zwj-sequences.html, but I think
that any sequence with U+200D ZERO WIDTH JOINER should probably result in
one grapheme. As crazy as it sounds…
Sorry, something went wrong.
It should also be noted that ZERO WIDTH JOINER is used for other purposes
But I'm not sure if it should affect the character count in such cases.
This has been resolved since a month or so ago. This was closed with this commit:
@samcv - Status changed from 'new' to 'resolved'
No branches or pull requests