You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
some systems emit utf8 strings with surrogate pairs encoded as two 3-byte sequences. uftcpp does not support such an encoding and throws an exception about an invalid code point when encountering it.
The text was updated successfully, but these errors were encountered:
According to https://en.wikipedia.org/wiki/UTF-8#CESU-8
That is indeed invalid UTF-8 but there's a derivative standard callied CESU-8 in which it is infact valid.
There are systems (Android OS for instance) that do emit this encoding so the question of if its valid or not is irrelevant.
I would argue that "the right thing" would be to do the thing that is most useful and offers the broadest support. Otherwise any user who wants to use uftcpp on Android is going to run in to this issue and would eventually need to replace uftcpp with something else.
some systems emit utf8 strings with surrogate pairs encoded as two 3-byte sequences. uftcpp does not support such an encoding and throws an exception about an invalid code point when encountering it.
The text was updated successfully, but these errors were encountered: