-
Notifications
You must be signed in to change notification settings - Fork 59
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Incrementally parsed invalid sequences spanning multiple chunks write data #52
Comments
U+FFFE is a noncharacter, but it doesn't make the corresponding UTF-8 sequence (
There are also a number of noncharacters, including U+FDD0..FDEF reserved for the Arabic processing, and none of them are prohibited in UTF-8. Rust's |
Ah. I tired looking up invalid utf-8 and that's what I found. Silly me! Can you give me an example of something which is invalid utf-8? |
@cgaebel Rust-encoding has a full test suite for the invalid UTF-8 sequences. |
Ahhh I missed the |
This test successfully reports an error, but when it does it writes an invalid code sequence into the buffer.
(side note, github markup is eating the invalid UTF-8 char in
left
. Rest assured SOMETHING is in there.The text was updated successfully, but these errors were encountered: