Skip to content

Encoded bytes with \x values above 7F (80-FF) not handled correctly #37

@slevithan

Description

@slevithan

E.g., standalone \x80 and \xEF\xC0\xBB correctly throw (since the parser is catching these as invalid multibyte sequences), but the valid UTF-8 encoded byte sequence \xE2\x82\xAC is being incorrectly passed through as /\xE2\x82\xAC/. It is in fact equivalent to the single code point \u20AC (or ) in JS.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions