Extended Unicode escape doesn't seem to accept hexadecimal digits. #476

AshtonSnapp · 2022-10-25T16:32:56Z

Hello. I am trying to enter the following Regex into RegExr in an effort to debug a potential problem with it that has caused a lexer to pick up a trailing parentheses after the closing quotation mark (said parentheses being the end delimiter of function arguments).

The regex I am trying to debug is as follows: /"([\u{0}-\u{10FFFF}]|(\\"))*"/gu (although, it is written in my code as a raw string literal r#""([\u{0}-\u{10FFFE}\u{10FFFF}]|(\\"))*""# - a bug in the lexer library I'm using causes \u{0}-\u{10FFFF} to match any byte, hence the slight weirdness there)

However, attempting to type in the \u{10FFFF} escape results in RegExr failing to identify the escape sequence - it gets marked as invalid. I am using the JavaScript (Browser) regex engine for this, because Unicode. This appears to be a bug, as the sidebar reference indicates that any number of hexadecimal digits may be used within the brackets. Using lowercase F's does not work either.

The text was updated successfully, but these errors were encountered:

valadaptive · 2024-03-05T20:18:21Z

I'm also experiencing this--note that while the parser rejects the character escape, the regex itself works properly:

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Extended Unicode escape doesn't seem to accept hexadecimal digits. #476

Extended Unicode escape doesn't seem to accept hexadecimal digits. #476

AshtonSnapp commented Oct 25, 2022

valadaptive commented Mar 5, 2024

Extended Unicode escape doesn't seem to accept hexadecimal digits. #476

Extended Unicode escape doesn't seem to accept hexadecimal digits. #476

Comments

AshtonSnapp commented Oct 25, 2022

valadaptive commented Mar 5, 2024