Extended Unicode escape doesn't seem to accept hexadecimal digits. #476

Open
AshtonSnapp opened this issue Oct 25, 2022 · 1 comment

@AshtonSnapp

Hello. I am trying to enter the following regex into RegExr to debug a potential problem with it: it has caused a lexer to pick up a trailing parenthesis after the closing quotation mark (the parenthesis being the end delimiter of the function arguments).

The regex I am trying to debug is as follows: /"([\u{0}-\u{10FFFF}]|(\\"))*"/gu. In my code it is written as the raw string literal r#""([\u{0}-\u{10FFFE}\u{10FFFF}]|(\\"))*""#, because a bug in the lexer library I'm using causes \u{0}-\u{10FFFF} to match any byte, hence the slight weirdness there.

However, when I type in the \u{10FFFF} escape, RegExr fails to recognize the escape sequence and marks it as invalid. I am using the JavaScript (Browser) regex engine for this, because I need Unicode support. This appears to be a bug, as the sidebar reference indicates that any number of hexadecimal digits may be used within the braces. Using lowercase F's does not work either.
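For comparison, here is a minimal sketch of how the JavaScript engine itself handles the escape, independent of RegExr's highlighter. The helper name matchesWhole is hypothetical and exists only for this illustration.

```javascript
// Hypothetical helper: compile `source` with `flags` and test whether
// it matches the whole of `text`.
function matchesWhole(source, flags, text) {
  return new RegExp(`^(?:${source})$`, flags).test(text);
}

// The highest Unicode code point, U+10FFFF.
const maxCodePoint = String.fromCodePoint(0x10FFFF);

// With the u flag, \u{...} is a code point escape, and the case of the
// hexadecimal digits does not matter.
console.log(matchesWhole("\\u{10FFFF}", "u", maxCodePoint)); // true
console.log(matchesWhole("\\u{10ffff}", "u", maxCodePoint)); // true
console.log(matchesWhole("\\u{0}", "u", "\u0000"));          // true

// Without the u flag, the same source is not read as a code point escape,
// so it does not match U+10FFFF.
console.log(matchesWhole("\\u{10FFFF}", "", maxCodePoint));  // false
```

If this runs as expected in the browser console, the engine accepts the escape and only the editor's parsing of it is at fault.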

@valadaptive

I'm also experiencing this. Note that while the parser rejects the character escape, the regex itself works properly:

[image attached in the original comment]
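The same check can be sketched outside RegExr by running the pattern from the issue directly in a JavaScript engine. The sample input is made up for illustration, and this says nothing about how the lexer library in question behaves.

```javascript
// The pattern from the issue, written as a regex literal with the g and u flags.
const stringLiteral = /"([\u{0}-\u{10FFFF}]|(\\"))*"/gu;

// Hypothetical sample input: a call with two string arguments.
const input = 'print("a", "b")';

// The literal compiles, so the engine accepts the \u{...} escapes.
const matches = input.match(stringLiteral);
console.log(matches); // [ '"a", "b"' ]
```

Note that the character class covers every code point, including the quotation mark itself, so in JavaScript the greedy repetition runs to the last quotation mark in the input rather than stopping at the first closing one.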
