Join GitHub today
GitHub is home to over 28 million developers working together to host and review code, manage projects, and build software together.Sign up
Modifiers are dropped in \X regular expression matches #4832
This Issue is about how that match becomes wrong.
The circle above the
Note the absence of the "modifier".
In order to know what I'm talking about, here are links.
Keywords: extended grapheme cluster
The unicode_normalize appears to be working properly here, expanding (in this case) all the diacritics to their combined character forms. The subsequent
This is likely missing or incorrect logic in joni. I'm reading up on how parsers and regex are expected to handle unicode normalized into combining characters.