You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
It's my understanding that STRATEGY_IGNORE should "add characters to result", which to me sounds like it should retain the character in the output if it isn't matched.
However, I cannot seem to retain my complete original input
This is an issue because there are characters that, while not true homoglyphs, can still be used as them. Consider the German eszett, ß, which is a common stand-in for 'B' online.
Because of this, I'm unable to properly detect (as an example) the string 'Сaptchaß𝗈t' -- Cyrillic ES (homoglyph of latin C), German Eszett (leet-speak for latin B), and Mathematical o (normalized to latin o). The best I've been able to achieve is Captchaot with strategy LOAD and ascii_strategy REMOVE.
Is there a way to have homoglyphs simply pass-through any character that isn't matched?
The text was updated successfully, but these errors were encountered:
It's my understanding that STRATEGY_IGNORE should "add characters to result", which to me sounds like it should retain the character in the output if it isn't matched.
However, I cannot seem to retain my complete original input
This is an issue because there are characters that, while not true homoglyphs, can still be used as them. Consider the German eszett,
ß
, which is a common stand-in for 'B' online.Because of this, I'm unable to properly detect (as an example) the string 'Сaptchaß𝗈t' -- Cyrillic ES (homoglyph of latin
C
), German Eszett (leet-speak for latinB
), and Mathematical o (normalized to latino
). The best I've been able to achieve isCaptchaot
with strategy LOAD and ascii_strategy REMOVE.Is there a way to have homoglyphs simply pass-through any character that isn't matched?
The text was updated successfully, but these errors were encountered: