Simplification On Chars That Does Not Have Accent #1

erthium · 2024-01-14T11:57:09Z

Currently used string simplification process is simple:

Normalise all character with NFKD, which removes accents from characters and create 2 different chars.
Remove all characters that are not in ASCII range.

This process works just fine for almost all cases, but in some situtation it fails, such as the letter ı does not have any accent, but used a lot in Turkish language and clearly corresponds to the letter i in ASCII, but since it does not have accent, it gets lost in the process.

We need to find a way to support such characters.

The text was updated successfully, but these errors were encountered:

erthium added bug Something isn't working good first issue Good for newcomers labels Jan 14, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Simplification On Chars That Does Not Have Accent #1

Simplification On Chars That Does Not Have Accent #1

erthium commented Jan 14, 2024

Simplification On Chars That Does Not Have Accent #1

Simplification On Chars That Does Not Have Accent #1

Comments

erthium commented Jan 14, 2024