Skip to content
This repository has been archived by the owner on Dec 2, 2021. It is now read-only.

Unicode Normalization: convert all characters with accents into their one-unicode-character form? #29

Open
bao-xingchen opened this issue Jul 15, 2016 · 1 comment

Comments

@bao-xingchen
Copy link

Hello Thomas,

Thank you for your great works!

When I copy and paste the translated word with accents, e.g. géométrie, the é is actually two characters e´ (+U0065 +U0301). I think the one-unicode-character form would be a better choice, since Google Translate outputted the normalization form, which in this case is é (\u00e9).

So is there any possibilities to convert all characters with accents into their one-unicode-character form in the copy-paste? Thank you.

@bao-xingchen bao-xingchen changed the title Unicode Normalization: Convert all characters with accents into their one-unicode-character form? Unicode Normalization: convert all characters with accents into their one-unicode-character form? Jul 15, 2016
@thomashempel
Copy link
Owner

I will have a look at this. Thanks for reporting.

Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants