Look in to using Text::Unidecode's tables #47

Closed
dharple opened this issue Feb 15, 2021 · 3 comments

dharple commented Feb 15, 2021

Text::Unidecode has already solved transliteration, more or less.

@dharple dharple self-assigned this Feb 15, 2021
@dharple dharple added this to To Do in detox v2.0 via automation Feb 15, 2021

dharple commented Feb 15, 2021

test.pl

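use strict;
use warnings;
use utf8;               # the source file contains a literal UTF-8 character
use Text::Unidecode;

# U+0100 LATIN CAPITAL LETTER A WITH MACRON, written as an escape.
my $x = "\x{100}";

print unidecode($x);
print "\n";

# The same character written as a literal; both forms print "A".
print unidecode("Ā\n");

$ perl test.pl
A
A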

@dharple dharple moved this from To Do to In Progress in detox v2.0 Feb 16, 2021

dharple commented Feb 16, 2021

Undo some of the changes from the last commit? I'm not sure they're adding value.

dharple commented Feb 17, 2021

Done. New repo needs polish.

@dharple dharple closed this as completed Feb 17, 2021
detox v2.0 automation moved this from In Progress to Done Feb 17, 2021
dharple added a commit that referenced this issue Feb 17, 2021
- Remove commented out entries in Unidecode.
- Remove "?" and "[?]" entries in Unidecode, unless the Unicode description
  contains the word "QUESTION".
- Remove and revamp some of the transliterations in the 0xA0 - 0xBF space.
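
The second rule in the commit message above can be sketched roughly as follows. This is only an illustration of the filtering idea, not the code used in the commit: the `%table` sample and the `keep_entry` helper are hypothetical, and the sample values are not copied from the real Unidecode tables. It assumes Perl 5.10 or later and uses the core `charnames` module to look up Unicode names.

```perl
use strict;
use warnings;
use charnames ();   # core module; provides charnames::viacode()

# Hypothetical sample of table entries (code point => replacement).
# Illustrative values only, not taken from the real tables.
my %table = (
    0x003F => '?',     # QUESTION MARK
    0x00BF => '?',     # INVERTED QUESTION MARK
    0x0100 => 'A',     # LATIN CAPITAL LETTER A WITH MACRON
    0x0378 => '[?]',   # unassigned code point with a placeholder entry
);

# Keep an entry unless it is a "?" or "[?]" placeholder whose Unicode
# name does not contain the word QUESTION.
sub keep_entry {
    my ($code_point, $replacement) = @_;
    return 1 unless $replacement eq '?' || $replacement eq '[?]';
    # viacode() returns undef when the code point has no name.
    my $name = charnames::viacode($code_point) // '';
    return $name =~ /QUESTION/ ? 1 : 0;
}

my %filtered;
for my $code_point (keys %table) {
    $filtered{$code_point} = $table{$code_point}
        if keep_entry($code_point, $table{$code_point});
}

printf "U+%04X => %s\n", $_, $filtered{$_}
    for sort { $a <=> $b } keys %filtered;
```

With this sample, the two genuine question-mark characters and the ordinary transliteration should survive, while the placeholder for the unnamed code point should be dropped.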