Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Use pronounciation for transliteration #8

Open
davidpomerenke opened this issue May 30, 2020 · 1 comment
Open

Use pronounciation for transliteration #8

davidpomerenke opened this issue May 30, 2020 · 1 comment
Labels
enhancement New feature or request

Comments

@davidpomerenke
Copy link
Owner

While transliterating letter-by-letter works nicely for German → *, most users appear to find it unintuitive for English → *.

There exists a tool for retrieving the international phonetic alphabet (IPA) version of an English word: https://github.com/shukriadams/node-text-to-ipa

The main work would be to rewrite the transliteration rules for English → * using the IPA characters as source characters. There's 107 characters + diacritics, so this will get really complex. I don't know whether Regexes work well with IPA characters.

@davidpomerenke davidpomerenke added the enhancement New feature or request label May 30, 2020
@davidpomerenke
Copy link
Owner Author

davidpomerenke commented May 31, 2020

An advantage of using the IPA as source characters would be that it would then no longer be necessary to distinguish between different source languages. And probably, the source part of rules would no longer need to contain multiple characters, and rules would no longer need to be prioritized. (This would bring no performance improvement, however, as the prioritization happens during preprocessing.)

However, this presupposes that there are suitable IPA dictionaries available for all relevant languages (only German, so far). The package mentioned above only includes an IPA dictionary for American English, and they mention in https://github.com/surrsurus/text-to-ipa that it was hard even to find this.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

1 participant