Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Unicode rears its ugly head #8

Open
ketsuban opened this issue Feb 19, 2014 · 0 comments
Open

Unicode rears its ugly head #8

ketsuban opened this issue Feb 19, 2014 · 0 comments

Comments

@ketsuban
Copy link

The query "What is Pokémon?" silently drops the é and then tells me there is no result in the database for "Pokmon". "What is Tyranitar?", on the other hand, got me this gem from Freebase:

are one of the 493 fictional species of Pokᅢᄅmon creatures from the multi-billion-dollar Pokᅢᄅmon media franchise, designed by Ken Sugimori. The purpose of Tyranitar in the games, anime, and manga, as with all other Pokᅢᄅmon, is to battle both wild Pokᅢᄅmon¬タヤuntamed creatures that characters encounter while embarking on various adventures and tamed Pokᅢᄅmon creatures owned by Pokᅢᄅmon trainer.

In case Github eats them, the two middle characters are U+FFC3 HALFWIDTH HANGUL LETTER AE and U+FFA9 HALFWIDTH HANGUL LETTER RIEUL - how U+00E9 LATIN SMALL LETTER E WITH ACUTE ended up as that is beyond me. (It may be a bug in the database, because the query "What is Pikachu?" gets back a correctly formatted page.)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant