Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Wikipedia extractor inserts Chinese characters instead em dashes #19

Closed
milekpl opened this issue Oct 6, 2013 · 0 comments
Closed

Wikipedia extractor inserts Chinese characters instead em dashes #19

milekpl opened this issue Oct 6, 2013 · 0 comments
Assignees

Comments

@milekpl
Copy link
Member

milekpl commented Oct 6, 2013

The em dashes from the page:

http://en.wikipedia.org/wiki/August_22

are converted to Chinese (?) characters:

1777 舑 American Revolutionary War: British forces abandon the Siege of Fort Stanwix...

Some Unicode encoding failure?

@ghost ghost assigned danielnaber Oct 6, 2013
linuxscout pushed a commit to linuxscout/languagetool that referenced this issue Dec 31, 2019
danielnaber pushed a commit that referenced this issue Dec 31, 2019
* Fixes #19, Wrong error position when text has Tashkeel
linuxscout pushed a commit to linuxscout/languagetool that referenced this issue Dec 31, 2019
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants