UnicodeEncodeError: 'gbk' codec can't encode character u'\ufb01' in position 173 : illegal multibyte sequence #72

Closed
xiaoyanguoke opened this issue Oct 19, 2017 · 3 comments

Comments

@xiaoyanguoke

image

image

How can I solve this?

@bozhodimitrov
Collaborator

bozhodimitrov commented Oct 20, 2017

@xiaoyanguoke you can try using the Cygwin terminal on Windows, since you have a mix of locales. The tesseract executable itself produces UTF-8 output, while your console is using the 'gbk' codec.
So you should either switch to Cygwin in order to get proper output, or change the encoding of the cmd prompt itself.

You can also try the win_unicode_console module.
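For illustration, here is a minimal sketch of another way to avoid the crash, assuming the error is raised when the recognized text is printed to a console that uses the 'gbk' codec. The helper name `safe_for_console` and the sample string are hypothetical and not part of pytesseract. It replaces characters the console codec cannot encode (such as U+FB01) with `?` instead of raising.

```python
def safe_for_console(text, encoding="gbk"):
    """Return `text` with characters that `encoding` cannot represent replaced by '?'.

    `encoding` is assumed to be the console's codec; 'gbk' is taken from the
    error message in this issue.
    """
    # errors="replace" substitutes '?' for each unencodable character
    # instead of raising UnicodeEncodeError.
    return text.encode(encoding, errors="replace").decode(encoding)


# Hypothetical OCR output containing the ligature U+FB01,
# the character named in the error message.
sample = "con\ufb01g"
print(safe_for_console(sample))
```

This only hides the unencodable characters in what is printed; the original string is unchanged, so it can still be written to a file opened with `encoding="utf-8"`.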

@xiaoyanguoke
Author

@int3l I still cannot solve it.

@bozhodimitrov
Collaborator

Please provide us with the following information:

  • OS name and version.
  • Python version.
  • A sample code snippet and a sample image file that reproduce the error.

I can't help much without reproducing the exact error.
Thanks for reporting the issue.
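
One possible way to collect the environment details requested above is sketched below. It uses only the standard library. The function name `environment_report` and the choice of fields are assumptions; the two encoding fields are added because the error concerns the console codec.

```python
import locale
import platform
import sys


def environment_report():
    """Collect OS, Python version, and encoding details for a bug report."""
    return {
        "os": platform.platform(),                 # OS name and version
        "python": sys.version.split()[0],          # e.g. "3.x.y"
        # Encoding Python uses when printing to the console (may be None).
        "stdout_encoding": getattr(sys.stdout, "encoding", None),
        # Locale's preferred encoding, without changing the locale.
        "preferred_encoding": locale.getpreferredencoding(False),
    }


if __name__ == "__main__":
    for key, value in environment_report().items():
        print("{}: {}".format(key, value))
```

The printed lines can be pasted into the issue alongside the code snippet and image.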
