UnicodeEncodeError: 'gbk' codec can't encode character u'\ufb01' in position 173 : illegal multibyte sequence #72

xiaoyanguoke · 2017-10-19T06:07:09Z

how to solve?

bozhodimitrov · 2017-10-20T07:13:43Z

@xiaoyanguoke you can try to use cygwin terminal platform under windows, since you have a mix of locales. The tesseract executable itself uses utf8 as output.
So you should either switch to it, in order to have proper output in cmd or you should switch the cmd prompt itself.

Also you can try the win_unicode_console module

xiaoyanguoke · 2017-10-20T08:56:20Z

@int3l I still can not solve

bozhodimitrov · 2017-10-21T02:40:59Z

Please provide us with the following information:

OS name and version.
python version
sample code snippet + sample image file that produce the error.

I can't help much without reproducing the exact error.
Thanks for reporting the issue.

bozhodimitrov closed this as completed Nov 14, 2017

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

UnicodeEncodeError: 'gbk' codec can't encode character u'\ufb01' in position 173 : illegal multibyte sequence #72

UnicodeEncodeError: 'gbk' codec can't encode character u'\ufb01' in position 173 : illegal multibyte sequence #72

xiaoyanguoke commented Oct 19, 2017

bozhodimitrov commented Oct 20, 2017 •

edited

Loading

xiaoyanguoke commented Oct 20, 2017

bozhodimitrov commented Oct 21, 2017

UnicodeEncodeError: 'gbk' codec can't encode character u'\ufb01' in position 173 : illegal multibyte sequence #72

UnicodeEncodeError: 'gbk' codec can't encode character u'\ufb01' in position 173 : illegal multibyte sequence #72

Comments

xiaoyanguoke commented Oct 19, 2017

bozhodimitrov commented Oct 20, 2017 • edited Loading

xiaoyanguoke commented Oct 20, 2017

bozhodimitrov commented Oct 21, 2017

bozhodimitrov commented Oct 20, 2017 •

edited

Loading