Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

UnicodeDecodeError: 'gbk' codec can't decode byte 0x9d in position 4771: illegal multibyte sequence #3

Open
makerdd opened this issue Dec 2, 2022 · 1 comment

Comments

@makerdd
Copy link

makerdd commented Dec 2, 2022

On my Windows10, the default decode mode is gbk.
In this file cwe2\database\699.csv line 418 col 3402, gbk can't decode the 0xE2 0x80 0x9C.

Suggestion:
When open *.csv files, should explicitly use encoding='utf-8'.

Thank you very much for your project!

@ziadhany
Copy link
Collaborator

It's my pleasure and sorry for the late reply, i will handle this 👍

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants