Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Encoding issue: UTF-16 LE with BOM encoding gets read as having spaces between all the characters #198

Open
kanarinka opened this issue Feb 15, 2016 · 1 comment

Comments

@kanarinka
Copy link
Collaborator

Here's the file in question
https://www.dropbox.com/s/afo8o8bilaqq3n6/UTF16_LEWITHBOM.txt?dl=0

Here are a couple interesting posts about it on StackOverflow:
http://stackoverflow.com/questions/19328874/python-read-text-file-with-weird-utf-16-format
http://stackoverflow.com/questions/603115/python-opens-text-file-with-a-space-between-every-character

@kanarinka kanarinka self-assigned this Feb 15, 2016
@kanarinka kanarinka added this to the v1.2 - Next Major Release milestone Feb 15, 2016
@rahulbot rahulbot modified the milestones: v1.3 - Next Big Release, v1.2 - Next Major Release Feb 25, 2016
@kanarinka kanarinka modified the milestones: v2.0 - Someday, v1.3 - Portuguese Language Release Aug 23, 2016
@rahulbot
Copy link
Collaborator

Still failing in py3:

2020-05-25 14:09:07,184 - databasic.logic.filehandler - DEBUG - loading txt file
2020-05-25 14:09:07,198 - databasic.logic.filehandler - ERROR - Wasn't able to read the words from the file /var/folders/fb/lkll_1p966b2_hqz9s94q1v80000gq/T/LIKELYFAIL_UTF16_LEWITHBOM.txt
2020-05-25 14:09:07,199 - databasic.logic.filehandler - DEBUG - loaded 0 chars

@rahulbot rahulbot removed this from the Someday milestone May 25, 2020
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

2 participants