Not compatible with Hungarian language and/or date format #24

Closed
Rastamas opened this issue Jan 23, 2017 · 5 comments

Comments

@Rastamas

Rastamas commented Jan 23, 2017

Downloading in Hungarian gives the following error.
Unexpected time format in "2016. j├║nius 16., 9:44 UTC+02". If you downloaded your Facebook data in a language other than English, then it's possible support may need to be added to this tool.

When I download the data in English instead, the special letters in our Hungarian messages (á é ö ő ó ú ü ű) are displayed incorrectly.
Thanks for the package though, it's awesome!

@ownaginatious
Owner

Thanks for bringing this up! It's expected that the Hungarian date format isn't supported (the format for each language has to be added manually), but it's very odd that the letters are getting corrupted.

Did you happen to open your messages.htm file ahead of time and save it in an encoding other than UTF-8?

I will try exporting my own data with the Hungarian locale to see if I can diagnose the issue.
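
To illustrate what adding a format for one language by hand involves, here is a minimal Python sketch. It is not this tool's actual code: `HU_MONTHS`, `HU_PATTERN`, and `parse_hungarian_timestamp` are hypothetical names, and the pattern only covers the `UTC+02` style of zone suffix seen in the report above, not named zones such as `PDT`.

```python
import re
from datetime import datetime, timedelta, timezone

# Hungarian month names mapped to month numbers.
HU_MONTHS = {
    "január": 1, "február": 2, "március": 3, "április": 4,
    "május": 5, "június": 6, "július": 7, "augusztus": 8,
    "szeptember": 9, "október": 10, "november": 11, "december": 12,
}

# Hypothetical pattern for strings like "2016. június 16., 9:44 UTC+02".
# Assumption: the zone is always written as "UTC" plus a signed two-digit hour.
HU_PATTERN = re.compile(
    r"^(?P<year>\d{4})\. (?P<month>\w+) (?P<day>\d{1,2})\., "
    r"(?P<hour>\d{1,2}):(?P<minute>\d{2}) UTC(?P<offset>[+-]\d{2})$"
)


def parse_hungarian_timestamp(text):
    """Parse a Hungarian-style timestamp into a timezone-aware datetime."""
    match = HU_PATTERN.match(text)
    if match is None:
        raise ValueError(f"Unexpected time format in {text!r}")
    month = HU_MONTHS.get(match["month"].lower())
    if month is None:
        raise ValueError(f"Unknown month name in {text!r}")
    tz = timezone(timedelta(hours=int(match["offset"])))
    return datetime(
        int(match["year"]), month, int(match["day"]),
        int(match["hour"]), int(match["minute"]), tzinfo=tz,
    )
```

Calling `parse_hungarian_timestamp("2016. június 16., 9:44 UTC+02")` returns a `datetime` for 16 June 2016 at 09:44 with a +02:00 offset. Supporting another language would mean adding another month table and pattern in the same way.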

@Rastamas
Author

Rastamas commented Jan 23, 2017

Just double-checked: I did not open it with anything, I just copied it out of the zip and into the working folder. I also tried opening it with Notepad++ and converting it to UTF-8 again, but that didn't help either.

Probably worth mentioning that I'm running it from the Windows 10 command line.

@ownaginatious
Owner

So I just downloaded my own data in the Hungarian locale, and I see the following:

Unexpected time format in "2013. július 16., 21:18 PDT". If you downloaded your Facebook data in a language other than English, then it's possible support may need to be added to this tool.
Please report this as a bug on the associated GitHub page and it will be fixed promptly.

It seems the letters are coming out as expected. I think the issue may be your terminal not being set to UTF-8 encoding.

Anyway, if you pipe your data to a file when extracting, it should have the correct encoding. I'll add Hungarian-style date support now 👍
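
The corrupted text in the original report is consistent with this explanation. The short Python sketch below shows how UTF-8 bytes turn into those characters when a console interprets them with a legacy DOS code page. Code page 437 is an assumption (the thread does not say which one was active), and the function names are hypothetical.

```python
def as_seen_on_legacy_console(text, codepage="cp437"):
    """Show how UTF-8 output looks when decoded with a legacy code page."""
    return text.encode("utf-8").decode(codepage)


def repair_mojibake(garbled, codepage="cp437"):
    """Reverse the mis-decoding, assuming no bytes were lost on the way."""
    return garbled.encode(codepage).decode("utf-8")


garbled = as_seen_on_legacy_console("június")   # "ú" is two bytes in UTF-8
restored = repair_mojibake(garbled)
```

Here `garbled` is `j├║nius`, matching the error message in the report, and `restored` is `június` again. The data itself is intact; only the console's interpretation of the bytes is wrong, which is why writing the output to a file avoids the problem.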

@Rastamas
Author

I was just about to write that starting the console with the proper encoding fixed the issue. Sorry about that.
Thanks for adding the Hungarian date support! :)

@ownaginatious
Owner

If you pull the newest version from PyPI (0.8.post16) you should be able to parse your data correctly.
