Update to try to use windows-950 when extract-msg is imported #20

Merged 1 commit on Jul 22, 2023


TheElementalOfDestruction

Changed RTFDE.text_extraction.get_python_codec to look up the windows-950 encoding and check whether it is available. The lookup happens only when the codepage number is 950. If windows-950 is found, the function returns 'windows-950' as the codec to use; if it is not found, the function behaves as it did before.
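A rough sketch of the lookup described above, for illustration only. It is not the code from this PR: the helper name `pick_codec_for_codepage` is hypothetical, and the `cp<number>` fallback is an assumption standing in for whatever get_python_codec normally returns.

```python
import codecs


def pick_codec_for_codepage(codepage_num: int) -> str:
    """Hypothetical helper: prefer 'windows-950' for codepage 950 if registered."""
    if codepage_num == 950:
        try:
            # codecs.lookup raises LookupError when no codec is registered
            # under this name (e.g. nothing has registered windows-950).
            codecs.lookup("windows-950")
            return "windows-950"
        except LookupError:
            pass  # Not available: fall through to the usual behavior.
    # Assumption: the usual behavior maps the number to Python's 'cpNNN' name.
    return f"cp{codepage_num}"
```

Because the check runs at call time, the result depends on whether something has already registered windows-950 with Python's codec registry when the function is called.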

The upcoming version of extract-msg, 0.42.0, adds the implementation of cp950 that Microsoft uses, so that documents using it are parsed correctly.

This is a fix for #19 and passes with the test file specified in that issue.

@seamustuohy seamustuohy merged commit 9f6483b into seamustuohy:dev Jul 22, 2023
seamustuohy added a commit that referenced this pull request Jul 22, 2023