support funky Microsoft Word XHTML unicode escapes #60
These escape sequences are common in MS Word XHTML docs.
@malsmith Could you please provide tests as well?
from html2text.compat import htmlentitydefs
# Based on http://stackoverflow.com/questions/7105874/valueerror-unichr-arg-not-in-range0x10000-narrow-python-build-please-hel
Maybe link to the share link for this question instead: http://stackoverflow.com/q/7105874/173630
👍
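For readers following the thread: the linked question is about `unichr` raising `ValueError` for code points above the Basic Multilingual Plane on narrow Python 2 builds. One commonly suggested workaround is to round-trip the code point through UTF-32 with the `struct` module. The sketch below only illustrates that idea and is not the code from this patch; the function name `codepoint_to_text` and the explicit range check are assumptions added for the example.

```python
import struct


def codepoint_to_text(codepoint):
    """Return the text for a Unicode code point via a UTF-32 round trip.

    Hypothetical helper for illustration; not part of html2text.
    """
    # Reject values outside the Unicode range and lone surrogates, which
    # are not valid scalar values and cannot be decoded as UTF-32.
    if not 0 <= codepoint <= 0x10FFFF or 0xD800 <= codepoint <= 0xDFFF:
        raise ValueError("not a valid Unicode scalar value: %r" % (codepoint,))
    # "<I" packs an unsigned 32-bit little-endian integer, which is exactly
    # one UTF-32-LE code unit, so decoding it yields the character.
    return struct.pack("<I", codepoint).decode("utf-32-le")
```

On Python 3, `chr()` already covers the full range, so a helper like this would mainly matter for narrow Python 2 builds. Pinning the byte order in both the pack format and the codec name avoids depending on the platform's native endianness.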
Will this fix this error I'm getting?
@nikolas Could you try this branch to see if it solves the issue you're facing?
@Alir3z4 I tried this branch, but the issue is not solved. It fails at the struct module decode.
I'm closing this pull request: the author of the patch, @malsmith, has not responded about it, and it has conflicts.