-
Notifications
You must be signed in to change notification settings - Fork 24
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Issue with special characters #37
Comments
Hi there, I suspect it might have something to do with the string replacement logic, could you provide a minimal reproducible example? |
Hi! any website in Spanish is giving the same results.
Thank you again! |
I did a test using some simple spanish words, seems ok, maybe not a problem caused by special characters
|
You are right. I should have tested the plain text before. It turns out that the problem is not in your library, is because I am using the http library to retrieve the HTML from an url and in the response headers is not coming the charset, so the http library is encoding the response body with the default encoding (latin1) I'll see how I can encode the response myself or force the http library to encode it. Thank you again! |
Hello! first of all thank you for your work on this library.
I am having an issue transforming an html file with characters like for example: è or when is a space and then a dot.
It shows weird characters like for example  instead è.
Another case is when I have the symbol ". It shows instead â followed by two boxes.
I hope my problem is more or less clear.
Thank you again!
The text was updated successfully, but these errors were encountered: