Currently, `html_to_unicode` prefers the Content-Type header even when a BOM is present.
But browsers, as well as the WHATWG standard, use the BOM first. This can be checked, e.g., by running this server and opening the URL in a browser (UTF-8 is used if a BOM is present, and cp1251 is used if it is not):
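A minimal sketch of such a test server, assuming Python's standard `http.server` module. The names `TEXT`, `build_body`, `Handler`, and `serve`, the `/bom` path, and port 8000 are hypothetical choices for illustration and are not taken from the original report. The response header always declares cp1251, while the body is either UTF-8 with a BOM or plain cp1251:

```python
import codecs
from http.server import BaseHTTPRequestHandler, HTTPServer

# Hypothetical sample text; Cyrillic makes a wrong decoding easy to spot.
TEXT = "<html><body>Привет</body></html>"


def build_body(with_bom: bool) -> bytes:
    """Return UTF-8 bytes prefixed with a BOM, or cp1251 bytes without one."""
    if with_bom:
        return codecs.BOM_UTF8 + TEXT.encode("utf-8")
    return TEXT.encode("cp1251")


class Handler(BaseHTTPRequestHandler):
    def do_GET(self):
        # /bom serves BOM-prefixed UTF-8; any other path serves plain cp1251.
        body = build_body(self.path == "/bom")
        self.send_response(200)
        # The header claims cp1251 in both cases, so the BOM and the header
        # disagree for /bom.
        self.send_header("Content-Type", "text/html; charset=cp1251")
        self.send_header("Content-Length", str(len(body)))
        self.end_headers()
        self.wfile.write(body)


def serve(port: int = 8000) -> None:
    """Serve on localhost until interrupted (port is an arbitrary choice)."""
    HTTPServer(("127.0.0.1", port), Handler).serve_forever()
```

Calling `serve()` and opening `http://127.0.0.1:8000/bom` and `http://127.0.0.1:8000/` in a browser should make it possible to compare the two cases: if the BOM takes precedence, the text is readable at both URLs.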
kmike changed the title from "BOM should take precedence over Content-Type header when detecting an encoding" to "BOM should take precedence over Content-Type header when detecting the encoding" on Aug 16, 2022