You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Hex(E7) is the windows-1252 code page encoding for ç and not its UTF-8 encoding (which would be C3A7 if I remember correctly). So your file was not in UTF-8 but in a windows-1252 code page that is currently not supported.
Please convert your files to a supported encoding before uploading or write a custom extractor. And feel free to vote for support of codepages at https://feedback.azure.com/forums/327234-data-lake/suggestions/13077555-add-ansi-code-page-support-for-built-in-extractors
Hex(E7) is the windows-1252 code page encoding for ç and not its UTF-8
encoding (which would be C3A7 if I remember correctly). So your file was
not in UTF-8 but in a windows-1252 code page that is currently not
supported.
Please convert your files to a supported encoding before uploading or
write a custom extractor. And feel free to vote for support of codepages at https://feedback.azure.com/forums/327234-data-lake/suggestions/13077555-add-ansi-code-page-support-for-built-in-extractors
—
You are receiving this because you authored the thread.
Reply to this email directly or view it on GitHub #33 (comment)
When reading this .log file it fails in this line... :(
2016-03-07 11:34:48 W3SVCXYZ805 SERVER13 10.101.146.157 GET /pt/Prt/PublishingImages/mailimages/visto_131114.jpg - 80 - 10.101.146.3 HTTP/1.1 Mozilla/5.0+(compatible;+MSIE+10.0;+Windows+NT+6.1;+WOW64;+Trident/6.0;+SRHE+S.R.+Habitação+e+Equipamentos) - - ind.xyz.pt 200 0 0 10143 308 0
The problem are the characters ç and ã
If you convert the file to unicode it works well, though...
The text was updated successfully, but these errors were encountered: