Help with unsupported charsets #17
Comments
Looks like cp-850 and ibm850 are both charmap.CodePage850.
Looks like you took care of most of these. It seems Go does not support UTF-7 out of the box; not sure if that's because Rob Pike co-invented UTF-8 and doesn't want it. We could leverage https://github.com/cention-sany/utf7; @cention-sany appears to make use of a heavily modified version of the older go.enmime. I'd like to avoid adding a bunch of dependencies if possible, but I'd rather take one import than bring in iconv, since it wraps a C library and complicates the build.
Sounds good to me, I'd avoid iconv as well.
Looks like https://github.com/emersion/go-imap/tree/v1/utf7 may be a more idiomatic implementation. Turns out that UTF-7 was actually defined in the IMAP RFC.
Just want to point out that https://github.com/cention-sany/utf7 is compliant with https://tools.ietf.org/html/rfc2152, while https://github.com/emersion/go-imap/tree/v1/utf7 seems to implement https://tools.ietf.org/html/rfc3501#section-5.1.3. These are actually two slightly different variants of UTF-7.
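To make the difference between the two variants concrete, here is a minimal, standard-library-only sketch of a decoder parameterized by variant. It is not the code of either linked package; the names `variant`, `mailUTF7`, `imapUTF7`, and `decodeUTF7` are made up for illustration. It only models two differences: the shift character (`+` in RFC 2152, `&` in RFC 3501) and the base64 alphabet (`/` versus `,` as the last character). It is lenient about the closing `-`, which RFC 3501 requires and RFC 2152 treats as optional.

```go
package main

import (
	"encoding/base64"
	"errors"
	"fmt"
	"strings"
	"unicode/utf16"
)

// variant holds the two settings that differ between the specs in this sketch.
type variant struct {
	shift    byte   // character that opens a base64 run
	alphabet string // 64-character base64 alphabet used inside a run
}

const letters = "ABCDEFGHIJKLMNOPQRSTUVWXYZabcdefghijklmnopqrstuvwxyz0123456789"

var (
	mailUTF7 = variant{shift: '+', alphabet: letters + "+/"} // RFC 2152
	imapUTF7 = variant{shift: '&', alphabet: letters + "+,"} // RFC 3501, section 5.1.3
)

// decodeUTF7 converts UTF-7 text to UTF-8 for the given variant.
// Hypothetical helper: it does not validate the directly encoded characters.
func decodeUTF7(s string, v variant) (string, error) {
	enc := base64.NewEncoding(v.alphabet).WithPadding(base64.NoPadding)
	var out strings.Builder
	for i := 0; i < len(s); {
		if s[i] != v.shift {
			out.WriteByte(s[i]) // directly encoded character
			i++
			continue
		}
		i++ // skip the shift character
		j := i
		for j < len(s) && strings.IndexByte(v.alphabet, s[j]) >= 0 {
			j++ // extend the base64 run
		}
		if j == i {
			// Shift followed directly by '-' encodes the shift character itself.
			if j < len(s) && s[j] == '-' {
				out.WriteByte(v.shift)
				i = j + 1
				continue
			}
			return "", errors.New("utf7: empty shift sequence")
		}
		raw, err := enc.DecodeString(s[i:j])
		if err != nil {
			return "", err
		}
		if len(raw)%2 != 0 {
			return "", errors.New("utf7: run is not a whole number of UTF-16 units")
		}
		// The payload is big-endian UTF-16.
		units := make([]uint16, len(raw)/2)
		for k := range units {
			units[k] = uint16(raw[2*k])<<8 | uint16(raw[2*k+1])
		}
		out.WriteString(string(utf16.Decode(units)))
		if j < len(s) && s[j] == '-' {
			j++ // a '-' that ends the run is absorbed
		}
		i = j
	}
	return out.String(), nil
}

func main() {
	a, _ := decodeUTF7("Hi Mom -+Jjo--!", mailUTF7) // example string from RFC 2152
	b, _ := decodeUTF7("&U,BTFw-", imapUTF7)        // fragment of the RFC 3501 mailbox example
	fmt.Println(a)
	fmt.Println(b)
}
```

Feeding IMAP-style input to the mail variant leaves `&U,BTFw-` untouched, because `&` is an ordinary character in RFC 2152. That is one reason the two implementations are not interchangeable for message bodies.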
Good to know. Skimming RFC 2152, it was specifically designed for internet mail content, so your implementation would be the better choice for enmime.
Added
- Inject a `application/octet-stream` as default content type when none is present (#140, thanks requaos)
- Add support for content-type params to part & encoding (#148, thanks pzeinlinger)
- UTF-7 support (#17)

Fixed
- Handle missing parameter values in the middle of the media parameter list (#139, thanks requaos)
- Fix boundaryReader to respect length instead of capacity (#145, thanks dcormier)
- Handle very empty mime parts (#144, thanks dcormier)
@jhillyerd Nice one
If I had known it was only going to take 3 lines of code to implement, I might not have waited two years to fix this. 🙄
I have collected a small list of unsupported charsets from my logs. Here's the list:
As far as I understand, ISO646-US is "us-ascii", ISO: Western is "iso8859" and cp936 is "gb2312". Is this correct?
I have been unable to figure out the rest of the encodings, although cp-850 and ibm850 seem to be the same.
Does anyone have an idea about these charsets and if they can be supported in this library easily?
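One low-cost way to handle labels like these is to normalize them to a canonical name before looking up a decoder. The sketch below uses only the standard library; the `aliases` table and `canonicalCharset` function are hypothetical and not part of enmime. The entries are assumptions based on this thread and the IANA character set registry, which lists ISO646-US as an alias of US-ASCII and CP936 as an alias of GBK (an extension of GB2312, so it may be a closer match than "gb2312"). "ISO: Western" is left out because it does not identify a single charset.

```go
package main

import (
	"fmt"
	"strings"
)

// aliases maps lower-cased charset labels seen in mail headers to one
// canonical label. Illustrative only; extend it as new labels show up in logs.
var aliases = map[string]string{
	"iso646-us": "us-ascii",
	"cp-850":    "ibm850",
	"cp850":     "ibm850",
	"ibm850":    "ibm850",
	"cp936":     "gbk", // assumption: treat as GBK rather than plain GB2312
}

// canonicalCharset trims and lower-cases a label, then resolves it through
// the alias table. The boolean reports whether the label was recognized.
func canonicalCharset(label string) (string, bool) {
	key := strings.ToLower(strings.TrimSpace(label))
	name, ok := aliases[key]
	return name, ok
}

func main() {
	for _, label := range []string{"ISO646-US", "cp-850", "IBM850", "cp936", "ISO: Western"} {
		name, ok := canonicalCharset(label)
		fmt.Printf("%q -> %q (known: %v)\n", label, name, ok)
	}
}
```

The canonical name can then be used as the key for whatever decoder table the library already has, for example the charmap.CodePage850 entry mentioned in the comments.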