Support Unicode response message #41

doanguyen · 2018-12-08T15:41:47Z

Hi Cole,

I was reading the test and see the test_live.py::test_qq_login has been marked because of utf-8 unsupported. As far as I investigated, the reason is that qq smtp service return non-ascii (and even non utf-8 chars):

535 Error: \xc7\xeb\xca\xb9\xd3\xc3\xca\xda\xc8\xa8\xc2\xeb\xb5\xc7\xc2\xbc\xa1\xa3\xcf\xea\xc7\xe9\xc7\xeb\xbf\xb4: http://service.mail.qq.com/cgi-bin/help?subtype=1&&id=28&&no=1001256\r\n

I was thinking of 2 posibility to solve this problem:

Using chardet to guess the response message encoding.
Using re to eliminate non-utf8 (ascii) chars since it is the message, we still have response code

What do you think?

The text was updated successfully, but these errors were encountered:

cole · 2018-12-08T22:21:20Z

Thanks for submitting. Yeah, I'm not sure how best to handle this. IIRC originally we just did UTF-8 decoding, but then the SMTP protocol actually requires ASCII responses, so I changed it to that.

I agree though this is situation where we might as well be liberal with what we accept (as you pointed out, there is still the status code). However:

I don't like using chardet here as it would be nice to avoid requiring any additional libs (currently they are only required for tests).
Stripping non-utf8 chars seems not ideal either, as character encoding issues could leave you with an empty response when there is actually data.

smtplib seems to avoid decoding response test altogether, which is a neat solution, although it would be a bigger change. Maybe it'll look at that though.

doanguyen · 2018-12-08T22:33:06Z

Another possibility just come to my mind is that we can just print out the formatted bytecodes that are not fit in utf-8 chars, my temp solution just to get rid of the error is something like:

message = line[4:].strip(b" \t\r\n").decode("ascii", errors='backslashreplace')

Thanks for giving very nice library.

cole · 2018-12-08T22:36:28Z

Yeah, I was just looking at that too 😄
I think backslashreplace should allow debugging without errors in most cases. That seems the best to me right now.

Glad you're finding it useful!

cole · 2019-01-14T18:00:48Z

Fixed in 581d718.

doanguyen closed this as completed Dec 9, 2018

cole reopened this Jan 14, 2019

cole self-assigned this Jan 14, 2019

cole closed this as completed Jan 14, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Support Unicode response message #41

Support Unicode response message #41

doanguyen commented Dec 8, 2018

cole commented Dec 8, 2018

doanguyen commented Dec 8, 2018

cole commented Dec 8, 2018

cole commented Jan 14, 2019

Support Unicode response message #41

Support Unicode response message #41

Comments

doanguyen commented Dec 8, 2018

cole commented Dec 8, 2018

doanguyen commented Dec 8, 2018

cole commented Dec 8, 2018

cole commented Jan 14, 2019