replace ToASCII with mime base64 #197

Alexfilus · 2021-06-26T17:52:10Z

it fixes non ascii attach names such as cyrillic

jhillyerd · 2021-06-26T19:57:16Z

I fixed the static check failure in #198 -- that should go away if you rebase.

We'll need to update the golden files, please run the unit tests, inspect the output and run go test -update if it looks correct to you.

it fixes non ascii attach names such as cyrillic

Alexfilus · 2021-06-27T16:50:55Z

I hope how it's right order of commits

jhillyerd · 2021-06-27T19:39:26Z

Looks like it picked up my PR, but no changes to the golden files:

--- FAIL: TestEncodePartQuotedHeaders (0.00s)
    encode_test.go:83: Test output did not match testdata/encode/part-quoted-headers.golden
        To update golden file, run: go test -update
    encode_test.go:83: diff -want +got:
        |-Content-Disposition: attachment; filename="arvizturo \"x\"
        |- tukorfurogep.zip"; modification-date="01 Feb 03 04:05 GMT"
        |+Content-Disposition: attachment;
        |+ filename="=?utf-8?b?w6FydsOtenTFsXLFkSAieCIgdMO8a8O2cmbDunLDs2fDqXAuemlw?=";
        |+ modification-date="01 Feb 03 04:05 GMT"
        | Content-Id: <mycontentid>
        | Content-Transfer-Encoding: base64
        | Content-Type: application/zip; boundary=enmime-abcdefg0123456789;
        |- charset=binary; name="arvizturo \"x\" tukorfurogep.zip";
        |+ charset=binary;
        |+ name="=?utf-8?b?w6FydsOtenTFsXLFkSAieCIgdMO8a8O2cmbDunLDs2fDqXAuemlw?=";
        |  param1=myparameter1; param2=myparameter2
        | 
        | WklQWklQWklQ
        | 
        
--- FAIL: TestEncodePartQuotedPrintableHeaders (0.00s)
    encode_test.go:104: Test output did not match testdata/encode/part-quoted-printable-headers.golden
        To update golden file, run: go test -update
    encode_test.go:104: diff -want +got:
        |-Content-Disposition: attachment; filename="arvizturo \"x\"
        |- tukorfurogep.zip"; modification-date="01 Feb 03 04:05 GMT"
        |+Content-Disposition: attachment;
        |+ filename="=?utf-8?b?w6FydsOtenTFsXLFkSAieCIgdMO8a8O2cmbDunLDs2fDqXAuemlw?=";
        |+ modification-date="01 Feb 03 04:05 GMT"
        | Content-Id: <mycontentid>
        | Content-Transfer-Encoding: base64
        | Content-Type: application/zip; boundary=enmime-abcdefg0123456789;
        |- charset=binary; name="arvizturo \"x\" tukorfurogep.zip";
        |+ charset=binary;
        |+ name="=?utf-8?b?w6FydsOtenTFsXLFkSAieCIgdMO8a8O2cmbDunLDs2fDqXAuemlw?=";
        |  param1=myparameter1; param2=myparameter2
        | X-Qp-Header: =?utf-8?q?Just_enough_to_need_qp_=E2=98=86?=
        | 
        | WklQWklQWklQ
        |

Alexfilus · 2021-06-28T14:45:31Z

I see, those tests checks file names got from ToAscii function witch replace some symbols to similar, but it is not keep original file names.
mime.BEncoding.Encode allows to keep original file names, it is especially important in case of cyrillic file names, witch was looked like _________.pdf

Maybe I should replace golden files?

jhillyerd · 2021-06-28T15:06:31Z

Yes, we should update the golden files if the new output looks correct.

Something that I didn't realize when initially reviewing this PR, is that BEncode will always to base64. That makes it difficult to read an ASCII filename when looking at the raw text. I don't think we should do this.

Instead, please use selectTransferEncoding in encode.go to pick the encoder. That will allow us to pass through pure ASCII unchanged, use q-encoding for filenames with a few special characters, or b-encoding for ones with a majority.

jhillyerd · 2021-07-17T02:52:49Z

Do you plan to keep working on this? Otherwise I'll take a crack at it.

Alexfilus · 2021-07-18T07:51:52Z

I'm sorry for delay. I made changes in golden files and add switch case for file name encodings.

jhillyerd

Looks good, thank you!

jhillyerd self-requested a review June 26, 2021 19:57

Alexfilus force-pushed the master branch 2 times, most recently from 81e266c to d9fb44a Compare June 27, 2021 16:45

replace ToASCII with mime base64

e6940b4

it fixes non ascii attach names such as cyrillic

Alexfilus force-pushed the master branch from d9fb44a to e6940b4 Compare June 27, 2021 16:49

Alexfilus added 2 commits July 18, 2021 10:43

fix golden files

f3df4d9

select encoding for file names

e51ee69

jhillyerd approved these changes Jul 20, 2021

View reviewed changes

jhillyerd merged commit 0e47cdb into jhillyerd:master Jul 20, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

replace ToASCII with mime base64 #197

replace ToASCII with mime base64 #197

Alexfilus commented Jun 26, 2021

jhillyerd commented Jun 26, 2021

Alexfilus commented Jun 27, 2021

jhillyerd commented Jun 27, 2021

Alexfilus commented Jun 28, 2021

jhillyerd commented Jun 28, 2021

jhillyerd commented Jul 17, 2021

Alexfilus commented Jul 18, 2021

jhillyerd left a comment

replace ToASCII with mime base64 #197

replace ToASCII with mime base64 #197

Conversation

Alexfilus commented Jun 26, 2021

jhillyerd commented Jun 26, 2021

Alexfilus commented Jun 27, 2021

jhillyerd commented Jun 27, 2021

Alexfilus commented Jun 28, 2021

jhillyerd commented Jun 28, 2021

jhillyerd commented Jul 17, 2021

Alexfilus commented Jul 18, 2021

jhillyerd left a comment

Choose a reason for hiding this comment