
Maximal number of errors recorded in Part limited #240

Merged
merged 2 commits into jhillyerd:master from pavelbazika:limit-errors on Apr 12, 2022

Conversation

@pavelbazika (Contributor)

Hi,

I use enmime to parse emails in my service. It happened that instead of a valid MIME structure, a client sent several MBs of base64 data as input. In such a case, enmime consumes a lot of memory (several GBs). This is because the readHeader function adds many, many warnings while it tries to "Attempt to detect and repair a non-indented continuation of previous line", as the comment in the code says.

In the end, readHeader calls ReadMIMEHeader, which doesn't find a valid MIME structure and fails with an error, and all these warnings are freed. However, the memory peak remains and can lead to swapping.

To make this more robust, I'm adding a MaxPartErrors global public variable, which prevents recording more than that many errors in one part. The default value is 1000 in my PR, but it can also be 0 if you prefer, which means no limit and thus no change to the current behavior.

Best regards
Pavel
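
(For illustration, here is a minimal, self-contained sketch of the bounded-error idea described above; the types and method names are pared-down stand-ins, not enmime's actual internals.)

```go
package main

import "fmt"

// MaxPartErrors limits how many errors are recorded per Part; 0 means no limit.
var MaxPartErrors = 1000

// Error and Part are simplified stand-ins for enmime's types.
type Error struct{ Name, Detail string }

type Part struct{ Errors []*Error }

// addWarning appends a parse warning unless the limit has been reached, so a
// pathological input (e.g. megabytes of base64 with no MIME structure) cannot
// grow the slice without bound.
func (p *Part) addWarning(name, detail string) {
	if MaxPartErrors > 0 && len(p.Errors) >= MaxPartErrors {
		return // over the limit: drop the warning, parsing continues
	}
	p.Errors = append(p.Errors, &Error{name, detail})
}

func main() {
	MaxPartErrors = 3
	p := &Part{}
	for i := 0; i < 10; i++ {
		p.addWarning("header parse", fmt.Sprintf("bad continuation on line %d", i))
	}
	fmt.Println(len(p.Errors)) // prints 3
}
```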

@iredmail (Contributor) commented Apr 8, 2022

Maybe a default of 1000 is too many. :)

@pavelbazika (Contributor, Author)

OK, it was meant primarily to avoid extreme memory consumption. What about 100?

@jhillyerd (Owner)

I think 100 is probably reasonable, in that it's not likely to provide any useful feedback beyond that. For example, my Inbucket project displays these errors to the user to help them improve the quality of the email they send.

In my mind, this should probably be blocked by #90, so that it can be controlled without a global. Let me think about accepting the PR without that, though, since I expect 99% of users will never change it.

@iredmail (Contributor) commented Apr 9, 2022

For backward compatibility, the default value should be unlimited, which is the same as the current release.

@jhillyerd (Owner)

That's a good point; let's give it a high or infinite default for now. I think we are about due to release 0.9.4. I need to look at #90 more and determine whether that should come before or after a 1.0.

@pavelbazika (Contributor, Author)

I'll give it a default value of 0, which means infinite. One more question: should I rather hide the global limit variable behind a getter/setter pair? It could be handy in the future, since you're planning an options struct.

@jhillyerd (Owner) commented Apr 11, 2022

No need for a get/set pair: my plan for options is that they'd be passed in with each call to decode; they wouldn't be global.

Edit: filed #241 for the broken pull request checks
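
(For context, the per-call, non-global approach being considered would look something like Go's functional options pattern. This is a hypothetical sketch of the shape only, not the enmime API at the time of this PR; all names here are illustrative.)

```go
package main

import "fmt"

// Option mutates a per-call configuration (hypothetical shape).
type Option func(*config)

type config struct {
	maxPartErrors int // 0 means unlimited
}

// WithMaxPartErrors is a hypothetical option capping stored part errors.
func WithMaxPartErrors(n int) Option {
	return func(c *config) { c.maxPartErrors = n }
}

// decode builds its configuration from the options passed to this call,
// so there is no global state to coordinate between callers.
func decode(raw string, opts ...Option) {
	cfg := &config{}
	for _, opt := range opts {
		opt(cfg)
	}
	fmt.Printf("decoding %d bytes with maxPartErrors=%d\n", len(raw), cfg.maxPartErrors)
}

func main() {
	decode("raw MIME message here", WithMaxPartErrors(100))
}
```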

error.go (Outdated)
@@ -25,6 +25,9 @@ const (
 	ErrorMissingRecipient = "no recipients (to, cc, bcc) set"
 )

+// MaxPartErrors limits number of part parsing errors, 0 means no limit.
@jhillyerd (Owner)

Please set the default to 0, and extend the comment to note that errors after the limit are ignored; hitting it does not cause parsing to fail.

@pavelbazika (Contributor, Author)

Ok, done

jhillyerd merged commit de45901 into jhillyerd:master on Apr 12, 2022
@jhillyerd (Owner)

Thank you!

pavelbazika deleted the limit-errors branch on June 2, 2022
@jhillyerd (Owner)

Heads up that this max-errors setting will be available as a Parser option (see #274) after the next release. I will remove the global variable in the release after that.
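
(In the meantime, the merged global can be set once before parsing. A usage sketch against enmime's public API as merged in this PR; the file name and limit value are examples.)

```go
package main

import (
	"fmt"
	"os"

	"github.com/jhillyerd/enmime"
)

func main() {
	// Cap stored part errors globally before parsing untrusted input.
	// 0 (the default) means unlimited, preserving the old behavior.
	enmime.MaxPartErrors = 100

	f, err := os.Open("message.eml")
	if err != nil {
		fmt.Fprintln(os.Stderr, err)
		return
	}
	defer f.Close()

	env, err := enmime.ReadEnvelope(f)
	if err != nil {
		fmt.Fprintln(os.Stderr, "parse failed:", err)
		return
	}
	fmt.Println("subject:", env.GetHeader("Subject"))
	fmt.Println("recorded errors:", len(env.Errors))
}
```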
