PBM decoder robustness improvements and BufferedReadStream observability #2551

antonfirsov · 2023-10-14T00:28:02Z

Prerequisites

I have written a descriptive pull-request title
I have verified that there are no overlapping pull-requests open
I have verified that I am following the existing coding patterns and practice as demonstrated in the repository. These follow strict Stylecop rules 👮.
I have provided test coverage for my change (where applicable)

Description

Add additional checks to handle corrupt files better in the PBM decoder.

Also extend BufferedReadStream to count how many times read calls have hit EOF. This makes decoder behavior easier to test. The perf impact of the BufferedReadStream change is within the margin of error.
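
To illustrate the idea of counting EOF hits, here is a minimal sketch of a wrapper stream. It is not the BufferedReadStream change from this PR: the `EofCountingStream` type and its `EofHits` property are hypothetical names chosen for the example, and the wrapper only assumes the standard `System.IO.Stream` contract.

```csharp
using System;
using System.IO;

// Hypothetical wrapper for illustration only; not the BufferedReadStream code.
public sealed class EofCountingStream : Stream
{
    private readonly Stream inner;

    public EofCountingStream(Stream inner) => this.inner = inner;

    // Number of read calls that asked for data but received none.
    public int EofHits { get; private set; }

    public override int Read(byte[] buffer, int offset, int count)
    {
        int read = this.inner.Read(buffer, offset, count);
        if (read == 0 && count > 0)
        {
            this.EofHits++;
        }

        return read;
    }

    public override bool CanRead => this.inner.CanRead;

    public override bool CanSeek => this.inner.CanSeek;

    public override bool CanWrite => false;

    public override long Length => this.inner.Length;

    public override long Position
    {
        get => this.inner.Position;
        set => this.inner.Position = value;
    }

    public override void Flush() => this.inner.Flush();

    public override long Seek(long offset, SeekOrigin origin) => this.inner.Seek(offset, origin);

    public override void SetLength(long value) => throw new NotSupportedException();

    public override void Write(byte[] buffer, int offset, int count) => throw new NotSupportedException();
}
```

A test could wrap a truncated file in such a stream, run the decoder, and assert that the counter stays small, which would show that the decoder stops reading soon after the data runs out instead of polling an exhausted stream.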

antonfirsov · 2023-10-14T00:29:18Z

tests/ImageSharp.Tests/Formats/Pbm/PbmMetadataTests.cs

-        Assert.Equal(default, info.Size);
-        Configuration.Default.ImageFormatsManager.TryFindFormatByFileExtension("pbm", out IImageFormat format);
-        Assert.Equal(format!, info.Metadata.DecodedImageFormat);
+        Assert.Throws<InvalidImageContentException>(() => Image.Identify(bytes));


It doesn't really matter but I don't think decoding makes much sense when the EOF is right in the header.

Yeah, I'm happy with that.

JimBobSquarePants · 2023-10-14T04:56:03Z

src/ImageSharp/Formats/Pbm/PbmDecoderCore.cs

-        stream.SkipWhitespaceAndComments();
-        int height = stream.ReadDecimal();
-        stream.SkipWhitespaceAndComments();
+        if (!stream.SkipWhitespaceAndComments() ||


I really like this!

The only thing I would suggest is to change the method names to use the Try prefix to match convention.

I've pushed those changes.

Ha, didn't realize you'd explicitly removed them! I can revert at a later time if you feel strongly about it.

I really prefer it to be the other way.

By convention, TryDoSomething(...) == false should indicate that the operation failed as a whole. This is not the case with TryReadDecimal(out value) == false when it happens at the very last digit in the file: the decoding of the digit was successful (we should treat the result as valid!), it is just that the file reached EOF.

I like following such conventions to the letter, because ambiguity can lead to misunderstandings and programming mistakes.

I’m not precious about it (and happy to revert) but there’s something a little clunky about the method there. We’re really returning two things, the decimal and the stream state. I’m sure I’ve seen something in the runtime that handles it better (maybe inside the GUID parsing code).

I considered returning a (bool, int) tuple, but that would lead to much more complicated code at the call sites, since we couldn't do !stream.SkipWhitespaceAndComments() || .... Defining the names without the Try prefix and documenting the behavior was the best idea I was able to come up with.
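
The semantics discussed in this thread can be sketched as follows. This is not the ImageSharp extension method: `PlainTextReader` is a hypothetical helper that works on a plain `System.IO.Stream`, and it assumes that the number is terminated either by a single non-digit byte (which it consumes) or by EOF. The point is that a `false` return reports the stream state, while the parsed value stays valid.

```csharp
using System.IO;

// Hypothetical helper for illustration only; not the ImageSharp implementation.
public static class PlainTextReader
{
    // Parses consecutive ASCII digits into 'value'.
    // Returns false if EOF was reached while reading.
    // 'value' holds the digits parsed so far in both cases.
    public static bool ReadDecimal(Stream stream, out int value)
    {
        value = 0;
        while (true)
        {
            int b = stream.ReadByte();
            if (b < 0)
            {
                // EOF: the digits parsed so far are still a valid result.
                return false;
            }

            if (b < '0' || b > '9')
            {
                // A non-digit byte ended the number; more data may follow.
                return true;
            }

            value = (value * 10) + (b - '0');
        }
    }
}
```

For the input "12 7", the first call returns true with the value 12 and the second call returns false with the value 7. Under the Try convention the second result would read as a failed parse, which is the ambiguity the names without the prefix are meant to avoid.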

JimBobSquarePants · 2023-10-14T04:58:29Z

src/ImageSharp/Formats/Pbm/PlainDecoder.cs

-                byte value = (byte)stream.ReadDecimal();
-                stream.SkipWhitespaceAndComments();
-                rowSpan[x] = new L8(value);
+                stream.ReadDecimal(out int value);


Shouldn't we test here?

I guess it doesn't matter since we're simply assigning 0.

No, when it's the last digit of the file the value is valid! For simplicity, I'm ignoring the return value here and letting the SkipWhitespaceAndComments call below detect EOF.

JimBobSquarePants

Very nice. Liking the EOF counter changes!

antonfirsov · 2023-10-14T11:49:05Z

src/ImageSharp/Formats/Pbm/BinaryDecoder.cs

@@ -71,7 +71,11 @@ private static void ProcessGrayscale<TPixel>(Configuration configuration, Buffer

        for (int y = 0; y < height; y++)
        {
-            stream.Read(rowSpan);
+            if (stream.Read(rowSpan) == 0)


This is lame actually. I should have sliced down rowSpan with the result of stream.Read() for the pixel conversion done later, or used the condition stream.Read(rowSpan) < rowSpan.Length (the latter is stricter, but simpler).

This has little practical relevance (BufferedReadStream is hitting EOF twice and FromL8Bytes is converting some memory garbage for broken files), so no need to fix it now, but the code looks stupid :)

We can improve it with a follow-up.
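
The stricter check mentioned above, treating any row that could not be filled completely as premature EOF, might look like the sketch below. `RowReader.TryFillRow` is a hypothetical helper, not code from this PR or its follow-up. It assumes a general `System.IO.Stream`, where a single `Read` call may legitimately return fewer bytes than requested, so it loops until the row is full or the stream ends.

```csharp
using System;
using System.IO;

// Hypothetical helper for illustration only; not code from this PR.
public static class RowReader
{
    // Fills 'row' completely, or returns false if the stream ends first.
    // On failure the tail of 'row' is left untouched and must not be converted.
    public static bool TryFillRow(Stream stream, Span<byte> row)
    {
        int total = 0;
        while (total < row.Length)
        {
            int read = stream.Read(row.Slice(total));
            if (read == 0)
            {
                // Premature EOF: only the first 'total' bytes are valid.
                return false;
            }

            total += read;
        }

        return true;
    }
}
```

Here the Try prefix fits the convention discussed earlier, because a false result means the operation failed as a whole and the caller should stop decoding rather than convert the partially filled buffer.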

antonfirsov added 4 commits October 9, 2023 03:56

handle premature EOF in the PBM decoder

8241a5a

BufferedReadStreamExtensions: remove the 'Try' prefix

fecaa53

count EOF hits in BufferedReadStream

a1d7284

use EofHitCounter in pbm tests

c74fbec

antonfirsov added bug formats:pbm labels Oct 14, 2023

antonfirsov and others added 2 commits October 14, 2023 04:23

Merge branch 'main' into pbm-eof-02

c359d13

Merge branch 'main' into pbm-eof-02

e5ba24c

JimBobSquarePants reviewed Oct 14, 2023

Naming convention tweaks

9b42de6

JimBobSquarePants approved these changes Oct 14, 2023

JimBobSquarePants merged commit d76fe6f into main Oct 14, 2023
8 checks passed

JimBobSquarePants deleted the pbm-eof-02 branch October 14, 2023 05:59

antonfirsov added a commit that referenced this pull request Oct 14, 2023

Follow up on post-merge discussions in #2551

45b6c36

JimBobSquarePants pushed a commit that referenced this pull request Oct 14, 2023

Follow up on post-merge discussions in #2551 (#2552)

d93bc6c

antonfirsov added a commit that referenced this pull request Oct 15, 2023

PBM decoder robustness improvements and BufferedReadStream observability

1f3aa0c

Backport of #2551 & #2552

antonfirsov mentioned this pull request Oct 15, 2023

[release/2.1] PBM decoder robustness improvements and BufferedReadStream observability #2555

Merged

antonfirsov added a commit that referenced this pull request Oct 16, 2023

PBM decoder robustness improvements and BufferedReadStream observability (#2551)

954f5b7

antonfirsov mentioned this pull request Oct 16, 2023

[release/3.0] PBM decoder robustness improvements and BufferedReadStream observability #2559

Merged

JimBobSquarePants pushed a commit that referenced this pull request Oct 16, 2023

PBM decoder robustness improvements and BufferedReadStream observability (#2551) (#2559)

645fc83
