Tried an approach for the text case in pixelmatch #7552

Open
wants to merge 10 commits into dev-2.0
Conversation

Vaivaswat2244

Resolves #7496

Changes:

Modified the checkMatch function in visualtest.js to be more tolerant of slight text position shifts while still catching meaningful changes. Key changes (a rough sketch of both ideas follows the list below):

Added text detection using pixel pattern analysis:

  • Looks for moderate density and high contrast transitions typical of text
  • Helps identify regions where small shifts should be allowed

Implemented shift tolerance for text regions:

  • Allows up to 2-pixel shifts for detected text areas
  • Still fails on actual content changes
  • Configurable via shiftThreshold parameter
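For reference, a minimal sketch of the two ideas, assuming RGBA pixel data in a Uint8ClampedArray. isLikelyText and shiftThreshold are the names used in this PR; the matchesWithinShift helper and all specific thresholds here are illustrative only, not the exact visualtest.js code:

// Heuristic text detection: text regions tend to have moderate ink
// density plus frequent high-contrast horizontal transitions.
function isLikelyText(pixels, width, height) {
  let darkPixels = 0;
  let transitions = 0;
  for (let y = 0; y < height; y++) {
    for (let x = 1; x < width; x++) {
      const i = 4 * (y * width + x);
      const lum = 0.299 * pixels[i] + 0.587 * pixels[i + 1] + 0.114 * pixels[i + 2];
      const prev = 0.299 * pixels[i - 4] + 0.587 * pixels[i - 3] + 0.114 * pixels[i - 2];
      if (lum < 128) darkPixels++;
      if (Math.abs(lum - prev) > 100) transitions++; // sharp edge
    }
  }
  const density = darkPixels / (width * height);
  // Moderate density plus many sharp edges is typical of rendered text
  return density > 0.05 && density < 0.5 && transitions / (width * height) > 0.02;
}

// Shift tolerance: treat a pixel as matching if an identical pixel
// exists within shiftThreshold (up to 2) pixels in any direction.
function matchesWithinShift(expected, actual, x, y, width, height, shiftThreshold = 2) {
  const at = (img, px, py) => {
    const i = 4 * (py * width + px);
    return `${img[i]},${img[i + 1]},${img[i + 2]},${img[i + 3]}`;
  };
  const target = at(actual, x, y);
  for (let dy = -shiftThreshold; dy <= shiftThreshold; dy++) {
    for (let dx = -shiftThreshold; dx <= shiftThreshold; dx++) {
      const nx = x + dx, ny = y + dy;
      if (nx < 0 || ny < 0 || nx >= width || ny >= height) continue;
      if (at(expected, nx, ny) === target) return true;
    }
  }
  return false;
}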

Screenshots of the change:

PR Checklist

@davepagurek
Contributor

Thanks @Vaivaswat2244, nice approach! I have a few questions that might help us determine how to get this across the finish line.

  • Do you have a sense of how often isLikelyText returns true, and if this gets called on anything non-text? I wonder if maybe rather than trying to detect whether we think there is text, we can directly record when text is called. Since we pass in a p5 instance to the test case anyway, maybe we can do something like this to patch text() to record that it got called:

    // Wrap text() so we can record whether this test ever draws text
    let hasText = false
    const prevText = myp5.text
    myp5.text = function(...args) {
      hasText = true
      prevText.apply(this, args)
    }
  • It looks like some tests are failing on CI now. Do you know if these look like (1) false positives that the checker needs to address, (2) cases where we need to regenerate the screenshots because the old ones were out of date but still passed on CI under the old pixel checker, or (3) actual bugs in p5 that are surfaced now by using a better pixel checker?

@Vaivaswat2244
Author

Vaivaswat2244 commented Feb 16, 2025

Yeah sure, patching text() makes much more sense. I can implement that.
Regarding the failing tests, I think these are the 2nd case you described: they failed the CI tests after regenerating on my machine, but were not passed by the old checker (although the previous ones seemed fine). I'm not so sure how to move ahead from here, so I guess for now I'll make the text() change and see how that works out.

@davepagurek
Contributor

Are you able to see the test output from CI on your PR by clicking the Details button here?
[image]

For a next step, I'd look at the test output, and try regenerating the failed screenshots locally and pushing those. We can see if that fixes it on CI, and also by looking at the new images in the Files Changed tab, whether we think the changed images are expected.

@Vaivaswat2244
Author

Hey, so I just noticed that the files I added in the earlier commit aren't the ones failing the tests on CI; the failures are in other files that weren't touched.

I then changed the files that were failing tests on CI and ran the tests locally. The tests now fail for different files in the typography and webgl cases, so I guess there is a problem with my changes in the visualtest file.
Maybe I'll revert these changes and think of something else.

@davepagurek
Contributor

Just looking at the last failure in the test file:

Expected:
[image]

Received:
[image]

Diff:
[image]

...it looks like a few isolated pixels are off. I wonder if we can do something to augment pixelmatch and check if the difference exists just on a single pixel (so no surrounding pixels are also flagged) and then ignore those?
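A sketch of what that check could look like, assuming a 0/1 diff mask built from pixelmatch's output; the isIsolatedDiff helper name is hypothetical:

// Returns true when a flagged pixel has no flagged pixels among its
// 8 neighbours, i.e. the difference is a single isolated pixel.
function isIsolatedDiff(diffMask, width, height, x, y) {
  for (let dy = -1; dy <= 1; dy++) {
    for (let dx = -1; dx <= 1; dx++) {
      if (dx === 0 && dy === 0) continue;
      const nx = x + dx, ny = y + dy;
      if (nx < 0 || ny < 0 || nx >= width || ny >= height) continue;
      if (diffMask[ny * width + nx]) return false; // a neighbour differs too
    }
  }
  return true; // no flagged neighbours, so this pixel can be ignored
}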

@Vaivaswat2244
Author

Sure, I can try that

@Vaivaswat2244
Author

Hey @davepagurek, so I tried to customize pixelmatch for our needs. To handle the isolated pixels, and also some small clusters which I saw in some failed screenshots, I tried a clustering approach.
The clustering algorithm uses a queue-based breadth-first search (BFS) to identify and analyze connected pixel differences (see the sketch after this list):

  1. Initial Difference Detection:

    • Pixelmatch identifies different pixels between images
    • Different pixels are marked in red in the diff canvas
  2. Cluster Identification Process:

    • Start from each different pixel
    • Use a queue-based breadth-first search to find connected pixels
    • Explore neighboring pixels within a defined radius
    • Track:
      • Total number of connected different pixels
      • Cluster size
      • Cluster center coordinates
  3. Cluster Filtering Criteria:

    • Ignore small, isolated pixel differences
    • Focus on meaningful, connected changes
    • Parameters control sensitivity:
      • minClusterSize: Minimum pixels to consider a significant difference
      • maxTotalDiffPixels: Maximum total different pixels allowed
  4. Key Benefits:

    • Distinguishes between minor rendering variations
    • Catches substantial visual changes
    • Provides detailed diff analysis
    • Adaptable to different testing requirements
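Here is a rough sketch of the steps above, assuming a 0/1 diff mask from pixelmatch. minClusterSize and maxTotalDiffPixels are the parameters named in the list; the findClusters and isSignificantDiff helpers are illustrative, not the exact PR code:

// BFS clustering over the diff mask: each connected group of flagged
// pixels becomes one cluster with a size and a center.
function findClusters(diffMask, width, height) {
  const visited = new Uint8Array(width * height);
  const clusters = [];
  for (let start = 0; start < width * height; start++) {
    if (!diffMask[start] || visited[start]) continue;
    const queue = [start]; // queue-based BFS from each unvisited diff pixel
    visited[start] = 1;
    let size = 0, sumX = 0, sumY = 0;
    while (queue.length > 0) {
      const idx = queue.shift();
      const x = idx % width;
      const y = (idx - x) / width;
      size++; sumX += x; sumY += y;
      // Explore the neighbourhood (radius 1 here; the "defined radius"
      // above could be larger)
      for (let dy = -1; dy <= 1; dy++) {
        for (let dx = -1; dx <= 1; dx++) {
          const nx = x + dx, ny = y + dy;
          if (nx < 0 || ny < 0 || nx >= width || ny >= height) continue;
          const n = ny * width + nx;
          if (diffMask[n] && !visited[n]) {
            visited[n] = 1;
            queue.push(n);
          }
        }
      }
    }
    clusters.push({ size, centerX: sumX / size, centerY: sumY / size });
  }
  return clusters;
}

// Filtering: drop clusters below minClusterSize, then fail only if the
// remaining flagged pixels exceed maxTotalDiffPixels.
function isSignificantDiff(clusters, minClusterSize, maxTotalDiffPixels) {
  const kept = clusters.filter(c => c.size >= minClusterSize);
  const total = kept.reduce((sum, c) => sum + c.size, 0);
  return total > maxTotalDiffPixels;
}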

Now, the thing is that I have committed two variations of the approach: the earlier one is less tolerant (it failed the same cases that are failing on CI), and the later one is more tolerant (it passed all the cases locally).

So I need some help with this. In one of the failed cases:
Expected:
[image]

Received:
[image]

Diff:
[image]

Now, this is a scenario that fails in both variations (less and more tolerant).
Can you suggest some changes for this? I think this scheme can solve our problem, since the difference between the failed CI tests and my local results is minimal, which I'm assuming means the CI is working as expected.

@davepagurek
Contributor

Nice, the clustering idea is good and gives us some options for dealing with issues like this! I suspect if a shape is consistently 1px thick, then it could be the result of a shift due to environment differences.

A way to check that might be to see if every pixel in the cluster has at most two neighbours?

@Vaivaswat2244
Author

Vaivaswat2244 commented Mar 7, 2025

I have made some changes to this, so now, in addition to the clustering logic, our algorithm does the following (a sketch follows the list):

  • Iterates through all 8 neighboring pixels (excluding itself) to determine how many are part of the difference.
  • If more than 80% of the pixels in the cluster have ≤2 neighbors, the difference is classified as a line shift (isLineShift = true).
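A minimal sketch of that heuristic; isLineShift is the flag named above, while the cluster representation (a list of {x, y} pixels) and the helper signature are assumptions:

// Count flagged neighbours for each pixel in a cluster, then classify
// the cluster as a 1px line shift when at least 80% of its pixels have
// two or fewer flagged neighbours.
function isLineShift(clusterPixels, diffMask, width, height) {
  let thin = 0;
  for (const { x, y } of clusterPixels) {
    let neighbours = 0;
    // Check all 8 neighbouring pixels, excluding the pixel itself
    for (let dy = -1; dy <= 1; dy++) {
      for (let dx = -1; dx <= 1; dx++) {
        if (dx === 0 && dy === 0) continue;
        const nx = x + dx, ny = y + dy;
        if (nx < 0 || ny < 0 || nx >= width || ny >= height) continue;
        if (diffMask[ny * width + nx]) neighbours++;
      }
    }
    if (neighbours <= 2) thin++;
  }
  return thin / clusterPixels.length >= 0.8;
}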

Using this, and keeping the thresholds the same as in the more tolerant commit, the tests are now passing.
After some trial and error with the thresholds, I found that 0.5 for pixelmatch is optimal for us, as it passes the tests in the default cases. And in PR #7495, I reverted the changes and added pervVertex, and the test which you added was the only one that failed, so I think these are the expected results.

Expected:
[image]

Received:
[image]

Diff:
[image]

So, is there anything else I should change?

Contributor

@davepagurek davepagurek left a comment


Awesome, this is looking really promising, great work! I think the algorithm is in a spot where it's good to go. The last thing might be to add a comment somewhere explaining the general approach of the diff algorithm, similar to what you've described in comments to me here, and maybe additionally why it's necessary at all (because contributors running tests just on their own system might not realize that there will be differences on CI.)

const ratio = expected.width / expected.height;
const narrow = ratio !== 1;
if (narrow) {
  scale *= 2;
}

Contributor


Can we take out the whitespace here?

Author


Right!

// Define significance thresholds
const MIN_CLUSTER_SIZE = 4; // Minimum pixels in a significant cluster
const MAX_TOTAL_DIFF_PIXELS = 40; // Maximum total different pixels
const MAX_LINE_SHIFT_PIXELS = 200;
Contributor


Does this get used?

Author

@Vaivaswat2244 Vaivaswat2244 Mar 7, 2025


The first two are used in the checkMatch function itself for the final comparison.
Varying these thresholds changes the test results as well: reducing MIN_CLUSTER_SIZE from this value makes our check less tolerant and can cause false CI failures like we saw in the isolated pixel case.
MAX_LINE_SHIFT_PIXELS is not needed; I'll remove it.
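For illustration, a sketch of how those two constants could factor into the final verdict, assuming the cluster objects produced by the BFS step; this is not the exact checkMatch code:

// Clusters below MIN_CLUSTER_SIZE are ignored; the match fails only if
// the remaining different pixels exceed MAX_TOTAL_DIFF_PIXELS.
const significant = clusters.filter(c => c.size >= MIN_CLUSTER_SIZE);
const totalDiffPixels = significant.reduce((sum, c) => sum + c.size, 0);
const ok = totalDiffPixels <= MAX_TOTAL_DIFF_PIXELS;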
