Fix huge imshow range #18458

tacaswell · 2020-09-11T21:48:12Z

PR Summary

We need to special case when rescaling the user input for
interpolation with LogNorm. The issue is that due to the inherent
precision limits of floating point math we can end up with errors on
the order if 1e-18 of the overall data range when rescaling. If the
bottom vlim is close to 0 and the top is order 10e20 than this error
can result in the rescaled vmin being negative which then fails when
we (temporarily) change the vmin/vmax on the norm.

We started adjusting the vmin/vmax to work around the same issue with
float precision in #17636 / 3dea5c7 to make sure that values exactly
equal to the limits did not move across the boundary and get
erroneously mapped is over/under.

Long term we may want to add the ability for the norms to suggest to
their clients what the constraints on the vmin/vmax are, but
hard-coding a special case for LogNorm is the pragmatic solution.

There are still some issues with this sort of extreme data range. With the default interpretation (anti-aliased) when zoomed out we will get stippling that disappears when you zoom in (because the interpolation kernels are not perfect and the constant background at -1 picks up noise on a scale big enough to get above 100 (this is because we shrink and re-scale the data to the [0, 1] range before interpolation and then scale back out to the full range at the end). It is extra visible on this example because things are flipping between "bad" (defaults to white) and "good but small".

PR Checklist

Has pytest style unit tests (and pytest passes).
Is Flake 8 compliant (run flake8 on changed files to check).

closes matplotlib#18415 We need to special case when rescaling the user input for interpolation with LogNorm. The issue is that due to the inherent precision limits of floating point math we can end up with errors on the order if 1e-18 of the overall data range when rescaling. If the bottom vlim is close to 0 and the top is order 10e20 than this error can result in the rescaled vmin being negative which then fails when we (temporarily) change the vmin/vmax on the norm. We started adjusting the vmin/vmax to work around the same issue with float precision in matplotlib#17636 / 3dea5c7 to make sure that values exactly equal to the limits did not move across the boundary and get erroneously mapped is over/under. Long term we may want to add the ability for the norms to suggest to their clients what the constraints on the vmin/vmax are, but hard-coding a special case for LogNorm is the pragmatic solution.

jklymak · 2020-09-11T22:19:00Z

This is probably fine for 3.3.2 but this really needs a proper rethink with tolerances for floating point error built in consistently and in a way we can explain to end users.

tacaswell · 2020-09-11T23:41:56Z

but this really needs a proper rethink with tolerances for floating point error built in consistently and in a way we can explain to end users

I'm really not sure there is a "right" way to do this because no matter what you do it is wrong in some situation.

jklymak · 2020-09-11T23:52:37Z

I agree its frustrating! But I feel we've been playing whack-a-mole - I think a bit of research and carefully laying out what we do and can't do would go a long way to fixing things.

FWIW, I think the current bug is pretty bad. I think that the "bug" we were trying to fix should really have been the user's domain. If you have vmin=1 and vmax=1e20, you cannot expect x=1e20 to be guaranteed to not be "over", and the user should know enough to put a bit of slop into the vmax they choose.

dopplershift · 2020-09-11T23:54:42Z

@jklymak I'm not sure this is the right venue to hash this out, but it's not even obvious to me why you'd need any margin on vmax.

jklymak · 2020-09-11T23:58:31Z

I think #16910 (comment) describes the issue relatively well, though I guess we eventually came to the wrong conclusion.

tacaswell · 2020-09-14T14:50:04Z

I stand by the conclusion we landed on, we should not expect users to be aware of our internal mechanizations / representations of the data / transforms and we should not silently re-order values.

All of this difficulty is driven by us using the Agg re-sampler for our up/down sampling of the images and it's [0, 1] + hard clipping constraints (which was in turn driven by changing the default colormap, seeing red in a viridis colored image, and realizing the colormapping -> resampling in RGBA space was not the right thing to do). If we replaced the resampler with something without those constraints we would at least be able to move all of these issues to the other side of the function call or avoid all together.

tacaswell · 2020-09-14T15:10:20Z

The two py38 failures are collisions with nbconvert. Not sure why it is windows only, but more recent jobs run with nbconvert 6.0.2 and these failed with 6.0.1, restarted the jobs to hopefully pick up the new versions.

jklymak · 2020-09-14T16:46:56Z

I'm just suggesting that for 3.4 someone sit down and write it out carefully and prove that its mathematically correct, and that we clearly delineate any floating point limitations we can't get around.

anntzer · 2020-09-14T16:58:00Z

Wouldn't it be simpler at some point to just rewrite the resampling code ourselves? It can't be that hard(*)...

(* conditions apply)

jklymak

This fixes the current problem...

jklymak · 2020-09-14T17:35:24Z

Wouldn't it be simpler at some point to just rewrite the resampling code ourselves? It can't be that hard(*)...

2-D convolution is pretty slow so I'd think it would have to be done in C? How quick is skimage and would we want that as a dependency? OTOH I don't see that it does filtering as it resamples, so its different than Agg. Same with scipy.ndimage.zoom

anntzer · 2020-09-14T18:48:42Z

I don't mind rewriting the resampling code in C myself (some day).

jklymak · 2020-09-14T18:54:50Z

... and all the filter/interp method support etc? It just seems like the kind of thing that people have done 100 times. Its just the Agg algorithm doesn't track over/under, whereas that is a "feature" we have.

anntzer · 2020-09-14T18:58:02Z

The filters are likely not that hard to implement themselves, as they are probably just one formula each...

lumberbot-app · 2020-09-14T20:15:12Z

Owee, I'm MrMeeseeks, Look at me.

There seem to be a conflict, please backport manually. Here are approximate instructions:

Checkout backport branch and update it.

$ git checkout v3.3.x
$ git pull

Cherry pick the first parent branch of the this PR on top of the older branch:

$ git cherry-pick -m1 0b21c7c2adbd547e4357d3ada9b0b3ab72d441af

You will likely have some merge/cherry-pick conflict here, fix them and commit:

$ git commit -am 'Backport PR #18458: Fix huge imshow range'

Push to a named branch :

git push YOURFORK v3.3.x:auto-backport-of-pr-18458-on-v3.3.x

Create a PR against branch v3.3.x, I would have named this PR:

"Backport PR #18458 on branch v3.3.x"

And apply the correct labels and milestones.

Congratulation you did some good work ! Hopefully your backport PR will be tested by the continuous integration and merged soon!

If these instruction are inaccurate, feel free to suggest an improvement.

Merge pull request matplotlib#18458 from tacaswell/fix_huge_imshow_range Fix huge imshow range Conflicts: lib/matplotlib/colors.py - had spurious commit in the initial PR which re-factored code not on the 3.3.x branch lib/matplotlib/tests/test_image.py - conflicts from many tests added to bottom of test_image.py on default branch

…-v3.3.x Backport PR #18458: Fix huge imshow range

tacaswell added 2 commits September 11, 2020 17:14

MNT: do not cache a version of inverse transform that can go stale

d13b20a

tacaswell added this to the v3.3.2 milestone Sep 11, 2020

jklymak approved these changes Sep 14, 2020

View reviewed changes

timhoffm approved these changes Sep 14, 2020

View reviewed changes

dopplershift approved these changes Sep 14, 2020

View reviewed changes

dopplershift merged commit 0b21c7c into matplotlib:master Sep 14, 2020

lumberbot-app bot added the status: needs manual backport label Sep 14, 2020

tacaswell deleted the fix_huge_imshow_range branch September 14, 2020 21:06

tacaswell removed the status: needs manual backport label Sep 14, 2020

tacaswell mentioned this pull request Sep 14, 2020

Backport PR #18458: Fix huge imshow range #18484

Merged

jklymak added a commit that referenced this pull request Sep 15, 2020

Merge pull request #18484 from tacaswell/auto-backport-of-pr-18458-on…

43bc27d

…-v3.3.x Backport PR #18458: Fix huge imshow range

jklymak mentioned this pull request Jun 22, 2021

test_huge_range_log is failing... #20487

Closed

QuLogic mentioned this pull request Jun 23, 2021

FIX: Include 0 when checking lognorm vmin #20488

Merged

7 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix huge imshow range #18458

Fix huge imshow range #18458

tacaswell commented Sep 11, 2020

jklymak commented Sep 11, 2020

tacaswell commented Sep 11, 2020

jklymak commented Sep 11, 2020

dopplershift commented Sep 11, 2020

jklymak commented Sep 11, 2020

tacaswell commented Sep 14, 2020

tacaswell commented Sep 14, 2020

jklymak commented Sep 14, 2020

anntzer commented Sep 14, 2020 •

edited

jklymak left a comment

jklymak commented Sep 14, 2020

anntzer commented Sep 14, 2020

jklymak commented Sep 14, 2020

anntzer commented Sep 14, 2020

lumberbot-app bot commented Sep 14, 2020

Fix huge imshow range #18458

Fix huge imshow range #18458

Conversation

tacaswell commented Sep 11, 2020

PR Summary

PR Checklist

jklymak commented Sep 11, 2020

tacaswell commented Sep 11, 2020

jklymak commented Sep 11, 2020

dopplershift commented Sep 11, 2020

jklymak commented Sep 11, 2020

tacaswell commented Sep 14, 2020

tacaswell commented Sep 14, 2020

jklymak commented Sep 14, 2020

anntzer commented Sep 14, 2020 • edited

jklymak left a comment

Choose a reason for hiding this comment

jklymak commented Sep 14, 2020

anntzer commented Sep 14, 2020

jklymak commented Sep 14, 2020

anntzer commented Sep 14, 2020

lumberbot-app bot commented Sep 14, 2020

anntzer commented Sep 14, 2020 •

edited