Improve SSIM documentation and warn about data range. #6595

mcourteaux · 2022-10-26T10:19:05Z

Many researchers rely on this function, but gives in case of floating-point data wrong results. I added some explanation on what might go wrong and how to accommodate for this. Personally, I'm in my fourth year of my PhD research and only find this out now. Probably, hundreds of papers have published incorrect SSIMs due to this subtlety.

mkcor

Thank you for this contribution! It should be highly valuable to many users.

skimage/metrics/_structural_similarity.py

mkcor · 2022-10-28T11:07:35Z

skimage/metrics/_structural_similarity.py

+    `dtype_range` in `skimage.util.dtype.py` has defined intervals from -1 to
+    +1. This yields an estimate of 2, instead of 1, which is most often


... for signed integers. Then the question is that of type inference.

Maybe link to https://scikit-image.org/docs/stable/user_guide/data_types.html as well.

stefanv · 2022-10-28T14:24:06Z

Perhaps we should error out unless the data range is explicitly provided?

mkcor · 2022-10-28T16:08:49Z

Perhaps we should error out unless the data range is explicitly provided?

Yes! It would be safer this way. These docs improvements would go into the error message then. Only error when floating-point input images?

stefanv · 2022-10-28T19:30:17Z

Perhaps we should error out unless the data range is explicitly provided?

Only error when floating-point input images?

Implementation would be simpler to always error, but will break a bunch of code unnecessarily. Erroring on float only is therefore slightly preferable, IMO.

mcourteaux · 2022-10-29T16:11:29Z

I think erroring is extremely useful actually. As this change propagates through to people across the world, they'll have to pause for a second and think about this. I would guess many people will figure out their results were wrong. In general, this is going to be annoying for a lot of individuals, but at least now they know, and it feels like the most honest thing to do.

Co-authored-by: Marianne Corvellec <marianne.corvellec@ens-lyon.org>

mcourteaux · 2022-10-29T16:20:11Z

Sort of the only case where the estimate is going to be reliably correct is if you have uint8 data. All other cases can be vague and misleading. You very easily have a numpy array of uint8 type where in some calculation you add a (A+B)*0.5 somewhere and the whole array switches to being float, but the data is still ranged in 0-255.

stefanv · 2022-10-29T16:34:36Z

All makes good sense, I agree with your assessment.

mkcor · 2022-10-31T07:49:11Z

@mcourteaux would you like to implement the erroring behaviour on floating-point data here, by adding new commits?

@stefanv or should we merge this first (since this PR already brings an improvement) and let @mcourteaux submit a follow-up PR?

mcourteaux · 2022-10-31T09:24:59Z

@mcourteaux would you like to implement the erroring behaviour on floating-point data here, by adding new commits?

I could give it a go. Currently on a break. Will do this in a couple of days. Feel free to also just merge it and then in some days, I submit a new PR.

mkcor · 2022-10-31T11:50:22Z

I could give it a go. Currently on a break. Will do this in a couple of days. Feel free to also just merge it and then in some days, I submit a new PR.

Wonderful! I have just approved this PR; it takes an approval by another maintainer before we can merge it. Your second PR will be most welcome. Thanks again, @mcourteaux!

stefanv · 2022-10-31T15:48:53Z

Thanks! I filed a tracking issue.

mcourteaux added 2 commits October 26, 2022 12:18

Fix whitespace requirements.

e97b19d

mkcor reviewed Oct 28, 2022

View reviewed changes

Apply suggestions from code review

fe5f1ae

Co-authored-by: Marianne Corvellec <marianne.corvellec@ens-lyon.org>

mkcor approved these changes Oct 31, 2022

View reviewed changes

stefanv merged commit ff699a5 into scikit-image:main Oct 31, 2022

stefanv mentioned this pull request Oct 31, 2022

SSIM should fail unless data_range specified (especially for float) #6602

Closed

jarrodmillman added this to the 0.20 milestone Nov 17, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Improve SSIM documentation and warn about data range. #6595

Improve SSIM documentation and warn about data range. #6595

mcourteaux commented Oct 26, 2022

mkcor left a comment

mkcor Oct 28, 2022

stefanv commented Oct 28, 2022

mkcor commented Oct 28, 2022

stefanv commented Oct 28, 2022

mcourteaux commented Oct 29, 2022

mcourteaux commented Oct 29, 2022

stefanv commented Oct 29, 2022

mkcor commented Oct 31, 2022

mcourteaux commented Oct 31, 2022

mkcor commented Oct 31, 2022

stefanv commented Oct 31, 2022

		`dtype_range` in `skimage.util.dtype.py` has defined intervals from -1 to
		+1. This yields an estimate of 2, instead of 1, which is most often

Improve SSIM documentation and warn about data range. #6595

Improve SSIM documentation and warn about data range. #6595

Conversation

mcourteaux commented Oct 26, 2022

mkcor left a comment

Choose a reason for hiding this comment

mkcor Oct 28, 2022

Choose a reason for hiding this comment

stefanv commented Oct 28, 2022

mkcor commented Oct 28, 2022

stefanv commented Oct 28, 2022

mcourteaux commented Oct 29, 2022

mcourteaux commented Oct 29, 2022

stefanv commented Oct 29, 2022

mkcor commented Oct 31, 2022

mcourteaux commented Oct 31, 2022

mkcor commented Oct 31, 2022

stefanv commented Oct 31, 2022