improve error message for cost function #863

amanmdesai · 2023-04-10T10:07:46Z

This PR is regarding #791

HDembinski · 2023-04-11T09:55:16Z

Cool, thanks! I think the error message is spot on.

Would you be willing to also write a test for this? Otherwise I would merge as is and write a test at some later point. Ideally we want to maintain 100 % test coverage.

A test would go to tests/test_cost.py and the pytest facility to use is with pytest.raises(ValueError): <create condition which triggers the exception>. Please have a look at examples in the source code, just search for pytest.raises.

HDembinski · 2023-04-11T13:28:14Z

Thinking more about it, there are a couple of things that need to be improved.

I think we should check that the shapes are compatible, not only the length, since the binned cost functions support N-dimensional histograms.
The check should be implemented in class BinnedCostWithModel, so that all derived classes benefit from it. A good place seems to be BinnedCostWithModel._pred.

amanmdesai · 2023-04-11T14:14:53Z

Hi @HDembinski

Thanks for the feedback and suggesstions.
I will work on them.

amanmdesai · 2023-04-14T08:44:56Z

Hi @HDembinski

I have added a test for raisevalue error using pytest.raises. However this test fails since scipy is not installed.
In some cases, the other tests in test_cost use the following statement:

@pytest.mark.skipif(not scipy_available, reason="scipy.stats is needed")

so shall I include the above statement to skip test if scipy is not available?
locally, the test seem to pass.

I have also implemented a check for shape compatibility.

Please let me know if something needs to be changed.

Thanks,
Aman

HDembinski · 2023-04-14T12:51:30Z

so shall I include the above statement to skip test if scipy is not available?

Yes, please. This is happening because scipy is an optional dependency in some methods. Those methods work with and without scipy being installed, but execute different code in both cases. We want to test both code paths, that's why the tests are run with and without scipy. When you test a method for which scipy is required, you need to explicitly mark so that it is skipped in those test runs.

amanmdesai · 2023-04-14T13:23:46Z

so shall I include the above statement to skip test if scipy is not available?

Yes, please. This is happening because scipy is an optional dependency in some methods. Those methods work with and without scipy being installed, but execute different code in both cases. We want to test both code paths, that's why the tests are run with and without scipy. When you test a method for which scipy is required, you need to explicitly mark so that it is skipped in those test runs.

I see. Thanks. I have added that statment.
I am not sure what to write in tests to check compatibility of shapes (which is probably the reason for decrease in coverage).

src/iminuit/cost.py

HDembinski · 2023-04-14T14:38:48Z

src/iminuit/cost.py

+            if len(self._xe_shape) != len(self._model_xe):
+                raise ValueError("Variable shapes do not match")


I think this always matches by construction, no? So this if should never trigger and can be removed.

src/iminuit/cost.py

HDembinski · 2023-04-14T14:40:43Z

src/iminuit/cost.py

+        if len(self._data) != len(d):
+            raise ValueError(
+                f"Expected model to return an array of size {len(self._data)},"
+                f"but it returns an array of size {len(d)}"
+            )


Good, but d and self._data could be multidimensional, so we want to check self._data.shape == d.shape.

The error message needs to be adapted accordingly, "array of size" -> "array with shape" etc

HDembinski · 2023-04-14T14:44:16Z

tests/test_cost.py

+@pytest.mark.skipif(not scipy_available, reason="scipy.stats is needed")
+def test_error_message_cost():
+    from iminuit import cost
+    import numpy as np
+    from scipy.stats import norm


Ah, this test only requires scipy because you import norm from scipy.stats. That's not necessary. We don't actually want to fit this cost function, so we can invent a bogus model that just returns a constant array with the wrong shape. In other words, you don't need norm to test this.

Please also test the multi-dimensional case, where edges is a tuple of two arrays, with edges for the x and y-axis, and n is a 2d array.

amanmdesai · 2023-04-16T06:55:41Z

Hi @HDembinski

I have implemented all suggesstions with the exception of 2D test (I could not implement this test).

HDembinski · 2023-04-17T10:15:59Z

I have implemented all suggesstions with the exception of 2D test (I could not implement this test).

Ok, I implemented the 2D test and found some issue with our implementation. We should already test the length of the array returned by the model before it is further reshaped by _pred, otherwise there still will be confusing error messages.

I made a couple of unrelated changes, to make mypy happier and to increase coverage back to 100 %.

amanmdesai · 2023-04-17T10:17:15Z

I have implemented all suggesstions with the exception of 2D test (I could not implement this test).

Ok, I implemented the 2D test and found some issue with our implementation. We should already test the length of the array returned by the model before it is further reshaped by _pred, otherwise there still will be confusing error messages.

I made a couple of unrelated changes, to make mypy happier and to increase coverage back to 100 %.

Thanks a lot @HDembinski for your help and guidance!

HDembinski · 2023-04-17T10:18:32Z

Thanks @amanmdesai !

improve error message for cost function

d527fa4

move the improved cost function error message to binnedcostmodel class

7cdd8de

amanmdesai added 2 commits April 14, 2023 13:41

add check for variable shape compatibility

af92605

add test for improved cost function error message

dbb726f

update test to skip scipy

fdb93da

amanmdesai force-pushed the imrpoved-error-message branch from ad2d953 to fdb93da Compare April 14, 2023 13:15

Merge branch 'develop' into imrpoved-error-message

732cac6

HDembinski reviewed Apr 14, 2023

View reviewed changes

src/iminuit/cost.py Outdated Show resolved Hide resolved

update

3ca9ac2

HDembinski reviewed Apr 14, 2023

View reviewed changes

src/iminuit/cost.py Outdated Show resolved Hide resolved

HDembinski reviewed Apr 14, 2023

View reviewed changes

amanmdesai added 2 commits April 16, 2023 12:22

update

b9d04c9

update

6ee1f8a

fixes

0fb256f

HDembinski merged commit 905dbbd into scikit-hep:develop Apr 17, 2023
7 checks passed

amanmdesai mentioned this pull request May 2, 2023

Improve error messages when using builtin cost functions #791

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

improve error message for cost function #863

improve error message for cost function #863

amanmdesai commented Apr 10, 2023

HDembinski commented Apr 11, 2023 •

edited

HDembinski commented Apr 11, 2023

amanmdesai commented Apr 11, 2023

amanmdesai commented Apr 14, 2023

HDembinski commented Apr 14, 2023

amanmdesai commented Apr 14, 2023

HDembinski Apr 14, 2023 •

edited

HDembinski Apr 14, 2023

HDembinski Apr 14, 2023 •

edited

HDembinski Apr 14, 2023

HDembinski Apr 14, 2023

amanmdesai commented Apr 16, 2023

HDembinski commented Apr 17, 2023

amanmdesai commented Apr 17, 2023

HDembinski commented Apr 17, 2023

		if len(self._xe_shape) != len(self._model_xe):
		raise ValueError("Variable shapes do not match")

improve error message for cost function #863

improve error message for cost function #863

Conversation

amanmdesai commented Apr 10, 2023

HDembinski commented Apr 11, 2023 • edited

HDembinski commented Apr 11, 2023

amanmdesai commented Apr 11, 2023

amanmdesai commented Apr 14, 2023

HDembinski commented Apr 14, 2023

amanmdesai commented Apr 14, 2023

HDembinski Apr 14, 2023 • edited

Choose a reason for hiding this comment

HDembinski Apr 14, 2023

Choose a reason for hiding this comment

HDembinski Apr 14, 2023 • edited

Choose a reason for hiding this comment

HDembinski Apr 14, 2023

Choose a reason for hiding this comment

HDembinski Apr 14, 2023

Choose a reason for hiding this comment

amanmdesai commented Apr 16, 2023

HDembinski commented Apr 17, 2023

amanmdesai commented Apr 17, 2023

HDembinski commented Apr 17, 2023

HDembinski commented Apr 11, 2023 •

edited

HDembinski Apr 14, 2023 •

edited

HDembinski Apr 14, 2023 •

edited