Fewer checks on xdata for curve_fit. #10196

anntzer · 2019-05-20T09:58:20Z

The xdata argument to curve_fit is essentially a helper in the case
where one wants to write the fit function as func(x, *params). Until
scipy 1.3.0, it was possible to just pass a dummy value (e.g. None) as
xdata, e.g.

curve_fit(lambda _, a: a * np.arange(10), None, 2 * np.arange(10))

A comment in the implementation clearly states that the intent was for
xdata to be anything, including a non-array:

    if isinstance(xdata, (list, tuple, np.ndarray)):
        # `xdata` is passed straight to the user-defined `f`, so allow
        # non-array_like `xdata`.

(Effectively, this gives an interface closer to that of least_squares,
with the advantage that the covariance matrix is also returned.)

1.3.0 added two checks that make this impossible: it now requires that
xdata is not empty, and casts it to float64 for improved precision.

The non-emptiness check is spurious: it was introduced at the same time
as a check that ydata not be empty (which is necessary to avoid a
confusing error message, and is reasonable). This PR removes it.

The cast to float64 only makes sense when xdata is actually an
array-like, so this PR applies it only in that case.
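
For illustration, the intended handling of xdata can be sketched roughly as follows. This is a minimal standalone sketch, not the code in scipy/optimize/minpack.py: the helper name `prepare_xdata` is hypothetical, and only the `isinstance` check and the `np.asarray` / `np.asarray_chkfinite` calls are taken from the diff in this PR.

```python
import numpy as np


def prepare_xdata(xdata, check_finite=True):
    """Hypothetical helper mirroring the xdata handling described above."""
    if isinstance(xdata, (list, tuple, np.ndarray)):
        # Array-like input: cast to float64, optionally rejecting NaN/inf.
        if check_finite:
            return np.asarray_chkfinite(xdata, float)
        return np.asarray(xdata, float)
    # Anything else (None, a dict, a custom object) is handed to the
    # user-defined fit function untouched, and no emptiness check is made.
    return xdata
```

With this logic, `prepare_xdata(None)` returns `None` unchanged, while `prepare_xdata([1, 2, 3])` returns a float64 array.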

xref #9893, #10076.

tylerjereddy

I'll let the resident optimize experts chime in with more meaningful comments, but I just did a quick pass through the line coverage data in Azure in case that is helpful.

Not sure it is crucial to add more tests for what sounds like a simple-enough adjustment though. Probably a good sign that no old tests have been modified, and just a new test added.

tylerjereddy · 2019-05-25T00:49:57Z

scipy/optimize/minpack.py

    else:
-        ydata = np.asarray(ydata)
+        ydata = np.asarray(ydata, float)


this code path is never hit by a test

But this was already the case before, wasn't it?

Yes, my point is that confidence & safety in code changes drop for lines that aren't covered by tests.

I'm not really sure how you would go about testing this, apart from trying to fit a dataset with a different dtype and seeing if it failed.

I modified another test to exercise that code path.
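
As a rough illustration of what the changed line does (plain NumPy only, not the actual test added in this PR; the variable names are made up for the example), the explicit `float` argument forces a float64 result regardless of the input dtype:

```python
import numpy as np

y_int = [1, 2, 3]
y_f32 = np.array([1.5, 2.5], dtype=np.float32)

before_int = np.asarray(y_int)         # dtype inferred: a platform integer type
after_int = np.asarray(y_int, float)   # forced to float64
before_f32 = np.asarray(y_f32)         # stays float32
after_f32 = np.asarray(y_f32, float)   # upcast to float64
```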

tylerjereddy · 2019-05-25T00:50:08Z

scipy/optimize/minpack.py

        else:
-            xdata = np.asarray(xdata)
+            xdata = np.asarray(xdata, float)


scipy/optimize/tests/test_minpack.py

andyfaff · 2019-05-27T01:39:28Z

scipy/optimize/minpack.py


    if isinstance(xdata, (list, tuple, np.ndarray)):
        # `xdata` is passed straight to the user-defined `f`, so allow
        # non-array_like `xdata`.
        if check_finite:
-            xdata = np.asarray_chkfinite(xdata)
+            xdata = np.asarray_chkfinite(xdata, float)


My comment doesn't belong here, but still...

The docstring for xdata needs to be updated.

I think it needs to be explicit.

xdata : array_like or None

It can really be any object, so I edited the docstring accordingly.

andyfaff · 2019-06-18T23:49:06Z

I'm happy with these changes. Could you add a line to the release notes outlining what's changed here?

anntzer · 2019-06-19T09:10:28Z

Is there any chance this could go to 1.3.1 (if there are plans for such a release) instead of 1.4, on the basis that this fixes a regression? If yes, there's no wiki page for 1.3.1 release notes.

andyfaff · 2019-06-19T09:42:20Z

I've marked it as having a 1.3.1 milestone, hopefully that's the mechanism for specifying that it should be added to such a release, if it occurs.
@tylerjereddy do you know if there is a plan for a 1.3.1 release?

anntzer · 2019-06-19T09:47:08Z

I'll just post the release notes entry here, you can move it to wherever appropriate :)

scipy.optimize fixes

The xdata parameter to scipy.optimize.curve_fit is no longer checked to be
non-empty and no longer cast to float if not an array-like. This fixes a
regression in scipy 1.3.0, and restores the ability to use any object as xdata.

tylerjereddy · 2019-06-24T02:31:47Z

@andyfaff It is possible to do 1.3.1 I think, although I'd suggest that we haven't quite reached the "critical mass" of backports to justify it yet. I'll put the backport label on as well just in case.

brandonrwin · 2023-08-24T20:40:03Z

This regression came back, and rather than fixing it, the docs were changed. See: #12632

tylerjereddy added the scipy.optimize label May 25, 2019

tylerjereddy reviewed May 25, 2019

andyfaff requested changes May 27, 2019

anntzer force-pushed the curve_fit-xdata branch 2 times, most recently from 7188e68 to 273029a Compare May 28, 2019 10:37

anntzer force-pushed the curve_fit-xdata branch from 273029a to 9272061 Compare May 28, 2019 11:56

andyfaff approved these changes Jun 18, 2019

andyfaff merged commit 3a37396 into scipy:master Jun 18, 2019

anntzer deleted the curve_fit-xdata branch June 19, 2019 09:06

andyfaff modified the milestones: 1.3.1, 1.4.0 Jun 19, 2019

tylerjereddy added the backport-candidate This fix should be ported by a maintainer to previous SciPy versions. label Jun 24, 2019

tylerjereddy mentioned this pull request Jul 21, 2019

MAINT: backports / preparation for SciPy 1.3.1 #10496

Merged

rgommers removed the backport-candidate This fix should be ported by a maintainer to previous SciPy versions. label Aug 7, 2019

tylerjereddy mentioned this pull request Sep 2, 2019

MAINT: SciPy 1.2.3 LTS backports / prep #10758

Merged

ilayn mentioned this pull request Mar 12, 2020

scipy.optimize.curve_fit fails with xdata that can't be converted to float, against docs #11662

Closed
