DOC: add missing details to linalg.lstsq docstring #18062

nilokr · 2020-12-23T15:34:00Z

It turns out that linalg.lstsq also minimizes the 2-norm of x when a is rank-deficient. I found that by searching the documentation for the LAPACK library, which is the current implementation of lstsq.

This PR rewrites the docstring for linalg.lstsq such that it contains details for all cases.

charris · 2020-12-27T17:42:19Z

numpy/linalg/linalg.py

+    ``a @ x = b``, where `a` is an `m`-by-`n` matrix and `b` is a `m` element 
+    vector. When the system is overdetermined (``m >= n`` and ``rank(a) = n``), 
+    a solution `x` that minimizes the Euclidean 2-norm ``||b-ax||`` is found.
+    Else, when the system is underdetermined (``m < n`` and ``rank(A) = m``) 


Maybe "If there are multiple minimizing solutions, the one with the smallest norm is returned". Note that the norm of the solution depends on column scaling (units) and choice of variables, so the smallest norm solution is seldom meaningful when the solution is not unique.

Sounds better - I've updated my commit to reflect the suggestion, thanks!

The reason I stumbled on this smallest norm matter was that I was specifically looking for that solution in a rank-deficient problem. I was intrigued I got exactly what I wanted for free, so I did some digging and found why :)

It turns out that lstsq also minimizes the 2-norm of x when a is rank-deficient. I found that by searching the documentation for the LAPACK library, which is the current implementation of lstsq (as of Dec 2020). Ref: https://www.netlib.org/lapack/lug/node27.html

charris · 2020-12-30T23:21:05Z

numpy/linalg/linalg.py

    ``a @ x = b``. The equation may be under-, well-, or over-determined
    (i.e., the number of linearly independent rows of `a` can be less than,
    equal to, or greater than its number of linearly independent columns).
    If `a` is square and of full rank, then `x` (but for round-off error)
    is the "exact" solution of the equation. Else, `x` minimizes the
-    Euclidean 2-norm :math:`|| b - a x ||`.
+    Euclidean 2-norm :math:`||b-ax||`. If there are multiple minimizing 


Keep spaces around - for readability.

I fixed that.

[ci skip]

charris · 2020-12-31T00:35:33Z

Thanks @krnilo .

matthew-brett · 2020-12-31T18:59:03Z

Worth pointing out that this is the behavior of the pseudoinverse?

github-actions bot added the 04 - Documentation label Dec 23, 2020

charris reviewed Dec 27, 2020

View reviewed changes

nilokr force-pushed the add-lstsq-details branch from 15f52e3 to c755c91 Compare December 30, 2020 19:09

charris reviewed Dec 30, 2020

View reviewed changes

STY: Add spaces around '-'.

860c8b8

[ci skip]

charris merged commit 31647f1 into numpy:master Dec 31, 2020

nilokr deleted the add-lstsq-details branch January 4, 2021 20:07

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

DOC: add missing details to linalg.lstsq docstring #18062

DOC: add missing details to linalg.lstsq docstring #18062

nilokr commented Dec 23, 2020

charris Dec 27, 2020

nilokr Dec 30, 2020

charris Dec 30, 2020

charris Dec 30, 2020

charris commented Dec 31, 2020

matthew-brett commented Dec 31, 2020

DOC: add missing details to linalg.lstsq docstring #18062

DOC: add missing details to linalg.lstsq docstring #18062

Conversation

nilokr commented Dec 23, 2020

charris Dec 27, 2020

Choose a reason for hiding this comment

nilokr Dec 30, 2020

Choose a reason for hiding this comment

charris Dec 30, 2020

Choose a reason for hiding this comment

charris Dec 30, 2020

Choose a reason for hiding this comment

charris commented Dec 31, 2020

matthew-brett commented Dec 31, 2020