Tweak corrcoef #7414

charris · 2016-03-13T21:03:52Z

Clip real and imag parts of result of corrcoef to the interval [-1, 1]. This doesn't really fix the issue with complex Hermitean results, nor is the diagonal guaranteed to be one, but it does take care of issue #7392. I put it here pending a decision of the proper course to take.

The input arrays are documented to have ndim <=2, so check for that and raise a ValueError on failure.

njsmith · 2016-03-13T23:47:54Z

numpy/lib/function_base.py

-        c[i,:] /= (d * d[i])
+    stddev = sqrt(d.real)
+    c /= stddev[:, None]
+    c /= stddev[None, :]


Have we checked that this is faster? (I would certainly hope that it is, but you never know :-).) Also, having read up a bit more I withdraw my worry about numerical stability, in case you want to do the multiply-by-inverse thing.

It's about the same. Using multiplication makes it 2% faster for the test problem in the original fix. I think most of the time is always going to be in cov. However, I prefer this version for its clarity, OTOH, you get two zero division warnings when one of the diagonals is zero (which can happen).

njsmith · 2016-03-13T23:54:22Z

Seems like a nice little improvement to me.

jaimefrio · 2016-03-14T00:45:48Z

numpy/lib/function_base.py

@@ -2517,6 +2523,12 @@ def corrcoef(x, y=None, rowvar=1, bias=np._NoValue, ddof=np._NoValue):

    Notes
    -----
+    Due to floating point rounding the resulting array may not be Hermitean,


Shouldn't that be "Hermit**_i**_an"?

Indeed. Thanks.

The non-nan elements of the result of corrcoef should satisfy the inequality abs(x) <= 1 and the non-nan elements of the diagonal should be exactly one. We can't guarantee those results due to roundoff, but clipping the real and imaginary parts to the interval [-1, 1] improves things to a small degree. Closes numpy#7392.

This doesn't actually test much, as we don't have any inputs where that was not already the case. But at least it is there and perhaps a fuzz test can be added at a later date.

charris · 2016-03-14T19:53:21Z

I'm going to merge this if there are no complaints.

njsmith · 2016-03-14T21:59:26Z

👍

Tweak corrcoef

ENH: Check array dimensionality in cov function.

fa107fe

The input arrays are documented to have ndim <=2, so check for that and raise a ValueError on failure.

charris added 06 - Regression component: numpy.lib labels Mar 13, 2016

charris added this to the 1.11.0 release milestone Mar 13, 2016

charris mentioned this pull request Mar 13, 2016

corrcoef started to escape [-1, 1] range #7392

Closed

njsmith reviewed Mar 13, 2016
View reviewed changes

jaimefrio reviewed Mar 14, 2016
View reviewed changes

charris added 2 commits March 13, 2016 20:06

TST: Check that result of corrcoef are clipped.

2043084

This doesn't actually test much, as we don't have any inputs where that was not already the case. But at least it is there and perhaps a fuzz test can be added at a later date.

charris force-pushed the tweak-corrcoef branch from 96fcbd2 to 2043084 Compare March 14, 2016 02:06

charris added a commit that referenced this pull request Mar 14, 2016

Merge pull request #7414 from charris/tweak-corrcoef

03e772a

Tweak corrcoef

charris merged commit 03e772a into numpy:master Mar 14, 2016

charris deleted the tweak-corrcoef branch March 14, 2016 22:18

charris mentioned this pull request Mar 14, 2016

Backport 7414, Bound result of corrcoef #7417

Merged

charris removed this from the 1.11.0 release milestone Mar 14, 2016

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Tweak corrcoef #7414

Tweak corrcoef #7414

charris commented Mar 13, 2016

njsmith Mar 13, 2016

charris Mar 13, 2016

njsmith commented Mar 13, 2016

jaimefrio Mar 14, 2016

charris Mar 14, 2016

charris commented Mar 14, 2016

njsmith commented Mar 14, 2016

Tweak corrcoef #7414

Tweak corrcoef #7414

Conversation

charris commented Mar 13, 2016

njsmith Mar 13, 2016

Choose a reason for hiding this comment

charris Mar 13, 2016

Choose a reason for hiding this comment

njsmith commented Mar 13, 2016

jaimefrio Mar 14, 2016

Choose a reason for hiding this comment

charris Mar 14, 2016

Choose a reason for hiding this comment

charris commented Mar 14, 2016

njsmith commented Mar 14, 2016