BUG: np.ma.mean and var should return scalar if no mask #8142

ahaldane · 2016-10-11T18:51:06Z

This is a followup to #7350 (first added in 1.12), and makes sure np.ma.mean and np.ma.var return a scalar if appropriate in the nomask branch of the code.

ahaldane · 2016-10-11T19:40:33Z

Hmm this led me to notice some other (old) problems with np.ma.var I might fix here at the same time. For example, currently (without this PR) var returns ndarrays instead of maskedarrays if mask is nomask. Don't merge yet.

mhvk · 2016-10-12T23:36:10Z

numpy/ma/core.py

+            cnt = self.count(axis=axis, **kwargs) - ddof
+            danom = self - self.mean(axis, dtype, keepdims=True)
+            if iscomplexobj(self):
+                danom = umath.absolute(danom) ** 2


Good to check for a complex object, but might as well do (there was a PR to make a ufunc out of this...):

danom = danom.real**2 + danom.imag**2

I'm going to leave this alone for now, as outside the scope of this PR. (this code is already present).

charris · 2016-10-13T17:30:43Z

@ahaldane Still planning more for this PR?

ahaldane · 2016-10-13T18:53:36Z

Yeah, I need a day or two more to decide what to do in var.

I have an old branch lying around with a "proper" overhaul of np.ma.mean and np.ma.var, but I think for this PR I only want to make as minimal changes to var as possible while still fixing #5769.

Fixes numpy#5769

ahaldane · 2016-10-13T23:25:53Z

I finished my changes here, it should be ready to go.

charris · 2016-10-14T16:07:02Z

numpy/ma/core.py

@@ -5057,7 +5057,7 @@ def mean(self, axis=None, dtype=None, out=None, keepdims=np._NoValue):

        if self._mask is nomask:
            result = super(MaskedArray, self).mean(axis=axis,
-                                                   dtype=dtype, **kwargs)
+                                                   dtype=dtype, **kwargs)[()]
        else:


I assume the super mean returns a 0-d array scalar for relevant cases. I note the following odd behavior of ordinary arrays that is new to me.

>>> np.float64(1) 1.0 >>> np.float64(1)[()] array(1.0) >>> np.float64(1)[()][()] 1.0

That is, the [()] construct flips the result back and forth between arrays and scalars.

For anyone coming here in future, the above is no longer (1.11.1) the case. [()] does not promote scalars to 0d-arrays

charris · 2016-10-14T16:10:30Z

I note some odd behavior of [()] indexing that has me a bit worried. For instance with ordinary mean

>>> np.mean([1,2,3])[()]
array(2.0)

ahaldane · 2016-10-14T16:16:59Z

What numpy version? I don't get that on 1.11, and I think there is a lot of numpy code depends on [()] returning a scalar.

ahaldane · 2016-10-14T16:26:46Z

Hmm, but actually maybe you are right. I just remembered #7267.

And actually grepping shows the empty tumple indexing is only used in a few places in python, and I know of only one place in C it is used.

charris · 2016-10-14T16:47:59Z

I'm running on my travel machine that has numpy 1.8. I can't upgrade until I figure out how to get Cython installed. Yeah, my mac environment is a mess, I don't use it often enough...

charris · 2016-10-14T16:50:11Z

I'm not particularly worried as long as the tests cover the relevant possibilities.

charris · 2016-10-14T17:17:28Z

If you can't replicate what I'm seeing on 1.8 let's give it a shot. Thanks Allan.

ahaldane · 2016-10-14T18:24:16Z

Thanks Chuck!

As discussed in my comments for issue numpy#8145, this patch adds the equal_nan argument to assert_array_compare(), and assert_allclose() passes the value it receives for the same argument through to assert_array_compare(). Closes numpy#8142.

ahaldane force-pushed the ma_mean_scalar branch from f40633e to 2f792b1 Compare October 11, 2016 19:28

charris added 00 - Bug component: numpy.ma masked arrays labels Oct 12, 2016

mhvk reviewed Oct 12, 2016

View reviewed changes

BUG: np.ma.mean and var should return scalar if no mask

d8d7c25

Fixes numpy#5769

ahaldane force-pushed the ma_mean_scalar branch from 2f792b1 to d8d7c25 Compare October 13, 2016 22:50

charris reviewed Oct 14, 2016

View reviewed changes

charris merged commit fa31422 into numpy:master Oct 14, 2016

eric-wieser mentioned this pull request Sep 28, 2017

wny does the numpy.ma (masked) array mean method have a "special case" return type #7833

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

BUG: np.ma.mean and var should return scalar if no mask #8142

BUG: np.ma.mean and var should return scalar if no mask #8142

ahaldane commented Oct 11, 2016

ahaldane commented Oct 11, 2016

mhvk Oct 12, 2016

ahaldane Oct 13, 2016

charris commented Oct 13, 2016

ahaldane commented Oct 13, 2016

ahaldane commented Oct 13, 2016

charris Oct 14, 2016

eric-wieser Feb 28, 2017 •

edited

Loading

charris commented Oct 14, 2016

ahaldane commented Oct 14, 2016

ahaldane commented Oct 14, 2016

charris commented Oct 14, 2016

charris commented Oct 14, 2016

charris commented Oct 14, 2016

ahaldane commented Oct 14, 2016

BUG: np.ma.mean and var should return scalar if no mask #8142

BUG: np.ma.mean and var should return scalar if no mask #8142

Conversation

ahaldane commented Oct 11, 2016

ahaldane commented Oct 11, 2016

mhvk Oct 12, 2016

Choose a reason for hiding this comment

ahaldane Oct 13, 2016

Choose a reason for hiding this comment

charris commented Oct 13, 2016

ahaldane commented Oct 13, 2016

ahaldane commented Oct 13, 2016

charris Oct 14, 2016

Choose a reason for hiding this comment

eric-wieser Feb 28, 2017 • edited Loading

Choose a reason for hiding this comment

charris commented Oct 14, 2016

ahaldane commented Oct 14, 2016

ahaldane commented Oct 14, 2016

charris commented Oct 14, 2016

charris commented Oct 14, 2016

charris commented Oct 14, 2016

ahaldane commented Oct 14, 2016

eric-wieser Feb 28, 2017 •

edited

Loading