BUG: SparseArray numeric ops misc fixes #12910

sinhrks · 2016-04-17T05:10:55Z

no existing issue
tests added / passed
passes git diff upstream/master | flake8 --diff
whatsnew entry

Fixed following 3 issues occurred on the current master.

1. addition ignores rhs `fill_value`

pd.SparseArray([1., 1.]) + pd.SparseArray([1., 0.], fill_value=0.)
# [2.0, nan]
# Fill: nan
# IntIndex
# Indices: array([0], dtype=int32)

Expected:
# [2.0, 1.0]

2. mod raises `AttributeError`

pd.SparseArray([1, 1]) % pd.SparseArray([1, np.nan])
# AttributeError: 'module' object has no attribute 'sparse_nanmod'

3. pow outputs incorrect result wiht `1.0 ** np.nan`

pd.SparseArray([1., 1.]) ** pd.SparseArray([1., np.nan])
# [1.0, nan]
# Fill: nan
# IntIndex
# Indices: array([0], dtype=int32)

Expected:
# [1.0, 1.0]

# NumPy result
np.array([1., 1.]) ** np.array([1, np.nan])
# array([ 1.,  1.])

jreback · 2016-04-17T14:18:51Z

pandas/sparse/array.py

@@ -59,7 +59,12 @@ def wrapper(self, other):


 def _sparse_array_op(left, right, op, name):
-    if np.isnan(left.fill_value):
+    if (np.isnan(left.fill_value) and np.isnan(right.fill_value) and


so this begs the question of why we don't just always pass the fill values to the ufuncs (e.g. sparse_sub etc), which can then decide (based on np.isnan or (isnull better)) on left and/or rhs whether to use it?

Yes removing nanop simplifies the logic. Because data is sparse, it shouldn't affect to performance in most cases.

do you want to do that in this PR?

Yes, let me try.

jreback · 2016-04-17T20:42:26Z

whoosh you blew away lots of code. must have been work-arounds built in there for maybe an old cython or something. Just want to be sure (since tests were removed) everything still working (as may not be testing some of the numeric ops as much, though did see you added some tests)

sinhrks · 2016-04-17T20:55:18Z

The removed test calls _nan funcs which is no longer exists, and remaining logic is tested with _op_tests below. I'm willing to add more tests if you have anything.

jreback · 2016-04-18T17:13:20Z

thanks!

sinhrks added Bug Numeric Operations Arithmetic, Comparison, and Logical operations Sparse Sparse Data Type labels Apr 17, 2016

sinhrks added this to the 0.18.1 milestone Apr 17, 2016

sinhrks changed the title ~~BUG: SparseArray misc fixes~~ BUG: SparseArray numeric ops misc fixes Apr 17, 2016

jreback reviewed Apr 17, 2016
View reviewed changes

BUG: SparseArray misc fixes

d63da47

sinhrks force-pushed the sparse_ops branch from 3e33ea8 to d63da47 Compare April 17, 2016 14:40

jreback closed this in 3cc4198 Apr 18, 2016

sinhrks deleted the sparse_ops branch April 18, 2016 18:16

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

BUG: SparseArray numeric ops misc fixes #12910

BUG: SparseArray numeric ops misc fixes #12910

sinhrks commented Apr 17, 2016

jreback Apr 17, 2016

sinhrks Apr 17, 2016

jreback Apr 17, 2016

sinhrks Apr 17, 2016

jreback commented Apr 17, 2016

sinhrks commented Apr 17, 2016

jreback commented Apr 18, 2016

BUG: SparseArray numeric ops misc fixes #12910

BUG: SparseArray numeric ops misc fixes #12910

Conversation

sinhrks commented Apr 17, 2016

1. addition ignores rhs fill_value

2. mod raises AttributeError

3. pow outputs incorrect result wiht 1.0 ** np.nan

jreback Apr 17, 2016

Choose a reason for hiding this comment

sinhrks Apr 17, 2016

Choose a reason for hiding this comment

jreback Apr 17, 2016

Choose a reason for hiding this comment

sinhrks Apr 17, 2016

Choose a reason for hiding this comment

jreback commented Apr 17, 2016

sinhrks commented Apr 17, 2016

jreback commented Apr 18, 2016

1. addition ignores rhs `fill_value`

2. mod raises `AttributeError`

3. pow outputs incorrect result wiht `1.0 ** np.nan`