BUG: loosen kwargs requirements in ediff1d #12713

mattip · 2019-01-10T10:25:53Z

Fixes #12711 and compatibility with 1.15. In fixing #11490, checks were added to ediff1d's to_begin and to_end kwargs in PR #11805. The checks are now too stringent, and cause a regression from previous versions. This PR relies on np.asanyarray(values, dtype=dtype_req) to raise if the values cannot be cast. Unfortunately, asanyarray uses unsafe casting, so an additional check that the values did not change is needed.

Also note the exception type in tests was adjusted to ValueError, since casting np.array([1, 2, 3], dtype='int64') to 'int32' is fine, but np.array([1, 1<<35, 3], dtype='int64') is not.

tylerjereddy

Looks useful / sensible at first glance, but you've probably noticed that the Azure failures are related to the new unit test added here.

llimeht · 2019-01-10T12:56:29Z

This solves the failing tests I had with pbcore, so +1 from that perspective. Thanks!

seberg · 2019-01-10T14:05:55Z

@llimeht do you guys rely on the np.array input also being cast lazily unsafely? Or only scalars, or scalars and lists? All three of those are a bit different... If we want to only be more forgiving with scalars it may actually be pretty reasonable to use:

# Hack relying on our "scalars are worthless" casting behaviour :)
to_begin = np.asarray(to_begin)
np.result_type(to_begin, arr.dtype) == arr.dtype

A good quick middle way may be to use same_kind casting here. That keeps the fix for the surprises with np.nan while not being overly strict. It would be a bit stricter than the current approach (since this one actually allows float arrays if values match), but that seems fine overall.

seberg · 2019-01-10T14:40:07Z

Well, I guess this isn't a bad middle way, we can make it more strict/nicer at some point later. Will keep a bit in case someone has a better idea, but we should probably just go with it and backport.

mattip · 2019-01-10T15:16:39Z

The Azure tests are from 32 bit platforms where instead of a ValueError the call to np.asanyarray(values, dtype='int32') raises an OverflowError since 1<<35 does not fit into an int on that platform. I will change the tests to fit into 32 bit integers. I think it is out of scope to change the asanyarray exception or try to recast one exception as another inside ediff1d.

numpy/lib/arraysetops.py

tylerjereddy · 2019-01-10T19:16:12Z

This is a pure Python bug fix with 100 % patch diff coverage assessed by codecov and an approving backport suggestion from a non-BIDS core dev so in it goes with tentative 1.16.1 milestone tag and tag to remind us about the release note eventually needed.

tylerjereddy · 2019-01-10T19:18:18Z

Thanks @mattip

llimeht · 2019-01-10T22:49:07Z

pbcore is also happy with this version. Thanks!

stuartarchibald · 2019-03-06T11:52:34Z

This patch will cause an exception to be raised in the case of a NaN in the to_begin/to_end arrays. e.g.

dt=np.float64
np.ediff1d(np.arange(10, dtype=dt), to_begin=np.array([1, np.nan], dtype=dt))

because of lines like this with an equality test of array == array: https://github.com/numpy/numpy/pull/12713/files#diff-9e06c7b7bcc8e8153f5c2df57283b4cbR109

Numba unit tests caught it when updating to support 1.16. numba/numba#3826

Further... the error message is perhaps a bit strange in the context of e.g.:

np.ediff1d(np.arange(10, dtype=np.float32), to_begin=np.array([np.finfo(np.float64).max], dtype=np.float64))

which produces

ValueError: cannot convert 'to_begin' to array with dtype 'dtype('float32')' as required for input ary

which is not actually the problem, because technically the array np.array([np.finfo(np.float64).max], dtype=np.float64)) can be converted to a np.float32 dtype:

In [18]: np.array([np.finfo(np.float64).max], dtype=np.float64).astype(np.float32)                                              
Out[18]: array([inf], dtype=float32)

it's just that it's not "safe" under the given behavioural assumptions?

Happy to open a new ticket/patch for either of these if considered a problem? Else Numba will be patched to match.

Thanks!

mattip added 00 - Bug component: numpy.lib labels Jan 10, 2019

tylerjereddy reviewed Jan 10, 2019

View reviewed changes

llimeht mentioned this pull request Jan 10, 2019

test failure with numpy 1.16: dtype of to_begin must be compatible with input ary PacificBiosciences/pbcore#120

Closed

mattip force-pushed the gh-12711 branch from f15cb6c to 07004af Compare January 10, 2019 15:22

tylerjereddy reviewed Jan 10, 2019

View reviewed changes

numpy/lib/arraysetops.py Outdated Show resolved Hide resolved

BUG: loosen kwargs requirements in ediff1d

c088383

mattip force-pushed the gh-12711 branch from 07004af to c088383 Compare January 10, 2019 18:10

tylerjereddy added this to the 1.16.1 release milestone Jan 10, 2019

tylerjereddy added the 56 - Needs Release Note. Needs an entry in doc/release/upcoming_changes label Jan 10, 2019

tylerjereddy merged commit ce779dc into numpy:master Jan 10, 2019

charris added the 09 - Backport-Candidate PRs tagged should be backported label Jan 19, 2019

charris mentioned this pull request Jan 20, 2019

BUG: loosen kwargs requirements in ediff1d #12808

Merged

charris removed the 09 - Backport-Candidate PRs tagged should be backported label Jan 20, 2019

charris removed this from the 1.16.1 release milestone Jan 20, 2019

stuartarchibald mentioned this pull request Mar 8, 2019

ediff1d with np.nan in to_begin/to_end, behaviour and error messages #13103

Closed

mattip deleted the gh-12711 branch May 20, 2019 17:01

freelancing-solutions mentioned this pull request May 9, 2021

Contiguity of result not consistent with numpy freelancing-solutions/gcp-database-as-a-service-stock-markets#321

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

BUG: loosen kwargs requirements in ediff1d #12713

BUG: loosen kwargs requirements in ediff1d #12713

mattip commented Jan 10, 2019

tylerjereddy left a comment

llimeht commented Jan 10, 2019

seberg commented Jan 10, 2019

seberg commented Jan 10, 2019

mattip commented Jan 10, 2019

tylerjereddy commented Jan 10, 2019

tylerjereddy commented Jan 10, 2019

llimeht commented Jan 10, 2019

stuartarchibald commented Mar 6, 2019

BUG: loosen kwargs requirements in ediff1d #12713

BUG: loosen kwargs requirements in ediff1d #12713

Conversation

mattip commented Jan 10, 2019

tylerjereddy left a comment

Choose a reason for hiding this comment

llimeht commented Jan 10, 2019

seberg commented Jan 10, 2019

seberg commented Jan 10, 2019

mattip commented Jan 10, 2019

tylerjereddy commented Jan 10, 2019

tylerjereddy commented Jan 10, 2019

llimeht commented Jan 10, 2019

stuartarchibald commented Mar 6, 2019