Avoid Index DeprecationWarning in Series getitem #31361

TomAugspurger · 2020-01-27T20:04:03Z

cc @jorisvandenbossche. The alternative is to use a warnings filter inside SingleBlockManger.get_slice to filter the warning, but I think avoiding the warning in the first place is a bit nicer.

xref pandas-dev#30867

jorisvandenbossche

Thanks, looks good!

pandas/tests/series/indexing/test_indexing.py

pandas/core/internals/managers.py

TomAugspurger · 2020-01-27T20:43:49Z

Oh, I got who calls what backwards here... Will need to re-think things...

We don't want to change all the index subclasses __getitem__s to do the _getitem_deprecate_nd version that maybe silences warnings.

TomAugspurger · 2020-01-27T21:56:34Z

Changed to using a warnings filter. Note that this has a perf impact for slicing a Series. ~25% slower

Master

In [9]: s = pd.Series(range(10000))

In [10]: %timeit s[:5]
43.5 µs ± 2.15 µs per loop (mean ± std. dev. of 7 runs, 10000 loops each)

In [11]: t = pd.Series([], dtype=float)

In [12]: %timeit t[:]
47.8 µs ± 2.24 µs per loop (mean ± std. dev. of 7 runs, 10000 loops each)

This PR

In [7]: s = pd.Series(range(10000))

In [8]: %timeit s[:5]
55.6 µs ± 1.69 µs per loop (mean ± std. dev. of 7 runs, 10000 loops each)

In [9]: t = pd.Series([], dtype=float)

In [10]: %timeit t[:]
64.2 µs ± 2.27 µs per loop (mean ± std. dev. of 7 runs, 10000 loops each)

Is that slowdown acceptable? Right now, I don't think it is, given that

This is just a DeprecationWarning
We'll be deprecating this in the future anyway.

jbrockmendel · 2020-01-27T22:54:43Z

I think there are 2 ways we get to get_slice with non-slice. One of them is L931, and can be avoided be extracting the single-item list after potentially warning in #31333. The other is on L940 and is explicltly an "mpl hackaround", so we can maybe-unpack there.

TomAugspurger · 2020-01-28T14:18:04Z

Thanks @jbrockmendel, that did it. We're able to apply the filter only when using the mpl compat path.

master

In [2]: s = pd.Series(range(10000))

In [3]: %timeit s[:, None]
36.2 µs ± 293 ns per loop (mean ± std. dev. of 7 runs, 10000 loops each)

In [4]: t = pd.Series([], dtype=float)

In [5]: %timeit t[:]
52 µs ± 3.5 µs per loop (mean ± std. dev. of 7 runs, 10000 loops each)

In [6]: %timeit s[:5]
45 µs ± 2.27 µs per loop (mean ± std. dev. of 7 runs, 10000 loops each)

PR

In [4]: s = pd.Series(range(10000))

In [5]: %timeit s[:, None]
50.2 µs ± 2.56 µs per loop (mean ± std. dev. of 7 runs, 10000 loops each)

In [6]: t = pd.Series([], dtype=float)

In [7]: %timeit t[:]
48.4 µs ± 2 µs per loop (mean ± std. dev. of 7 runs, 10000 loops each)

In [8]: %timeit s[:5]
44.3 µs ± 2.52 µs per loop (mean ± std. dev. of 7 runs, 10000 loops each)

So the first case is the only one with a slowdown, which I think is acceptable given that we want to deprecate that behavior anyway.

TomAugspurger · 2020-01-28T14:56:11Z

Had to xfail a test asserting that this produced a warning. Will want to restore that with one from Series.__getitem__ I think, but not for 1.0

pandas/tests/series/test_timeseries.py

jbrockmendel · 2020-01-28T15:47:56Z

pandas/core/internals/managers.py

@@ -1505,7 +1505,7 @@ def get_slice(self, slobj, axis=0):
        if axis >= self.ndim:
            raise IndexError("Requested axis not found in manager")

-        return type(self)(self._block._slice(slobj), self.index[slobj], fastpath=True)
+        return type(self)(self._block._slice(slobj), self.index[slobj], fastpath=True,)


i think we dont want the trailing comma here

Worth rerunning CI over?

i can change it in my next "assorted cleanups" PR

jbrockmendel · 2020-01-28T15:48:43Z

So the first case is the only one with a slowdown, which I think is acceptable given that we want to deprecate that behavior anyway.

Sounds good

…s getitem

…31403) Co-authored-by: Tom Augspurger <TomAugspurger@users.noreply.github.com>

Avoid Index DeprecationWarning in Series getitem

4b232a2

xref pandas-dev#30867

TomAugspurger added this to the 1.0.0 milestone Jan 27, 2020

TomAugspurger added Indexing Related to indexing on series/frames, not to indexes themselves Warnings Warnings that appear or should be added to pandas labels Jan 27, 2020

jorisvandenbossche reviewed Jan 27, 2020

View reviewed changes

pandas/tests/series/indexing/test_indexing.py Outdated Show resolved Hide resolved

clarify

3a430b3

jbrockmendel reviewed Jan 27, 2020

View reviewed changes

pandas/core/internals/managers.py Outdated Show resolved Hide resolved

TomAugspurger added 2 commits January 27, 2020 15:40

Merge remote-tracking branch 'upstream/master' into ndim-indexing-series

e85c88f

filter

6a3acb2

TomAugspurger added 2 commits January 28, 2020 08:14

perf

fc14a35

comment

bd17044

comment

90b1e60

jorisvandenbossche reviewed Jan 28, 2020

View reviewed changes

pandas/tests/series/test_timeseries.py Outdated Show resolved Hide resolved

unxfail

4762513

jorisvandenbossche approved these changes Jan 28, 2020

View reviewed changes

jbrockmendel reviewed Jan 28, 2020

View reviewed changes

TomAugspurger merged commit 0575149 into pandas-dev:master Jan 28, 2020

TomAugspurger deleted the ndim-indexing-series branch January 28, 2020 20:55

meeseeksmachine pushed a commit to meeseeksmachine/pandas that referenced this pull request Jan 28, 2020

Backport PR pandas-dev#31361: Avoid Index DeprecationWarning in Serie…

6d212ec

…s getitem

meeseeksmachine mentioned this pull request Jan 28, 2020

Backport PR #31361 on branch 1.0.x (Avoid Index DeprecationWarning in Series getitem) #31403

Merged

TomAugspurger added a commit that referenced this pull request Jan 28, 2020

Backport PR #31361: Avoid Index DeprecationWarning in Series getitem (#…

a08c2f9

…31403) Co-authored-by: Tom Augspurger <TomAugspurger@users.noreply.github.com>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Avoid Index DeprecationWarning in Series getitem #31361

Avoid Index DeprecationWarning in Series getitem #31361

TomAugspurger commented Jan 27, 2020

jorisvandenbossche left a comment

TomAugspurger commented Jan 27, 2020

TomAugspurger commented Jan 27, 2020

jbrockmendel commented Jan 27, 2020 •

edited

TomAugspurger commented Jan 28, 2020 •

edited

TomAugspurger commented Jan 28, 2020

jbrockmendel Jan 28, 2020

TomAugspurger Jan 28, 2020

jbrockmendel Jan 28, 2020

jbrockmendel commented Jan 28, 2020

Avoid Index DeprecationWarning in Series getitem #31361

Avoid Index DeprecationWarning in Series getitem #31361

Conversation

TomAugspurger commented Jan 27, 2020

jorisvandenbossche left a comment

Choose a reason for hiding this comment

TomAugspurger commented Jan 27, 2020

TomAugspurger commented Jan 27, 2020

jbrockmendel commented Jan 27, 2020 • edited

TomAugspurger commented Jan 28, 2020 • edited

TomAugspurger commented Jan 28, 2020

jbrockmendel Jan 28, 2020

Choose a reason for hiding this comment

TomAugspurger Jan 28, 2020

Choose a reason for hiding this comment

jbrockmendel Jan 28, 2020

Choose a reason for hiding this comment

jbrockmendel commented Jan 28, 2020

jbrockmendel commented Jan 27, 2020 •

edited

TomAugspurger commented Jan 28, 2020 •

edited