REG: DataFrame.shift with axis=1 and CategoricalIndex columns #38504

jbrockmendel · 2020-12-15T16:33:25Z

closes REGR: DataFrame.shift(axis=1) raises TypeError when columns is CategoricalIndex #38434
tests added / passed
passes black pandas
passes git diff upstream/master -u -- "*.py" | flake8 --diff
whatsnew entry

topper-123 · 2020-12-15T18:57:23Z

Thanks for taking this.

This PR should IMO be part of 1.2, since it’s a regression since 1.1.

topper-123 · 2020-12-15T19:01:18Z

pandas/core/frame.py

            if periods > 0:
                result = self.iloc[:, :-periods]
                for col in range(min(ncols, abs(periods))):
                    # TODO(EA2D): doing this in a loop unnecessary with 2D EAs
                    # Define filler inside loop so we get a copy
                    filler = self.iloc[:, 0].shift(len(self))
-                    result.insert(0, col, filler, allow_duplicates=True)
+                    result.insert(0, label, filler, allow_duplicates=True)


Does this work correctly for periods > 1? Looks like it inserts the same label in all locations, where Id think it should insert self.columns[col]?

This should be benign because we set result.columns directly below. ill add a test to be on the safe side

topper-123 · 2020-12-15T19:01:37Z

pandas/core/frame.py

            else:
                result = self.iloc[:, -periods:]
                for col in range(min(ncols, abs(periods))):
                    # Define filler inside loop so we get a copy
                    filler = self.iloc[:, -1].shift(len(self))
                    result.insert(
-                        len(result.columns), col, filler, allow_duplicates=True
+                        len(result.columns), label, filler, allow_duplicates=True


Same comment as above.

topper-123 · 2020-12-15T19:06:23Z

pandas/tests/frame/methods/test_shift.py

+        # GH#38434
+        ci = CategoricalIndex(["a", "b"])
+        df = DataFrame([[1, 2], [3, 4]], index=ci, columns=ci)
+        result = df.shift(axis=1)


Can you test a 3-column dataframe with periods=2 also?

simonjayhawkins · 2020-12-15T19:58:47Z

This PR should IMO be part of 1.2, since it’s a regression since 1.1.

and whatsnew probably not necessary

…g-insert-2

jbrockmendel · 2020-12-15T23:45:37Z

added test case with periods=2, removed whatsnew

…g-insert-2

jreback · 2020-12-17T13:47:32Z

@meeseeksdev backport 1.2.x

…tegoricalIndex columns

…ndex columns (#38555) Co-authored-by: jbrockmendel <jbrockmendel@gmail.com>

…-dev#38504)

REG: DataFrame.shift with axis=1 and CategoricalIndex columns

cead7e6

topper-123 added Bug Algos Non-arithmetic algos: value_counts, factorize, sorting, isin, clip, shift, diff Categorical Categorical Data Type labels Dec 15, 2020

topper-123 added this to the 1.2 milestone Dec 15, 2020

topper-123 requested changes Dec 15, 2020

View reviewed changes

jbrockmendel added 2 commits December 15, 2020 15:42

Merge branch 'master' of https://github.com/pandas-dev/pandas into bu…

0c7ec6e

…g-insert-2

test with periods=2, remove whatsnew

3c07dcc

Merge branch 'master' of https://github.com/pandas-dev/pandas into bu…

f725bef

…g-insert-2

jreback added the Regression Functionality that used to work in a prior pandas version label Dec 16, 2020

jbrockmendel added 2 commits December 15, 2020 20:05

Merge branch 'master' of https://github.com/pandas-dev/pandas into bu…

cadbe64

…g-insert-2

Merge branch 'master' of https://github.com/pandas-dev/pandas into bu…

9bb37bb

…g-insert-2

jbrockmendel mentioned this pull request Dec 17, 2020

BUG: require arraylike in infer_dtype_from_array #38473

Merged

5 tasks

jreback merged commit d08f12c into pandas-dev:master Dec 17, 2020

This comment has been minimized.

Sign in to view

lumberbot-app bot added the Still Needs Manual Backport label Dec 17, 2020

jbrockmendel deleted the bug-insert-2 branch December 17, 2020 15:44

simonjayhawkins pushed a commit to simonjayhawkins/pandas that referenced this pull request Dec 18, 2020

Backport PR pandas-dev#38504: REG: DataFrame.shift with axis=1 and Ca…

f9f6bc7

…tegoricalIndex columns

simonjayhawkins mentioned this pull request Dec 18, 2020

Backport PR #38504: REG: DataFrame.shift with axis=1 and CategoricalIndex columns #38555

Merged

simonjayhawkins removed the Still Needs Manual Backport label Dec 18, 2020

simonjayhawkins added a commit that referenced this pull request Dec 18, 2020

Backport PR #38504: REG: DataFrame.shift with axis=1 and CategoricalI…

b381e4f

…ndex columns (#38555) Co-authored-by: jbrockmendel <jbrockmendel@gmail.com>

luckyvs1 pushed a commit to luckyvs1/pandas that referenced this pull request Jan 20, 2021

REG: DataFrame.shift with axis=1 and CategoricalIndex columns (pandas…

ddf31d4

…-dev#38504)

simonjayhawkins mentioned this pull request May 17, 2022

BUG: DataFrame.shift shows different behavior for axis=1 when freq is specified #47039

Closed

3 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

REG: DataFrame.shift with axis=1 and CategoricalIndex columns #38504

REG: DataFrame.shift with axis=1 and CategoricalIndex columns #38504

jbrockmendel commented Dec 15, 2020

topper-123 commented Dec 15, 2020

topper-123 Dec 15, 2020

jbrockmendel Dec 15, 2020

topper-123 Dec 15, 2020

topper-123 Dec 15, 2020

simonjayhawkins commented Dec 15, 2020

jbrockmendel commented Dec 15, 2020

jreback commented Dec 17, 2020

This comment has been minimized.

REG: DataFrame.shift with axis=1 and CategoricalIndex columns #38504

REG: DataFrame.shift with axis=1 and CategoricalIndex columns #38504

Conversation

jbrockmendel commented Dec 15, 2020

topper-123 commented Dec 15, 2020

topper-123 Dec 15, 2020

Choose a reason for hiding this comment

jbrockmendel Dec 15, 2020

Choose a reason for hiding this comment

topper-123 Dec 15, 2020

Choose a reason for hiding this comment

topper-123 Dec 15, 2020

Choose a reason for hiding this comment

simonjayhawkins commented Dec 15, 2020

jbrockmendel commented Dec 15, 2020

jreback commented Dec 17, 2020

This comment has been minimized.