DataFrame.replace only replaces the first occurrence of replacement pattern #6689

fonnesbeck · 2014-03-22T22:11:29Z

This is best explained by a screenshot:

I'm running a pretty recent build of Pandas ('0.13.1-213-gc174c3d') on Python 2.7.5 on OS X 0.9.2.

dsm054 · 2014-03-22T22:51:36Z

Feels like a changing-dtype issue, or maybe a mixup of values and keys (True == 1 and False == 0):

>>> df = pd.DataFrame({"a": [True, False, True]})
>>> df
       a
0   True
1  False
2   True

[3 rows x 1 columns]
>>> df.replace({True: "Y", False: "N"})
   a
0  Y
1  N
2  Y

[3 rows x 1 columns]
>>> df.replace({"a": {True: "Y", False: "N"}})
      a
0     N
1     Y
2  True

[3 rows x 1 columns]
>>> df.astype(object).replace({"a": {True: "Y", False: "N"}})
   a
0  Y
1  N
2  Y

[3 rows x 1 columns]

but I've never fully understood the intended semantics of replace.

dsm054 · 2014-03-22T22:57:36Z

Yeah, it looks like the keys of the inner dict are being interpreted as indices to match, not values, which explains the Y/N, N/Y swap above. For example:

>>> df = pd.DataFrame({"a": [True, False, True]})
>>> df.replace({"a": {0: "zero", 1: "one", 2: "two"}})
      a
0  zero
1   one
2   two

[3 rows x 1 columns]

cpcloud · 2014-03-23T00:09:59Z

Are you guys running master? I fixed a similar bug somewhat recently.

dsm054 · 2014-03-23T00:59:03Z

@cpcloud: I am, at least.

cpcloud · 2014-03-23T02:47:46Z

Okay thanks I'll take a look. Looks like a dtype issue. @fonnesbeck thanks for the report.

cpcloud · 2014-04-06T16:07:48Z

@jreback Is there a way to select and set a block by name? Something like df._data['a'] = df._data['a'].some_method()?

cpcloud · 2014-04-06T16:11:59Z

i want to operate on a block inplace or out of place but see the changes in the whole block manager

cpcloud · 2014-04-06T16:19:48Z

Nevermind.

fonnesbeck changed the title ~~DataFrame replace only replaces the first occurrence of replacement pattern~~ DataFrame.replace only replaces the first occurrence of replacement pattern Mar 22, 2014

dsm054 mentioned this issue Mar 24, 2014

col.replace(dict) takes too much memory #6697

Open

cpcloud self-assigned this Mar 28, 2014

cpcloud added this to the 0.14.0 milestone Mar 28, 2014

cpcloud added Bug labels Mar 28, 2014

cpcloud mentioned this issue Apr 6, 2014

BUG: fix replace bug where different dtypes in a nested dict would only replace the first value #6820

Merged

cpcloud closed this as completed in #6820 Apr 8, 2014

wesm unassigned cpcloud Oct 12, 2016

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

DataFrame.replace only replaces the first occurrence of replacement pattern #6689

DataFrame.replace only replaces the first occurrence of replacement pattern #6689

fonnesbeck commented Mar 22, 2014

dsm054 commented Mar 22, 2014

dsm054 commented Mar 22, 2014

cpcloud commented Mar 23, 2014

dsm054 commented Mar 23, 2014

cpcloud commented Mar 23, 2014

cpcloud commented Apr 6, 2014

cpcloud commented Apr 6, 2014

cpcloud commented Apr 6, 2014

DataFrame.replace only replaces the first occurrence of replacement pattern #6689

DataFrame.replace only replaces the first occurrence of replacement pattern #6689

Comments

fonnesbeck commented Mar 22, 2014

dsm054 commented Mar 22, 2014

dsm054 commented Mar 22, 2014

cpcloud commented Mar 23, 2014

dsm054 commented Mar 23, 2014

cpcloud commented Mar 23, 2014

cpcloud commented Apr 6, 2014

cpcloud commented Apr 6, 2014

cpcloud commented Apr 6, 2014