BUG: replace not converting dtypes #3907

jreback · 2013-06-14T18:46:53Z

I believe replace should do a convert_objects(copy=False) after replacement to provide dtype soft-conversion

In [1]: df = DataFrame([['foo','bar','bah'],['bar','foo','bah']])

In [2]: df
Out[2]: 
     0    1    2
0  foo  bar  bah
1  bar  foo  bah

In [3]: m = { 'foo' : 1, 'bar' : 2, 'bah' : 3 }

In [5]: df.replace(m)
Out[5]: 
   0  1  2
0  1  2  3
1  2  1  3

In [6]: df.replace(m).dtypes
Out[6]: 
0    object
1    object
2    object
dtype: object

In [8]: df.replace(m).convert_objects().dtypes
Out[8]: 
0    int64
1    int64
2    int64
dtype: object
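
For readers on current pandas, a minimal sketch of the same soft conversion. This is not from the original report: it assumes a release where `convert_objects` is no longer available and uses `infer_objects()` in its place. `dtype=object` is passed explicitly as an assumption, so the frame starts out as plain object columns as it did in 2013.

```python
import pandas as pd

# Object-dtype frame of strings, mirroring the report above.
df = pd.DataFrame([["foo", "bar", "bah"], ["bar", "foo", "bah"]], dtype=object)
m = {"foo": 1, "bar": 2, "bah": 3}

# Replace each string by its mapped integer.
replaced = df.replace(m)

# Soft-convert any columns that are still object but hold only integers.
converted = replaced.infer_objects()

print(converted.dtypes)
```

Whether `replace` alone already returns integer columns depends on the pandas version; calling `infer_objects()` afterwards makes the result independent of that.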

jreback · 2013-06-14T18:47:02Z

@cpcloud for you!

cpcloud · 2013-06-14T19:12:57Z

there's an infer_types param that does this already...should it default to True?

jreback · 2013-06-14T19:16:40Z

yes...didn't realize it was False default (and I would put copy=False)

cpcloud · 2013-06-14T19:17:53Z

i thought copy didn't do anything in internals.py, changed recently?

cpcloud · 2013-06-14T19:19:58Z

yep docs said true but i think forgot to change when i finally submitted the original pr

cpcloud · 2013-06-14T19:21:01Z

ah i c copy is doing something ...

jreback · 2013-06-14T19:25:50Z

actually this is tricky....because if say nothing changes then you don't need to copy, otherwise I guess you do.....maybe just leave copy==True (if needs conversion, IOW there are ANY object blocks);
actually this could be an option to convert, e.g. convert='needed' ?

jreback · 2013-06-14T19:26:15Z

Because I believe you copy a block in replace (if needed), so no point in copying twice...

jreback · 2013-06-14T19:26:51Z

then again, say you don't actually replace anything....should STILL copy I guess

cpcloud · 2013-06-14T19:29:09Z

i copy if inplace=False (the default)...

jreback · 2013-06-14T19:30:52Z

ok...then its easy, I would drop infer_types; and always convert_objects, passing copy=not inplace

cpcloud · 2013-06-14T19:31:35Z

sounds good

cpcloud · 2013-06-14T19:34:40Z

btw is there an iterable_but_not_string(obj) function somewhere?

cpcloud · 2013-06-14T19:35:32Z

easy enough for me to put one in common.py if not...

cpcloud · 2013-06-14T19:35:39Z

would do in a separate PR

jreback · 2013-06-14T19:38:05Z

com.is_list_like

cpcloud · 2013-06-14T19:38:23Z

thanks

cpcloud · 2013-06-14T19:47:58Z

problem is that now convert clobbers strings in object arrays with nans when u originally replace nans with numbers..

cpcloud · 2013-06-14T19:48:51Z

i.e., mixed_frame.fillna(value=10.0) and mixed_frame.replace(nan, 10.0) should be equivalent
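
A small sketch of the equivalence described here, assuming a current pandas release. The frame below is a hypothetical stand-in for `mixed_frame` (one float column, one object column of strings, each with a missing value); it is not the frame used in the pandas test suite.

```python
import numpy as np
import pandas as pd

# Hypothetical stand-in for mixed_frame: numeric and string columns with NaNs.
mixed_frame = pd.DataFrame({
    "num": [1.0, np.nan, 3.0],
    "txt": pd.Series(["a", np.nan, "c"], dtype=object),
})

filled = mixed_frame.fillna(value=10.0)
replaced = mixed_frame.replace(np.nan, 10.0)

# Both routes should fill only the missing cells and leave the strings intact.
print(filled.equals(replaced))
print(replaced["txt"].tolist())
```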

cpcloud · 2013-06-14T20:54:04Z

nvm fixed it

cpcloud · 2013-06-14T21:48:34Z

@jreback i see that is_list_like has a check for hasattr(arg, 'len'). Should that be hasattr(arg, '__len__')?

cpcloud · 2013-06-14T21:49:50Z

also _is_sequence looks redundant with is_list_like should I remove that as well?

jreback · 2013-06-14T21:50:11Z

in theory yes, but I'll bet it works anyhow (and python may do some translation on that)

jreback · 2013-06-14T21:50:43Z

check where things are used....I think _is_sequence is slightly different

cpcloud · 2013-06-14T21:51:15Z

hm, hasattr([], 'len') != hasattr([], '__len__') so prolly should change that one
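
To make the difference concrete, a quick check in plain Python. The helper `iterable_but_not_string` is a hypothetical sketch of the function asked about earlier in the thread, not the actual `com.is_list_like` implementation.

```python
from collections.abc import Iterable

# Lists expose the dunder method __len__, not a plain attribute named "len".
has_len = hasattr([], "len")
has_dunder_len = hasattr([], "__len__")
print(has_len, has_dunder_len)


def iterable_but_not_string(obj):
    """Hypothetical helper: True for iterables other than str/bytes."""
    return isinstance(obj, Iterable) and not isinstance(obj, (str, bytes))


print(iterable_but_not_string([1, 2]), iterable_but_not_string("abc"))
```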

cpcloud · 2013-06-14T22:13:22Z

hm or just removed entirely

jreback · 2013-06-15T12:34:38Z

thanks for the fix

ghost assigned cpcloud Jun 14, 2013

cpcloud mentioned this issue Jun 14, 2013

BUG/API: remove infer_types from replace and fix compiled regex bug #3909

Merged

jreback closed this as completed in #3909 Jun 15, 2013

wesm unassigned cpcloud Oct 12, 2016
