BUG: read_csv: dtype={'id' : np.str}: Datatype not understood #3209

amelio-vazquez-reina · 2013-03-29T00:11:43Z

I have a CSV with several columns. The first of which is a field called id with entries of the type 0001, 0002, etc.

When loading this file, the following works:

pd.read_csv(my_path, dtype={'id' : np.int})

but the following doesn't:

pd.read_csv(my_path, dtype={'id' : np.str})

nor does this either:

pd.read_csv(my_path, dtype={'id' : str})

I get: Datatype not understood

This is with pandas-0.10.1

The text was updated successfully, but these errors were encountered:

jreback · 2013-03-29T20:33:13Z

use np.object_ dtype
np.str is a very specifc dtype that needs size information, so hard to deal with

In [13]: data = """1,0001
2,0002
3,0003"""

In [20]: pd.read_csv(StringIO.StringIO(data),header=None,
                                 names=['int','object'],dtype={1 : np.object_ })
Out[20]: 
   int object
0    1   0001
1    2   0002
2    3   0003

In [21]: pd.read_csv(StringIO.StringIO(data),header=0,
                                 names=['int','object'],dtype={1 : np.object_ }).dtypes
Out[21]: 
int        int64
object    object
dtype: object

jreback · 2013-04-02T19:48:16Z

@ribonoous did this solve your issue?

amelio-vazquez-reina · 2013-04-02T19:49:45Z

Yes @jreback Sorry I didn't acknowledge this earlier. I am all set!

zkk995 · 2016-01-08T04:37:40Z

it works now

D = pd.read_csv(filep, sep=sep, dtype=mm,header=None,names=feature_name,\
            keep_default_na=False,na_values={m:'' for m,v in mm.items() if v==np.object_})

amelio-vazquez-reina mentioned this issue Mar 29, 2013

BUG: read_csv: dtype={'id' : np.str}: Datatype not understood ipython/ipython#3109

Closed

amelio-vazquez-reina closed this as completed Apr 2, 2013

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

BUG: read_csv: dtype={'id' : np.str}: Datatype not understood #3209

BUG: read_csv: dtype={'id' : np.str}: Datatype not understood #3209

amelio-vazquez-reina commented Mar 29, 2013

jreback commented Mar 29, 2013

jreback commented Apr 2, 2013

amelio-vazquez-reina commented Apr 2, 2013

zkk995 commented Jan 8, 2016

BUG: read_csv: dtype={'id' : np.str}: Datatype not understood #3209

BUG: read_csv: dtype={'id' : np.str}: Datatype not understood #3209

Comments

amelio-vazquez-reina commented Mar 29, 2013

jreback commented Mar 29, 2013

jreback commented Apr 2, 2013

amelio-vazquez-reina commented Apr 2, 2013

zkk995 commented Jan 8, 2016