Wrong dtype for unicode field in np.rec.fromarrays() #4201

taldcroft · 2014-01-14T20:01:51Z

In Python 2 or 3 with Numpy 1.7.1 there seems to be problem with np.rec.fromarrays creating a dtype format that is a factor of 4 too large:

In [5]: a = np.array([u'xyz'])

In [6]: a.dtype
Out[6]: dtype('<U3')

In [7]: a2 = np.rec.fromarrays([a], names=['a'])

In [8]: a2.dtype
Out[8]: dtype([('a', '<U12')])

In [9]: a3 = np.rec.fromarrays([a2['a']], names=['a'])

In [10]: a3.dtype
Out[10]: dtype([('a', '<U48')])

It looks like the problem is here, where here itemsize is 12 for a 3-character unicode string with UCS-4 encoding:

            if issubclass(obj.dtype.type, nt.flexible):
                formats += repr(obj.itemsize)

The text was updated successfully, but these errors were encountered:

charris · 2014-02-24T21:32:23Z

Hah. Numpy unicode is 4 bytes wide, I'll bet that is where the factor of 4 comes from.

charris · 2014-02-24T21:34:54Z

I'm guessing this is an easy fix. If not, it will be a pretty hard fix.

taldcroft · 2014-02-24T21:36:56Z

😄

embray · 2015-06-16T14:40:15Z

Ah, somehow missed this one before, but this was fixed by #5251. This issue can be closed.

eric-wieser · 2017-04-21T23:18:40Z

Thanks @embray for noting that

taldcroft mentioned this issue Jan 14, 2014

Make code Python 3 compatible sot/ska_numpy#1

Merged

charris added component: numpy.core and removed component: numpy.core labels Feb 24, 2014

eric-wieser closed this as completed Apr 21, 2017

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Wrong dtype for unicode field in np.rec.fromarrays() #4201

Wrong dtype for unicode field in np.rec.fromarrays() #4201

taldcroft commented Jan 14, 2014

charris commented Feb 24, 2014

charris commented Feb 24, 2014

taldcroft commented Feb 24, 2014

embray commented Jun 16, 2015

eric-wieser commented Apr 21, 2017

Wrong dtype for unicode field in np.rec.fromarrays() #4201

Wrong dtype for unicode field in np.rec.fromarrays() #4201

Comments

taldcroft commented Jan 14, 2014

charris commented Feb 24, 2014

charris commented Feb 24, 2014

taldcroft commented Feb 24, 2014

embray commented Jun 16, 2015

eric-wieser commented Apr 21, 2017