Failure of unicode sandwich for multi-dimensional columns #9614

mhvk · 2019-11-17T18:58:56Z

We allow comparing unicode strings with byte-valued columns using a unicode sandwich, but this fails in the corner case where the column is multidimensional and we take a single row from it:

from astropy.table import Column
c = Column([[b'a', b'b'], [b'c', b'd']], name='c')
print(c == ['a', 'b'])
# [[ True  True]
#  [False False]]
print(c[0] == ['a', 'b'])
# False  # expected [True, True]

EDIT (2024-01-11): now gives [False, False] so still an issue.

The reason is that for multi-D columns, indexing a single row of a Column just returns a plain ndarray, which does not know about the unicode sandwich: https://github.com/astropy/astropy/blob/master/astropy/table/_column_mixins.pyx#L54-L55

A fix might be to convert the result to unicode or to use an ndarray subclass that still contains the unicode sandwich (but is otherwise a plain ndarray).
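
To make the second option concrete, here is a rough numpy-only sketch of what such a subclass could look like. `SandwichArray` and `_encode_other` are hypothetical names made up for illustration, not anything in astropy, and the utf-8 encoding is an assumption; the real fix would also need to hook into the row access in `_column_mixins.pyx`.

```python
import numpy as np


class SandwichArray(np.ndarray):
    """Hypothetical ndarray subclass that keeps a unicode sandwich for == and !=."""

    def _encode_other(self, other):
        # If we hold bytes and the other operand is unicode, encode it first
        # (utf-8 is assumed here) so that bytes get compared with bytes.
        other = np.asarray(other)
        if self.dtype.kind == "S" and other.dtype.kind == "U":
            other = np.char.encode(other, "utf-8")
        return other

    def __eq__(self, other):
        # Compare as a plain ndarray so the result is an ordinary bool array.
        return self.view(np.ndarray) == self._encode_other(other)

    def __ne__(self, other):
        return self.view(np.ndarray) != self._encode_other(other)


data = np.array([[b"a", b"b"], [b"c", b"d"]])
# Stand-in for what row access on a multi-D column could return.
row = data[0].view(SandwichArray)
result = row == ["a", "b"]
print(result)
# [ True  True]
```

Apart from the two comparison operators the array behaves like a plain ndarray; the other comparisons (`<`, `>`, etc.) would need the same treatment.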

cc @taldcroft

p.s. Found while trying a new Masked class for creating a new MaskedColumn.


taldcroft · 2020-04-02T13:54:13Z

I suspect this is a bit of a rabbit hole... with new numpy dtypes is there hope for a single-byte character type anywhere on the horizon before I'm retired?

mhvk · 2020-04-03T00:16:38Z

Yes, hope, but before you're retired... that may depend on how long you plan to continue working....

mhvk added table Bug labels Nov 17, 2019

taldcroft added Priority-Low Effort-medium Package-expert labels Apr 2, 2020

embray removed the Priority-Low label Jun 29, 2021
