`Column.pprint` fails for scalars #12584

nstarman · 2021-12-10T14:11:20Z

Description

Raises TypeError because object is scalar and the code expects an array object.

Expected behavior

If it can be made into a Column, pprint works.

Actual behavior

See Description.

Steps to Reproduce

import astropy.units as u
from astropy.table import Column

Column(1).pprint()

TypeError: len() of unsized object

System Details

macOS-10.16-x86_64-i386-64bit
Python 3.9.5 (default, May 18 2021, 12:31:01)
[Clang 10.0.0 ]
Numpy 1.21.4
pyerfa 2.0.0.1
astropy 5.1.dev207+gbfb9252df.d20211130
Scipy 1.7.1
Matplotlib 3.3.4

The text was updated successfully, but these errors were encountered:

nstarman · 2021-12-10T14:14:05Z

@mhvk, I suspect many issues similar to this one exist across Astropy. numpy.void is consistently an edge case: a pseudo-scalar that can contain heterogenous data, including non-scalars.

mhvk · 2021-12-10T19:13:56Z

I think this may not be related to void, but rather to .pprint being unable to print scalars generally:

Column(1).pprint()
TypeError: len() of unsized object

dhomeier · 2021-12-14T02:36:09Z

Yep

table.Column([m_nu]).pprint(show_dtype=True)
     None    
   void192   
-------------
(0., 0., 0.6)

nstarman · 2021-12-14T21:15:57Z

Thanks @mhvk, @dhomeier. Changing the title to reflect the true issue.

datajungler · 2022-01-22T15:59:48Z

m_nu = u.Quantity((0, 0, 0.6), unit="eV")
Column(m_nu).pprint(show_dtype=True)

pprint method for the "Column" class is originally designed for handling monotonous units. Therefore, the issue can be resolved by declaring a new Quantity with single-string unit.

Output:

None
float64
-------
    0.0
    0.0
    0.6

nstarman · 2022-01-22T19:52:30Z

@datajungler, u.Quantity((0, 0, 0.6), unit="eV") and u.Quantity((0, 0, 0.6), unit="(eV, eV, eV)") are fundamentally different objects. The former is a (3,) vector, while the latter is a () scalar. A better comparison is u.Quantity(0.6, unit="eV") , which will also fail with Column.pprint:

m_nu = u.Quantity(0.6, unit="eV")
Column(m_nu).pprint(show_dtype=True)

taldcroft · 2022-02-04T17:26:04Z

I edited the original description to show the real problem, since the use of a structured Quantity is just a misdirection that is confusing since it appears somewhat array-like at first glance.

At some level I feel like Column itself should raise an exception for a scalar input since it is really meant to be part of a table. I can't see where a scalar Column makes any sense, but I suspect that raising an exception in that case will break stuff.

So what should pprint() do in this case? The current behavior of outputting as a column really implies that it has a length.

nstarman · 2022-02-05T19:40:59Z

At some level I feel like Column itself should raise an exception for a scalar input since it is really meant to be part of a table. I can't see where a scalar Column makes any sense, but I suspect that raising an exception in that case will break stuff.

While I agree on the principle, pragmatically I think the most important thing is consistency with np.array behavior. numpy.ndarray allows for array scalars, so I think Column should as well.
Agreed that a behavior change will probably break stuff.

So what should pprint() do in this case? The current behavior of outputting as a column really implies that it has a length.

I think array scalars should be special cased. Unfortunate, but probably necessary.

neutrinoceros · 2023-12-15T17:20:43Z

I've opened #15749 to attempt to fix this. I went with what felt like the most natural approach (also suggested by @nstarman) to special-case scalar columns. The patch is really small at the moment but it's not completely functional as it breaks at least one existing test. Feedback and suggestions are most welcome !

mhvk · 2023-12-15T17:39:22Z

@neutrinoceros - I'm less sure we even want to print scalar column that way, since also the regular repr is different:

In [11]: Column(1)
Out[11]: 1

In [12]: Column([1])
Out[12]: 
<Column dtype='int64' length=1>
1

I do think pprint() should not fail, but it may be OK to just typeset the number with the format function without having the column name, etc. I.e., I'd advocate some form of if self.ndim == 0; return <something-simple>.

tactipus · 2023-12-15T20:17:34Z

I've opened #15749 to attempt to fix this. I went with what felt like the most natural approach (also suggested by @nstarman) to special-case scalar columns. The patch is really small at the moment but it's not completely functional as it breaks at least one existing test. Feedback and suggestions are most welcome !

hi! would you like to share the testing with us? like how you wrote it & what the results were. TYIA!

neutrinoceros · 2023-12-15T20:36:57Z

@mhvk sorry I opened #15754 before I saw your comment, I just noticed that independently and assumed it could be considered a bug. If you believe that behaviour to be desirable feel free to close my issue !

@tactipus My pull request is publicly available. Actually by now I think I figured it out !

taldcroft · 2023-12-18T14:18:57Z

I'm still on the fence about this. I think of Column as an nd.array subclass that is required to have a length. Like you can have a Box subclass of Rectangle where the edge lengths are required to be the same.

taldcroft · 2023-12-18T14:31:36Z

However, I've confirmed that not allowing Column to be initialized with a scalar does indeed create problems in astropy code that are not immediately trivial. This suggests that user code might break as well, so it looks like #15749 is the better choice if it doesn't impact performance.

tactipus · 2023-12-21T21:14:36Z

sup. just saw the notif for this thread. are we still good on this?

neutrinoceros · 2023-12-22T15:14:23Z

@tactipus We're working on a fix in the linked PR #15749

nstarman added table Bug labels Dec 10, 2021

nstarman changed the title ~~Column.pprint can't display dtype for numpy.void~~ Column.pprint fails for scalars Dec 14, 2021

neutrinoceros linked a pull request Dec 15, 2023 that will close this issue

BUG: fix a crash when calling Column.pprint on a scalar column #15749

Open

1 task

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

`Column.pprint` fails for scalars #12584

`Column.pprint` fails for scalars #12584

nstarman commented Dec 10, 2021 •

edited by taldcroft

nstarman commented Dec 10, 2021

mhvk commented Dec 10, 2021

dhomeier commented Dec 14, 2021

nstarman commented Dec 14, 2021

datajungler commented Jan 22, 2022

nstarman commented Jan 22, 2022 •

edited

taldcroft commented Feb 4, 2022

nstarman commented Feb 5, 2022

neutrinoceros commented Dec 15, 2023

mhvk commented Dec 15, 2023

tactipus commented Dec 15, 2023

neutrinoceros commented Dec 15, 2023

taldcroft commented Dec 18, 2023

taldcroft commented Dec 18, 2023

tactipus commented Dec 21, 2023

neutrinoceros commented Dec 22, 2023

Column.pprint fails for scalars #12584

Column.pprint fails for scalars #12584

Comments

nstarman commented Dec 10, 2021 • edited by taldcroft

Description

Expected behavior

Actual behavior

Steps to Reproduce

System Details

nstarman commented Dec 10, 2021

mhvk commented Dec 10, 2021

dhomeier commented Dec 14, 2021

nstarman commented Dec 14, 2021

datajungler commented Jan 22, 2022

nstarman commented Jan 22, 2022 • edited

taldcroft commented Feb 4, 2022

nstarman commented Feb 5, 2022

neutrinoceros commented Dec 15, 2023

mhvk commented Dec 15, 2023

tactipus commented Dec 15, 2023

neutrinoceros commented Dec 15, 2023

taldcroft commented Dec 18, 2023

taldcroft commented Dec 18, 2023

tactipus commented Dec 21, 2023

neutrinoceros commented Dec 22, 2023

`Column.pprint` fails for scalars #12584

`Column.pprint` fails for scalars #12584

nstarman commented Dec 10, 2021 •

edited by taldcroft

nstarman commented Jan 22, 2022 •

edited