[Python] table.to_pandas segfault #18993
Closed
Description
Here is a small snippet which produces a segfault:
In [1]: import pyarrow as pa
In [2]: import pyarrow.parquet as pq
In [3]: pa_ar = pa.array([[], []])
In [4]: pq.write_table(
   ...:     table=pa.Table.from_arrays([pa_ar], ["test"]),
   ...:     where="test5.parquet",
   ...:     compression="snappy",
   ...:     flavor="spark"
   ...: )
In [5]: pq.read_table("test5.parquet")
Out[5]:
pyarrow.Table
test: list<item: null>
child 0, item: null
In [6]: pq.read_table("test5.parquet").to_pydict()
Out[6]: OrderedDict([(u'test', [None, None])])
In [7]: pq.read_table("test5.parquet").to_pandas()
Segmentation fault
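Since the crash takes the whole interpreter down, it can help to run the snippet in a child process and inspect the exit status. The following is only a rough sketch: `run_isolated`, `describe`, and `REPRO` are hypothetical names made up for illustration, and the "negative return code means killed by a signal" convention is assumed to hold only on POSIX systems.

```python
import signal
import subprocess
import sys
import textwrap


def run_isolated(snippet: str, timeout: float = 60.0) -> int:
    """Run a Python snippet in a child interpreter and return its exit code."""
    result = subprocess.run(
        [sys.executable, "-c", textwrap.dedent(snippet)],
        capture_output=True,
        timeout=timeout,
    )
    return result.returncode


def describe(returncode: int) -> str:
    """Turn a child exit code into a short human-readable status."""
    if returncode == 0:
        return "ok"
    if returncode < 0:
        # On POSIX, subprocess reports death by signal N as -N.
        try:
            name = signal.Signals(-returncode).name
        except ValueError:
            name = f"signal {-returncode}"
        return f"killed by {name}"
    return f"exited with code {returncode}"


# Same steps as the session above, written to a temporary directory
# so that no file is left behind.
REPRO = """
import os
import tempfile

import pyarrow as pa
import pyarrow.parquet as pq

with tempfile.TemporaryDirectory() as tmp:
    path = os.path.join(tmp, "test5.parquet")
    table = pa.Table.from_arrays([pa.array([[], []])], ["test"])
    pq.write_table(table, path, compression="snappy", flavor="spark")
    pq.read_table(path).to_pandas()
"""

if __name__ == "__main__":
    print(describe(run_isolated(REPRO)))
```

On an affected pyarrow build this would be expected to report a kill by SIGSEGV rather than crash the calling session. If pyarrow is not installed in the child interpreter, it reports a plain non-zero exit code instead.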
I thank you in advance for having this fixed.
Best,
Jacques
Reporter: jacques
Note: This issue was originally created as ARROW-2598. Please see the migration documentation for further details.