Joris Van den Bossche / @jorisvandenbossche: [~razvanch] thanks for the report. Could you provide a small script to reproduce the issue (some code to create a Table with such a type)?
Joris Van den Bossche / @jorisvandenbossche:
Another thing: I suppose it should be possible to add this conversion, but the question is what it would convert to. For other List types, we convert to a NumPy array of NumPy arrays. But for a dictionary type, this would mean that it would lose the dictionary-encoding information.
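To illustrate the trade-off described above, here is a minimal pure-Python sketch of what expanding a list of dictionary-encoded strings into nested plain values involves. It is not pyarrow's implementation; the function name `decode_list_of_dictionary` and the sample `offsets`, `indices`, and `dictionary` values are hypothetical, chosen only to mirror the usual offsets-plus-child layout of a list column.

```python
from typing import List, Optional, Sequence


def decode_list_of_dictionary(
    offsets: Sequence[int],
    indices: Sequence[Optional[int]],
    dictionary: Sequence[str],
) -> List[List[Optional[str]]]:
    """Expand a list<dictionary<string>> column into nested Python lists.

    offsets[i]:offsets[i + 1] delimits row i inside `indices`; each index
    points into `dictionary`, and None marks a null element.
    """
    rows = []
    for start, stop in zip(offsets[:-1], offsets[1:]):
        rows.append(
            [None if idx is None else dictionary[idx] for idx in indices[start:stop]]
        )
    return rows


# Hypothetical sample data: three rows, the second one empty.
dictionary = ["apple", "banana"]
indices = [0, 1, 0, None, 1]
offsets = [0, 2, 2, 5]

decoded = decode_list_of_dictionary(offsets, indices, dictionary)
print(decoded)
```

The decoded rows hold only plain strings: the shared `dictionary` and the integer codes are gone, which is the information loss mentioned in the comment above.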
Hi,
pyarrow.Table.to_pandas() fails on an Arrow List Vector where the data vector is of type "dictionary encoded string". Here is the table schema as printed by pyarrow:
and the data (also attached in a file to this ticket)
and the exception I got
Note that the data vector itself can be loaded successfully by to_pandas.
It'd be great if this would be addressed in the next version of pyarrow. For now, is there anything I can do on my end to bypass this unimplemented conversion?
Thanks,
Razvan
Reporter: Razvan Chitu
Assignee: Wes McKinney / @wesm
Original Issue Attachments:
PRs and other links:
Note: This issue was originally created as ARROW-6899. Please see the migration documentation for further details.