-
Notifications
You must be signed in to change notification settings - Fork 3.3k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
ARROW-10643: [Python] Pandas<->pyarrow roundtrip failing to recreate index for empty dataframe #12311
Conversation
I have split the code, hope it makes sense. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks for the updates!
Co-authored-by: Joris Van den Bossche <jorisvandenbossche@gmail.com>
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thank!
Benchmark runs are scheduled for baseline = 4144c17 and contender = bd35629. bd35629 is a master commit associated with this PR. Results will be available as each benchmark for each run completes. |
This PR tries to correct the roundtrip of an empty
pandas.DataFrame
withRangeIndex
(so no columns, but a non-zero shape for the rows) by adding a check for empty columns and apandas.RangeIndex
in thefrom_arrays
method called fromfrom_pandas
and then creating an empty table with schema andnum_rows
.