Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

ARROW-6492: [Python] Handle pandas_metadata created by fastparquet with missing field_name #5331

Conversation

jorisvandenbossche
Copy link
Member

https://issues.apache.org/jira/browse/ARROW-6492

In our compat code, we have special cases for dealing with missing field_name keys (as arrow did not write those up to 0.8), but that also dealt with fastparquet compat. However, this code did not cover the case where the index was stored as metadata-only, which now happens in fastparquet (because fastparquet adopted the syntax we use to store index metadata).

@pitrou pitrou changed the title ARROW-6492: [Python] handle pandas_metadata created by fastparquet with missing field_name ARROW-6492: [Python] Handle pandas_metadata created by fastparquet with missing field_name Sep 9, 2019
Copy link
Member

@wesm wesm left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

+1. I wish we could have avoided a lot of this by having a single Parquet library for Python

@wesm
Copy link
Member

wesm commented Sep 9, 2019

Merging. Appveyor only failed on account of the googletest_ep flakiness

I opened https://issues.apache.org/jira/browse/ARROW-6498

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

2 participants