ARROW-1103: [Python] Support read_pandas (with index metadata) on directory of Parquet files #862

wesm · 2017-07-18T02:04:18Z

Also fixes ARROW-1041, a case where the _metadata file contains the pandas schema metadata but the individual dataset fragments do not.

Change-Id: I7009d36ea5d3181d68d3819938d1e07dd0bdb004

…ndas with metadata working on a multi-file dataset Change-Id: I761df66366930c096f35bc98dc0fba054a9d1910

…e individual Parquet dataset pieces don't Change-Id: Ic4e7f92d66d7865337c4f728313c348e951ab778

wesm · 2017-07-18T02:04:56Z

cc @cpcloud could you have a look at this? at some point we might need to do a little work to make the test cases easier to generate; these were a bit tedious

xhochy

+1, LGTM

wesm added 3 commits July 17, 2017 20:35

Initial refactor to support common metadata, read_pandas on a dataset

b362d60

Change-Id: I7009d36ea5d3181d68d3819938d1e07dd0bdb004

Add experimental replace_schema_metadata functions, get basic read_pa…

5985fc1

…ndas with metadata working on a multi-file dataset Change-Id: I761df66366930c096f35bc98dc0fba054a9d1910

Add test for esoteric case where _metadata has pandas metadata but th…

3f30916

…e individual Parquet dataset pieces don't Change-Id: Ic4e7f92d66d7865337c4f728313c348e951ab778

xhochy approved these changes Jul 18, 2017

View reviewed changes

asfgit closed this in 362e754 Jul 18, 2017

wesm deleted the ARROW-1103 branch July 18, 2017 17:52

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ARROW-1103: [Python] Support read_pandas (with index metadata) on directory of Parquet files #862

ARROW-1103: [Python] Support read_pandas (with index metadata) on directory of Parquet files #862

wesm commented Jul 18, 2017

wesm commented Jul 18, 2017

xhochy left a comment

ARROW-1103: [Python] Support read_pandas (with index metadata) on directory of Parquet files #862

ARROW-1103: [Python] Support read_pandas (with index metadata) on directory of Parquet files #862

Conversation

wesm commented Jul 18, 2017

wesm commented Jul 18, 2017

xhochy left a comment

Choose a reason for hiding this comment