You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
When reading partitioned parquet files (tested with those produced by Spark), that contain lists, the resulting table seems to contain data loaded only from one partition. Primitive types seems to be loaded correctly.
It can be reproduced using following code (arrow 0.6.0, spark 2.1.1):
Jonas Amrich:
Yes, that looks very similar - I must have overlooked that issue before. However it seems that the fix doesn't solve the problem. Using 0.6.1.dev64+g9968d95d only makes thing stranger:
Wes McKinney / @wesm:
OK, one of us ( @xhochy or me) will have to take a look so we can resolve this before 0.7.0 final goes out. If you find the problem feel free to submit a patch
When reading partitioned parquet files (tested with those produced by Spark), that contain lists, the resulting table seems to contain data loaded only from one partition. Primitive types seems to be loaded correctly.
It can be reproduced using following code (arrow 0.6.0, spark 2.1.1):
When the data is loaded using Spark or coalesced into one partition, everything works as expected:
Reporter: Jonas Amrich
Assignee: Wes McKinney / @wesm
Note: This issue was originally created as ARROW-1459. Please see the migration documentation for further details.
The text was updated successfully, but these errors were encountered: