[Python] expose filters argument in parquet.read_table #21889

Closed
asfimport opened this issue May 29, 2019 · 1 comment
asfimport commented May 29, 2019

Currently, the parquet.read_table function can be used both for reading a single file (as an interface to ParquetFile) and for reading a directory (as an interface to ParquetDataset).

ParquetDataset has some extra keywords such as filters that would be nice to expose through read_table as well.

Of course, one can always use ParquetDataset directly when its extra features are needed. But for pandas, which wraps pyarrow, it is easier to pass keywords straight through to parquet.read_table than to choose between calling read_table and ParquetDataset. Context: pandas-dev/pandas#26551

Reporter: Joris Van den Bossche / @jorisvandenbossche
Assignee: Joris Van den Bossche / @jorisvandenbossche

Related issues:

PRs and other links:

Note: This issue was originally created as ARROW-5436. Please see the migration documentation for further details.


Uwe Korn / @xhochy:
Issue resolved by pull request 4409
#4409
