You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Currently, the parquet.read_table function can be used both for reading a single file (interface to ParquetFile) as a directory (interface to ParquetDataset).
ParquetDataset has some extra keywords such as filters that would be nice to expose through read_table as well.
Of course one can always use ParquetDataset if you need its power, but for pandas wrapping pyarrow it is easier to be able to pass through keywords just to parquet.read_table instead of calling either read_table or ParquetDataset. Context: pandas-dev/pandas#26551
Currently, the
parquet.read_table
function can be used both for reading a single file (interface to ParquetFile) as a directory (interface to ParquetDataset).ParquetDataset has some extra keywords such as
filters
that would be nice to expose throughread_table
as well.Of course one can always use
ParquetDataset
if you need its power, but for pandas wrapping pyarrow it is easier to be able to pass through keywords just toparquet.read_table
instead of calling eitherread_table
orParquetDataset
. Context: pandas-dev/pandas#26551Reporter: Joris Van den Bossche / @jorisvandenbossche
Assignee: Joris Van den Bossche / @jorisvandenbossche
Related issues:
PRs and other links:
Note: This issue was originally created as ARROW-5436. Please see the migration documentation for further details.
The text was updated successfully, but these errors were encountered: