I am trying to read a large dataset and filter out some rows while reading. I tried passing a filters parameter through pyarrow_additional_kwargs, but it did not work: I got the same, unfiltered data.
x = wr.s3.read_parquet(
    "s3://xxx/yyyy/",
    chunked=True,
    boto3_session=session,
    dataset=False,
    use_threads=True,
    pyarrow_additional_kwargs={"filters": [("purchases", "=", 1)]},
)
I also tried with dataset=True, but that didn't work either.
x = wr.s3.read_parquet(
    "s3://xxx/yyyy/",
    chunked=True,
    boto3_session=session,
    dataset=True,
    use_threads=True,
    pyarrow_additional_kwargs={"filters": [("purchases", "=", 1)]},
)
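A possible workaround, whether or not the filters option is honored: because chunked=True yields DataFrames one at a time, the rows can be filtered in pandas as each chunk arrives. The following is only a minimal sketch. filter_chunks is a hypothetical helper, not part of awswrangler, and the small in-memory DataFrames stand in for the chunks a chunked read would yield. Note that this does not reduce the amount of data read from S3; it only keeps memory bounded to one chunk at a time.

```python
from typing import Any, Iterable, Iterator

import pandas as pd


def filter_chunks(
    chunks: Iterable[pd.DataFrame], column: str, value: Any
) -> Iterator[pd.DataFrame]:
    """Yield, for each chunk, only the rows where `column` equals `value`.

    Hypothetical helper for illustration; chunks with no matching rows are skipped.
    """
    for chunk in chunks:
        matching = chunk[chunk[column] == value]
        if not matching.empty:
            yield matching


# Stand-in data (assumption): in the real case this would be the iterator
# returned by a chunked read rather than a hand-built list.
example_chunks = [
    pd.DataFrame({"purchases": [0, 1, 2], "user": ["a", "b", "c"]}),
    pd.DataFrame({"purchases": [3, 4], "user": ["d", "e"]}),
    pd.DataFrame({"purchases": [1, 1], "user": ["f", "g"]}),
]

# Combine the filtered chunks into a single DataFrame.
filtered = pd.concat(
    filter_chunks(example_chunks, "purchases", 1), ignore_index=True
)
```

With a real chunked read, the iterator assigned to x above would be passed in place of example_chunks.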