Should we deprecate gather_statistics in read_parquet? #8937
Comments
I’m generally +1 on deprecating My proposal would probably be to change the (Possibly) Related Thoughts… I’d also be interested to know if anyone is actually using the Note that I am already exploring these kinds of defaults in #8944, and I am still very unsure of what the best balance is.
👍 👍 👍 This proposal makes sense to me. Changing this without a deprecation period would mainly lead to bad behavior for users reading from datasets containing large files written by some other tool (dask never writes large files). Based on our small survey I suspect this isn't the norm, so we might be able to get by without a deprecation period? Otherwise, do you have thoughts on how we might smoothly make this change?
I was thinking along the lines of the plan mentioned in 8901: Temporarily raise a warning when there is a _metadata file present (and Either way, I intend to submit a PR to cudf to set
From @rjzamora in #8899 (comment):