New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
ENH: Add storage_options to read_parquet #2107
ENH: Add storage_options to read_parquet #2107
Conversation
I believe the CI failure is unrelated. |
Yes, fixed in #2101 |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks a lot for the PR!
I have a slight preference for keeping to pass through string URIs for eg S3 if no storage options are provided, since pyarrow can handle those (the annoying part is to know what pyarrow supports, though, since this is only limited (currently s3 and hdfs). Similar to pandas-dev/pandas#41194. But can also leave this as a follow-up.
I copied your implementation from the pandas PR, but didn't add any tests. Are we wanting a full moto-based S3 testing setup here? Can I get another approval to run CI? :) |
Hopefully my latest commit fixes the test coverage issue (could I get another CI run @jorisvandenbossche or @martinfleis?) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks, looks good!
Thanks @TomAugspurger ! |
agreed, thanks all! |
Closes #2071