FIX-#4636: allows read_parquet to detect column partitioning in non-local filesystems #5192
b9e1902 to 1ae5c98
    path_generator = os.walk(path)
else:
    storage_options = kwargs.get("storage_options")
    fs, fs_path = url_to_fs(path, **storage_options)
The storage_options variable can be None, so this construction breaks several tests in CI.
Done.
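The thread does not show what the fix looked like. As a minimal sketch of one common way to guard against this, assuming the only problem is unpacking None with `**`: the function fake_url_to_fs below is a hypothetical stand-in for url_to_fs (it only echoes its arguments), and resolve is likewise a made-up helper name, not code from this PR.

```python
def fake_url_to_fs(path, **storage_options):
    """Hypothetical stand-in for url_to_fs; it only echoes its arguments."""
    return storage_options, path


def resolve(path, **kwargs):
    # kwargs.get returns None when the key is missing or explicitly None;
    # "or {}" turns both cases into an empty mapping that ** can unpack.
    storage_options = kwargs.get("storage_options") or {}
    return fake_url_to_fs(path, **storage_options)


def unpack_none_fails():
    # Reproduces the failure described in the review comment:
    # ** requires a mapping, so unpacking None raises TypeError.
    try:
        fake_url_to_fs("s3://bucket/data", **None)
    except TypeError:
        return True
    return False
```

With this guard, a call without storage_options and a call with storage_options=None both reach the filesystem lookup with no extra keyword arguments.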
…ning in non-local filesystems Signed-off-by: Bill Wang <billiam@ponder.io>
Signed-off-by: Bill Wang <billiam@ponder.io>
…ndas version Signed-off-by: Bill Wang <billiam@ponder.io>
2c1b8c6 to 152a7a3
Signed-off-by: Bill Wang <billiam@ponder.io>
Thanks @Billy2551 for the fix!
…-local filesystems (#5192) Signed-off-by: Bill Wang <billiam@ponder.io>
Signed-off-by: Bill Wang <billiam@ponder.io>
What do these changes do?
Added support for read_parquet on column partitioned parquet files in non-local filesystems. Includes tests on non-local S3 parquet files.
flake8 modin/ asv_bench/benchmarks scripts/doc_checker.py
black --check modin/ asv_bench/benchmarks scripts/doc_checker.py
git commit -s
docs/development/architecture.rst is up-to-date
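For readers unfamiliar with column partitioning, here is a rough sketch of how partition columns might be recovered from file paths. It assumes a Hive-style layout with column=value directory names, which the description above does not state. The helper name partition_columns and the example paths are hypothetical, and this is not the code from the PR.

```python
import posixpath


def partition_columns(file_paths):
    """Return partition column names found in column=value directory parts.

    Assumes Hive-style partition directories (an assumption for illustration)
    and "/"-separated paths, as used by remote filesystems such as S3.
    """
    columns = []
    for file_path in file_paths:
        # Only directory components can be partition keys, so drop the file name.
        for part in posixpath.dirname(file_path).split("/"):
            if "=" in part:
                name = part.split("=", 1)[0]
                if name not in columns:
                    columns.append(name)
    return columns
```

Because the helper works on plain path strings, the same logic applies whether the paths come from os.walk on a local directory or from a listing of a remote filesystem.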