base dask on fsspec #5064
Side thought: you should write about this. The message might be something like: "We (I) have invested a lot of time building out the bytes handling in Dask in a nicely modular way that can be helpful to others. We found that, surprisingly, it was helpful to others. This includes Intake, X, Y, and Z (zarr, rapids, ...?). We've now moved the last bits out of Dask so that you too can use these systems without depending on Dask explicitly (though of course it's a very lightweight dependency)." |
@mrocklin: yep, eventually. I posted the code here more as a backup and to show the dev group that this really would work. |
Merging this requires merges in other repos and possibly releases, but all tests pass locally now. |
OK, so errors are due to two main things:
|
Woot! |
So it would be a good idea to decide now how the read_* functions should handle compression. Should blocksize != None simply raise an exception? None of the compression engines normally supports real random access (even if some formats can). Currently, random access is allowed and works on py37, but it scans through the file to find the right offset. That could be useful in some scenarios, but is terrible in most. |
Just a single failure at https://travis-ci.org/dask/dask/jobs/557843869#L1002, around compression with chunksize, but also quite a few Windows path errors that I'll need to do some work on. |
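To make the scanning concern concrete, here is a minimal sketch using only Python's standard gzip module (it is not Dask's or fsspec's code, and the payload and offset are made up for illustration). Seeking inside a gzip stream succeeds, but the module emulates it by decompressing and discarding everything before the target offset.

```python
import gzip
import io

# Build a small gzip payload in memory (a stand-in for a compressed file).
# Each row is 11 bytes, so the uncompressed data is 110000 bytes long.
raw = b"".join(b"row %06d\n" % i for i in range(10000))
compressed = gzip.compress(raw)

offset = 50000  # hypothetical block start, measured in uncompressed bytes

with gzip.GzipFile(fileobj=io.BytesIO(compressed)) as f:
    # seek() is emulated: gzip decompresses and throws away all data
    # before `offset`, so the cost grows with the offset.
    f.seek(offset)
    block = f.read(11)

# The bytes read match a plain slice of the uncompressed data.
assert block == raw[offset:offset + 11]
```

With many blocks per file, each block repeats this scan from the start, which is why chunked reads of compressed files are usually a poor trade.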
When you say it like that, erring seems reasonable to me. I haven't thought about this very deeply though. cc'ing @jcrist who might have thoughts? |
For Lines 380 to 387 in 3eb2dc8
Seems reasonable to me to convert this to an error everywhere. |
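A rough sketch of what "an error everywhere" could look like. The helper name `check_chunked_read` and the message wording are hypothetical, chosen for illustration, and are not taken from the PR.

```python
def check_chunked_read(compression, blocksize):
    """Hypothetical guard: refuse chunked reads of compressed data."""
    if compression is not None and blocksize is not None:
        raise ValueError(
            "Cannot do chunked reads on compressed files; "
            "set blocksize=None to read each file whole."
        )


check_chunked_read(None, 2**20)   # uncompressed and chunked: allowed
check_chunked_read("gzip", None)  # compressed and read whole: allowed

try:
    check_chunked_read("gzip", 2**20)  # compressed and chunked: rejected
except ValueError as err:
    message = str(err)
```

Failing early like this trades the occasional useful slow path for predictable behaviour across Python versions and compression formats.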
OK, actually passes Travis for py37 (linting not yet done). |
green on windows, only flake errors on py37 :) |
Leaving imports in dask.bytes.core, so as not to break code that uses these outside of dask. Will later mark for deprecation.
py37: passing (with s3fs from master) |
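One possible way to keep the old import locations working while flagging them for later removal is to wrap the re-exported callables so they emit a DeprecationWarning. This is only a sketch: `deprecated_alias`, `_new_open`, and the module paths in the message are hypothetical names, not what the PR does.

```python
import functools
import warnings


def deprecated_alias(func, old_path, new_path):
    """Wrap `func` so that calling it via the old location warns first."""

    @functools.wraps(func)
    def wrapper(*args, **kwargs):
        warnings.warn(
            "{} is deprecated; use {} instead".format(old_path, new_path),
            DeprecationWarning,
            stacklevel=2,  # point the warning at the caller, not the wrapper
        )
        return func(*args, **kwargs)

    return wrapper


def _new_open(path):
    # Stand-in for an implementation that now lives in another package.
    return "opened " + path


# The old module would keep exporting this name for backwards compatibility.
legacy_open = deprecated_alias(_new_open, "old_pkg.core.open", "new_pkg.open")
```

Existing callers keep working unchanged and only see a warning, which leaves room to remove the old names in a later release.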
Summary:
Outstanding:
|
Most likely #5104
Could the master versions be moved to the nightly upstream build? I've almost got that passing, just one remaining pandas failure that I'll try to fix today. |
Sure, I'm OK with that. Then this PR should go back to released versions only in all builds. |
PR#4995 now has these changes merged in, and all tests are passing. |
Will merge this evening, unless there are objections. |
Woo! Thanks for all your work on this @martindurant :) |
black dask / flake8 dask
lots of red in the +/-!