Add compute kwargs to methods that write data to disk #6056
jrbourbeau merged 20 commits into dask:master from KrishanBhasin:feature/compute-kwargs
I can't see where to use the
def compute_as_if_collection(cls, dsk, keys, scheduler=None, get=None, **kwargs):
    """Compute a graph as if it were of type cls.

    Allows for applying the same optimizations and default scheduler."""
    schedule = get_scheduler(scheduler=scheduler, cls=cls, get=get)
    dsk2 = optimization_function(cls)(ensure_dict(dsk), keys, **kwargs)
    return schedule(dsk2, keys, **kwargs)
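For readers following along, here is a minimal sketch of the pattern the quoted function follows: resolve a scheduler first, then forward any remaining keyword arguments to it. Everything in it (`make_get`, `SCHEDULERS`, `toy_compute`, the `num_workers` keyword) is a hypothetical stand-in written for illustration, not dask's actual implementation.

```python
# Toy stand-ins only: none of these names are dask's real implementation.

def make_get(name):
    """Build a hypothetical scheduler that reports the kwargs it receives."""
    def get(dsk, keys, **kwargs):
        # Run each requested task (a zero-argument callable) in order.
        results = [dsk[key]() for key in keys]
        return {"scheduler": name, "results": results, "kwargs": kwargs}
    return get


# Assumed registry of schedulers, keyed by name.
SCHEDULERS = {"sync": make_get("sync"), "threads": make_get("threads")}


def toy_compute(dsk, keys, scheduler=None, **kwargs):
    """Pick a scheduler by name (default 'threads'), then forward kwargs."""
    get = SCHEDULERS[scheduler or "threads"]
    return get(dsk, keys, **kwargs)


graph = {"a": lambda: 1, "b": lambda: 2}
out = toy_compute(graph, ["a", "b"], scheduler="sync", num_workers=2)
default_out = toy_compute(graph, ["a"])
```

The point of the sketch is only that `scheduler` is consumed when choosing how to run the graph, while the other keyword arguments travel on to whichever scheduler was chosen.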
gforsyth
left a comment
Hey @KrishanBhasin! I know this is a draft, but I have a few comments here (they should also help with the test failures).
@gforsyth do you have any thoughts on how I can keep
I see you have reached the linting stage. Is this PR ready for review? @gforsyth, did you have an interest?
gforsyth
left a comment
Hey @KrishanBhasin -- this is coming along! I haven't had a chance to look at to_hdf yet; I'll try to do that this week.
@martindurant may have thoughts on how to handle that
I think that at this time to_hdf belongs to all of us :)
On Wed, Apr 8, 2020 at 6:26 AM Martin Durant wrote:
> to_hdf isn't one of mine, I would have to do some digging, not sure when I would find the time.
@gforsyth I've cleaned up the bits you mentioned! The hdf thing bothers me, though. I'll try to have another crawl through its code when I get the time.
Hey @KrishanBhasin -- sorry for my delay in getting to this. I think we should push this in and open a separate issue to tackle the hdf stuff. Could you add tests for
I'll get to this over the weekend, sorry for the delay!
No worries! This is otherwise ready to go in.
I've added a test for the json case, but I have a funny feeling I'm missing the point with it. Do you just want a test that passes in a dask/dask/dataframe/io/tests/test_csv.py Lines 1405 to 1421 in 929a40f?
I think there should be tests for each
The Checking whether
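As a rough illustration of the kind of test being discussed, the sketch below passes a recording scheduler through `compute_kwargs` and checks that the writer actually used it. It is a self-contained toy: `toy_to_csv`, `default_get`, and `recording_get` are hypothetical names invented here, and the real dask writers and the tests in this PR may be structured differently.

```python
import csv
import os
import tempfile


def default_get(tasks, **kwargs):
    """Hypothetical default scheduler: run every write task in order."""
    return [task() for task in tasks]


def toy_to_csv(partitions, directory, compute=True, compute_kwargs=None):
    """Write one file per partition, forwarding compute_kwargs to the scheduler."""
    def make_task(i, rows):
        def task():
            path = os.path.join(directory, f"{i}.part")
            with open(path, "w", newline="") as f:
                csv.writer(f).writerows(rows)
            return path
        return task

    tasks = [make_task(i, rows) for i, rows in enumerate(partitions)]
    if not compute:
        return tasks  # the caller is responsible for running these later
    kwargs = dict(compute_kwargs or {})
    # Assumption for this toy: "scheduler" may be a callable, as in the sketch.
    get = kwargs.pop("scheduler", default_get)
    return get(tasks, **kwargs)


calls = []  # records the kwargs each scheduler invocation received


def recording_get(tasks, **kwargs):
    """Scheduler wrapper that notes it was called, then defers to the default."""
    calls.append(kwargs)
    return default_get(tasks, **kwargs)


with tempfile.TemporaryDirectory() as tmp:
    paths = toy_to_csv(
        [[("x", 1)], [("y", 2)]],
        tmp,
        compute_kwargs={"scheduler": recording_get},
    )
    written = sorted(os.listdir(tmp))
```

A test in this style only asserts that the supplied scheduler was invoked and that the files were written; it says nothing about performance or about which schedulers a given writer supports.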
gforsyth
left a comment
This looks great! Thanks for putting this in @KrishanBhasin!
@jrbourbeau this is good to go
jrbourbeau
left a comment
Thanks for this contribution @KrishanBhasin, I'm looking forward to seeing this merged. Overall, things here look really good; I just left a few small comments.
Co-authored-by: James Bourbeau <jrbourbeau@users.noreply.github.com>
Hey @KrishanBhasin -- looks like
Test added and
jrbourbeau
left a comment
This looks great, thank you for working on this @KrishanBhasin! Thanks @gforsyth for reviewing!
black dask / flake8 dask
Fixes #6026