
Serialization family to preserve headers of the underlying dumps functions. #5380

Merged
merged 18 commits into dask:main from preserve_dump_header on Nov 5, 2021

Conversation

@madsbk (Contributor) commented Oct 1, 2021

Currently, the dask and cuda serialization families overwrite the headers of the underlying dumps functions. For instance, cuda_dumps overwrites "type-serialized", which is a problem when Dask uses pickle5 while the underlying loads function uses pickle protocol=4. Besides, overwriting an underlying protocol's header is bad style.

This PR fixes the issue.
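
For context, a minimal sketch of the pattern described above, assuming the family-level dumps/loads pair in distributed/protocol/serialize.py: the type-specific header is kept intact under a "sub-header" key rather than merged into the family header, and the family-level loads hands it back unchanged. This is illustrative only, not the exact code merged in this PR.

# Illustrative sketch of the "sub-header" pattern; not the merged implementation.
import pickle  # stdlib pickle used here purely for illustration

from distributed.protocol.serialize import dask_serialize, dask_deserialize


def dask_dumps(x):
    # Dispatch to the type-specific dumps (e.g. the numpy serializer).
    dumps = dask_serialize.dispatch(type(x))
    sub_header, frames = dumps(x)
    # Keep the underlying header untouched under "sub-header" instead of
    # merging it, so keys such as "type-serialized" are never clobbered.
    header = {
        "sub-header": sub_header,
        "type-serialized": pickle.dumps(type(x)),
        "serializer": "dask",
    }
    return header, frames


def dask_loads(header, frames):
    typ = pickle.loads(header["type-serialized"])
    loads = dask_deserialize.dispatch(typ)
    # Hand the underlying loads its own, unmodified header.
    return loads(header["sub-header"], frames)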

@madsbk (Contributor, Author) commented Oct 1, 2021

Note that I have changed the test test_compression_numpy_list() to better reflect what (I think) we want to support.

@madsbk madsbk marked this pull request as ready for review October 1, 2021 14:36
@jrbourbeau (Member) left a comment

Thanks @madsbk! It looks like there's another spot we need to insert ["sub-headers"]

____________________________ test_deserialize_grad _____________________________

    def test_deserialize_grad():
        a = np.random.rand(8, 1)
        t = torch.tensor(a, requires_grad=True, dtype=torch.float)
>       t2 = deserialize(*serialize(t))

distributed/protocol/tests/test_torch.py:48: 
_ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ 
distributed/protocol/serialize.py:410: in deserialize
    return loads(header, frames)
distributed/protocol/serialize.py:50: in dask_loads
    return loads(header["sub-header"], frames)
distributed/protocol/torch.py:36: in deserialize_torch_Tensor
    x = dask_deserialize.dispatch(np.ndarray)(header, frames)
_ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ 

header = {'device': 'cpu', 'requires_grad': True, 'serializer': 'dask', 'sub-header': {'dtype': (0, '<f4'), 'shape': (8, 1), 'strides': (4, 4), 'writeable': [True]}, ...}
frames = [<memory at 0x13abbdb80>]

    @dask_deserialize.register(np.ndarray)
    def deserialize_numpy_ndarray(header, frames):
        with log_errors():
            if header.get("pickle"):
                return pickle.loads(frames[0], buffers=frames[1:])
    
            (frame,) = frames
>           (writeable,) = header["writeable"]
E           KeyError: 'writeable'

distributed/protocol/numpy.py:116: KeyError

@jakirkham would you have time to take a look at this PR?

Review thread on distributed/protocol/serialize.py (outdated, resolved)
@madsbk (Contributor, Author) commented Oct 5, 2021

Thanks @madsbk! It looks like there's another spot we need to insert ["sub-headers"]

Thanks, the torch protocol now also uses a sub-header
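
For illustration, here is a hedged sketch of what "also uses a sub-header" might look like on the torch side, based on the traceback above: the numpy header is nested under "sub-header" and unwrapped again before dispatching to the ndarray deserializer. Names mirror distributed/protocol/torch.py, but the details (CPU-only handling, gradient flag) are assumptions, not the merged code.

# Hedged sketch of torch sub-header handling; not the exact code from this PR.
import numpy as np
import torch

from distributed.protocol.serialize import dask_serialize, dask_deserialize


@dask_serialize.register(torch.Tensor)
def serialize_torch_Tensor(t):
    # Serialize the underlying ndarray with the numpy serializer (CPU tensors
    # only, for simplicity) and keep its header intact under "sub-header".
    sub_header, frames = dask_serialize.dispatch(np.ndarray)(t.detach().numpy())
    header = {
        "sub-header": sub_header,
        "requires_grad": t.requires_grad,
        "device": t.device.type,
    }
    return header, frames


@dask_deserialize.register(torch.Tensor)
def deserialize_torch_Tensor(header, frames):
    # Forward only the nested numpy header, so deserialize_numpy_ndarray
    # finds keys such as "writeable" where it expects them.
    x = dask_deserialize.dispatch(np.ndarray)(header["sub-header"], frames)
    t = torch.from_numpy(x)
    if header["requires_grad"]:
        t = t.requires_grad_(True)
    return t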

@jakirkham (Member) commented

@jakirkham would you have time to take a look at this PR?

@jrbourbeau, were you just wanting feedback on that test failure or was there something else you were looking for?

@madsbk (Contributor, Author) commented Oct 5, 2021

Hmm, distributed/protocol/tests/test_torch.py::test_resnet is still failing. Will look at it tomorrow.

@madsbk (Contributor, Author) commented Oct 6, 2021

As far as I can see, the CI errors aren't related to this PR.

FAILED distributed/tests/test_stress.py::test_stress_creation_and_deletion - ...
FAILED distributed/comm/tests/test_ws.py::test_collections - tornado.util.Tim..
FAILED distributed/deploy/tests/test_adaptive.py::test_adaptive_local_cluster_multi_workers
FAILED distributed/tests/test_asyncprocess.py::test_exit_callback 

@madsbk (Contributor, Author) commented Oct 6, 2021

@jrbourbeau, I think this is ready to be merged.

@jakirkham (Member) commented

@jrbourbeau was there anything else we needed to do here or is this good to go?

@madsbk (Contributor, Author) commented Nov 4, 2021

@jrbourbeau it would be good to get this merged; it is blocking Dask-CUDA: rapidsai/dask-cuda#746

@jakirkham (Member) commented

Going to merge to unblock Dask-CUDA. The PyTorch issue noted originally has been resolved.

Most of the failures here seem to be related to Distributed's concurrent.futures support and are unrelated to this PR. There is one failure with test_spill_to_disk, but it only happens in one job. We might want to keep an eye on that, though.

@jakirkham jakirkham merged commit 4b2d1f2 into dask:main Nov 5, 2021
@madsbk madsbk deleted the preserve_dump_header branch November 11, 2021 07:35