
Commit

Update docs
hendrikmakait committed Aug 17, 2023
1 parent 8251678 commit b76bd7a
Showing 2 changed files with 7 additions and 13 deletions.
11 changes: 4 additions & 7 deletions distributed/shuffle/_rechunk.py
@@ -293,15 +293,14 @@ class ArrayRechunkRun(ShuffleRun[NDIndex, "np.ndarray"]):
This object is responsible for splitting, sending, receiving and combining
data shards.
- It is entirely agnostic to the distributed system and can perform a shuffle
- with other `Shuffle` instances using `rpc` and `broadcast`.
+ It is entirely agnostic to the distributed system and can perform a rechunk
+ with other run instances using `rpc`.
- The user of this needs to guarantee that only `Shuffle`s of the same unique
- `ShuffleID` interact.
+ The user of this needs to guarantee that only `ArrayRechunkRun`s of the same unique
+ `ShuffleID` and `run_id` interact.
Parameters
----------
- # FIXME
worker_for:
A mapping partition_id -> worker_address.
old:
@@ -318,8 +317,6 @@ class ArrayRechunkRun(ShuffleRun[NDIndex, "np.ndarray"]):
The scratch directory to buffer data in.
executor:
Thread pool to use for offloading compute.
- loop:
- The event loop.
rpc:
A callable returning a PooledRPCCall to contact other Shuffle instances.
Typically a ConnectionPool.
9 changes: 3 additions & 6 deletions distributed/shuffle/_shuffle.py
@@ -342,14 +342,13 @@ class DataFrameShuffleRun(ShuffleRun[int, "pd.DataFrame"]):
data shards.
It is entirely agnostic to the distributed system and can perform a shuffle
- with other `Shuffle` instances using `rpc` and `broadcast`.
+ with other run instances using `rpc`.
- The user of this needs to guarantee that only `Shuffle`s of the same unique
- `ShuffleID` interact.
+ The user of this needs to guarantee that only `DataFrameShuffleRun`s of the
+ same unique `ShuffleID` and `run_id` interact.
Parameters
----------
- # FIXME
worker_for:
A mapping partition_id -> worker_address.
column:
@@ -364,8 +363,6 @@ class DataFrameShuffleRun(ShuffleRun[int, "pd.DataFrame"]):
The scratch directory to buffer data in.
executor:
Thread pool to use for offloading compute.
- loop:
- The event loop.
rpc:
A callable returning a PooledRPCCall to contact other Shuffle instances.
Typically a ConnectionPool.
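
The updated docstrings state that only runs sharing the same `ShuffleID` and `run_id` may interact, and that `worker_for` maps a partition ID to a worker address. A minimal sketch of that invariant follows. It is not the `distributed` implementation: `RunKey`, `ToyRun`, `target_worker`, and `receive` are hypothetical names invented for illustration, and the worker addresses are made up.

```python
from dataclasses import dataclass, field


@dataclass(frozen=True)
class RunKey:
    """Hypothetical identifier: one run (run_id) of one shuffle (shuffle_id)."""

    shuffle_id: str
    run_id: int


@dataclass
class ToyRun:
    """Toy stand-in for a run object; an assumption, not the library's class."""

    key: RunKey
    worker_for: dict[int, str]  # partition_id -> worker_address
    received: list[tuple[int, bytes]] = field(default_factory=list)

    def target_worker(self, partition_id: int) -> str:
        # Look up which worker is responsible for an output partition.
        return self.worker_for[partition_id]

    def receive(self, sender_key: RunKey, partition_id: int, shard: bytes) -> None:
        # Reject shards from a different shuffle or from a different run
        # of the same shuffle, mirroring the guarantee in the docstring.
        if sender_key != self.key:
            raise RuntimeError(f"{sender_key} does not match {self.key}")
        self.received.append((partition_id, shard))


# Example addresses are placeholders.
worker_for = {0: "tcp://10.0.0.1:1234", 1: "tcp://10.0.0.2:1234"}
run = ToyRun(RunKey("shuffle-a", 1), worker_for)
run.receive(RunKey("shuffle-a", 1), 0, b"shard")  # same shuffle and run: accepted
```

In this sketch the check lives on the receiving side; the docstring itself only says the user of the class must guarantee the invariant, so where the check belongs in practice is left open here.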
