[Draft] DSL design for next version #519

Binyang2014 · 2025-05-05T18:36:53Z

No description provided.

docs/design/mscclpp-dsl-next.md

docs/design/mscclpp-dsl-json-file.md

mahdiehghazim · 2025-05-21T16:47:41Z

docs/design/mscclpp-dsl-next.md

+channel = Channel(dst_rank, src_rank, channel_type)
+sem.acquire(tb=1)
+channel.put(other_peer_chunk, dst_chunk, tb=1)
+```


is this code pipelining "copy data from input buffer to scratch buffer" with "transfer data from scratch buffer to other peers". if so, which variable is the scratch buffer?

sem.release(tb=0) and sem.acquire(tb=1) ensures that copy to dst_chunk occures before put operation, right?

please provide more detailed explanation about the code

mahdiehghazim · 2025-05-21T16:59:13Z

docs/design/mscclpp-dsl-next.md

+```python
+sem = Rank.Semaphore(rank=rank, size=1)
+rank = Rank(src_rank)
+with Loop.iteration(unit=2**20, num_chunks=1) as iter:


please add more explanation about the code. e.g., what is unit?

mahdiehghazim · 2025-05-21T16:59:29Z

docs/design/mscclpp-dsl-next.md

+    # The dst_chunk and src_chunk sizes should match the num_chunks parameter in the loop context.
+    rank.copy(dst_chunk, src_chunk, tb=0, iter_context=iter)
+    sem.release(tb=0)
+    channel = Channel(dst_rank, src_rank, channel_type)


can we define channel before loop?

mahdiehghazim · 2025-05-21T17:00:41Z

docs/design/mscclpp-dsl-next.md

+
+
+Here is the example for two ranks allreduce. Which achieve non-zero copy and use nvls. We use 3 thread-blocks to do the allreduce.
+The first thread-block is used to copy data from input buffer to scratch buffer, the second thread-block is used to do allreduce in scratch buffer, and the third thread-block is used to copy data from scratch buffer to output buffer.  The thread-blocks are synchronized by semaphores.


please add a line by line explanation of the code

mahdiehghazim · 2025-05-21T17:03:42Z

docs/design/mscclpp-dsl-next.md

+```
+
+## All2All support
+For now, DSL only support static all2all algorithm. For all2allv support, we need to get the send/recv size at the runtime. It may require some placeholder at the Json execution plan and relace to the real size at the runtime. If we could make chunk size be variable, we could use the same way to support all2allv.


--> "Currently, the DSL only supports the static all2all algorithm. To support all2allv, we need to obtain the send/receive sizes at runtime. This may require using placeholders in the JSON execution plan, which would be replaced with the actual sizes during execution. If we can make the chunk size variable, the same approach could be used to support all2allv."

Binyang2014 added 7 commits May 5, 2025 18:34

dsl

3af88a0

WIP

2139ffb

WIP

f61f229

WIP

eeb9811

WIP

a59e675

WIP

a88006e

update

be6cc03

caiomcbr reviewed May 5, 2025

View reviewed changes

docs/design/mscclpp-dsl-next.md Outdated Show resolved Hide resolved

caiomcbr reviewed May 5, 2025

View reviewed changes

docs/design/mscclpp-dsl-next.md Show resolved Hide resolved

caiomcbr reviewed May 5, 2025

View reviewed changes

docs/design/mscclpp-dsl-next.md Outdated Show resolved Hide resolved

caiomcbr reviewed May 5, 2025

View reviewed changes

docs/design/mscclpp-dsl-next.md Outdated Show resolved Hide resolved

caiomcbr reviewed May 5, 2025

View reviewed changes

docs/design/mscclpp-dsl-next.md Outdated Show resolved Hide resolved

Binyang2014 added 2 commits May 5, 2025 23:28

move chunk to top

731b588

WIP

2871995

caiomcbr reviewed May 5, 2025

View reviewed changes

docs/design/mscclpp-dsl-next.md Outdated Show resolved Hide resolved

caiomcbr reviewed May 5, 2025

View reviewed changes

docs/design/mscclpp-dsl-next.md Outdated Show resolved Hide resolved

caiomcbr reviewed May 5, 2025

View reviewed changes

docs/design/mscclpp-dsl-next.md Outdated Show resolved Hide resolved

caiomcbr reviewed May 5, 2025

View reviewed changes

docs/design/mscclpp-dsl-next.md Outdated Show resolved Hide resolved

caiomcbr reviewed May 6, 2025

View reviewed changes

docs/design/mscclpp-dsl-next.md Show resolved Hide resolved

address some comments

99e3824

caiomcbr reviewed May 6, 2025

View reviewed changes

docs/design/mscclpp-dsl-next.md Outdated Show resolved Hide resolved

caiomcbr reviewed May 6, 2025

View reviewed changes

docs/design/mscclpp-dsl-next.md Outdated Show resolved Hide resolved

Binyang2014 added 5 commits May 6, 2025 20:35

fix comments

2dbaa58

update

7a7a921

address comments

efde2e1

WIP

ffcb121

WIP

4a527d2

caiomcbr reviewed May 8, 2025

View reviewed changes

docs/design/mscclpp-dsl-next.md Show resolved Hide resolved

caiomcbr reviewed May 8, 2025

View reviewed changes

docs/design/mscclpp-dsl-next.md Show resolved Hide resolved

caiomcbr reviewed May 8, 2025

View reviewed changes

docs/design/mscclpp-dsl-next.md Outdated Show resolved Hide resolved

Binyang2014 and others added 9 commits May 13, 2025 13:50

Merge branch 'main' into binyli/dsl

34d32be

WIP

a97ec12

WIP

35b5da9

Merge branch 'main' into binyli/dsl

d8e8d07

update

d288461

wip

b5aefca

wip

57c15fa

wip

7a70cdf

wip

91f47b3

Binyang2014 commented May 15, 2025

View reviewed changes

docs/design/mscclpp-dsl-json-file.md Outdated Show resolved Hide resolved

docs/design/mscclpp-dsl-json-file.md Outdated Show resolved Hide resolved

caiomcbr and others added 2 commits May 15, 2025 20:20

wip

4479111

Merge branch 'main' into binyli/dsl

439dcc8

Binyang2014 commented May 15, 2025

View reviewed changes

docs/design/mscclpp-dsl-json-file.md Outdated Show resolved Hide resolved

docs/design/mscclpp-dsl-json-file.md Outdated Show resolved Hide resolved

docs/design/mscclpp-dsl-json-file.md Outdated Show resolved Hide resolved

docs/design/mscclpp-dsl-json-file.md Outdated Show resolved Hide resolved

Binyang2014 commented May 15, 2025

View reviewed changes

docs/design/mscclpp-dsl-json-file.md Outdated Show resolved Hide resolved

Binyang2014 added 2 commits May 15, 2025 23:12

WIP

36a2460

typo fix

6f402c6

Binyang2014 commented May 16, 2025

View reviewed changes

docs/design/mscclpp-dsl-json-file.md Show resolved Hide resolved

caiomcbr added 4 commits May 16, 2025 23:53

wip

ef949b5

wip

2b54de9

wip

4cc1bad

wip

a9d476d

Binyang2014 commented May 18, 2025

View reviewed changes

docs/design/mscclpp-dsl-json-file.md Outdated Show resolved Hide resolved

docs/design/mscclpp-dsl-json-file.md Outdated Show resolved Hide resolved

docs/design/mscclpp-dsl-json-file.md Outdated Show resolved Hide resolved

docs/design/mscclpp-dsl-json-file.md Show resolved Hide resolved

Merge branch 'main' into binyli/dsl

9fa0068

mahdiehghazim reviewed May 21, 2025

View reviewed changes

caiomcbr added 5 commits May 21, 2025 21:52

wip

9dffc6a

wip

6658cdf

wip

c49ac04

wip

a1ae75d

wip

93457cf

Binyang2014 mentioned this pull request Jun 20, 2025

[Feature] Flexible selection logic of execution plans #556

Open



		Here is the example for two ranks allreduce. Which achieve non-zero copy and use nvls. We use 3 thread-blocks to do the allreduce.
		The first thread-block is used to copy data from input buffer to scratch buffer, the second thread-block is used to do allreduce in scratch buffer, and the third thread-block is used to copy data from scratch buffer to output buffer. The thread-blocks are synchronized by semaphores.

[Draft] DSL design for next version #519

Are you sure you want to change the base?

[Draft] DSL design for next version #519

Uh oh!

Conversation

Binyang2014 commented May 5, 2025

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

mahdiehghazim May 21, 2025

Choose a reason for hiding this comment

Uh oh!

mahdiehghazim May 21, 2025

Choose a reason for hiding this comment

Uh oh!

mahdiehghazim May 21, 2025

Choose a reason for hiding this comment

Uh oh!

mahdiehghazim May 21, 2025

Choose a reason for hiding this comment

Uh oh!

mahdiehghazim May 21, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!