Increase segment sizes in coll/han and coll/adapt #11360

devreal · 2023-01-31T15:04:58Z

Increase the segment sizes for bcast, reduce, and allreduce to 512k. On modern machines, larger segments seem to be more efficient because they reduce the overhead of segmenting (fewer messages, better chance of saturating the network).

Example for increased segment sizes on Hawk (64 core AMD EPYC Rome, ConnectX-6):

[Plot: Reduce with 64k (current segment size)]

[Plot: Reduce with 512k (new segment size)]

Note the lower latency on the right side of the plots. The change in segment size yields an improvement of about 10x for han running on top of tuned. There is no data for han on top of sm because sm crashes at this segment size.

devreal · 2023-02-28T15:59:30Z

Some additional measurements on Hawk for allreduce, bcast, and reduce for 4MB operations, 32 nodes, 64 processes per node. Clearly, higher segment sizes are favorable for HAN. I tried to set the segment size for coll/tuned but that mechanism seems broken.
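To make the segmenting overhead concrete, here is a minimal sketch that counts how many segments a 4MB operation is split into at the old and new segment sizes. It is an illustration only, not Open MPI code: the helper name `segment_count` is hypothetical, and it assumes that "k" means KiB (1024 bytes), that "4MB" means 4 MiB, and that a message is simply cut into fixed-size pieces with a possibly shorter last piece.

```python
# Illustrative arithmetic only; not taken from coll/han or coll/adapt.
KIB = 1024          # assumption: "64k" / "512k" mean KiB
MIB = 1024 * KIB    # assumption: "4MB" means 4 MiB


def segment_count(message_bytes: int, segment_bytes: int) -> int:
    """Return how many fixed-size segments cover the message (hypothetical helper)."""
    if message_bytes < 0:
        raise ValueError("message_bytes must be non-negative")
    if segment_bytes <= 0:
        raise ValueError("segment_bytes must be positive")
    # Integer ceiling division: the last segment may be shorter than the rest.
    return (message_bytes + segment_bytes - 1) // segment_bytes


if __name__ == "__main__":
    message = 4 * MIB
    for segment in (64 * KIB, 512 * KIB):
        count = segment_count(message, segment)
        print(f"{segment // KIB}k segments: {count} per 4MB message")
```

Under these assumptions, a 4MB message is split into 64 segments at 64k and 8 segments at 512k, so each process handles one eighth as many per-segment messages. How much of the measured improvement comes from that reduction alone is not something this sketch can show.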

devreal added 2 commits January 31, 2023 09:49

coll/han: increase segment sizes to 512k

e29e7fa

Increase segment sizes for bcast, reduce, and allreduce to 512k. On modern machines, higher segment sizes seem to be more efficient as they reduce the overhead of segmenting. Signed-off-by: Joseph Schuchart <schuchart@icl.utk.edu>

coll/adapt: Increase ireduce segment size

867c3df

A larger segment size helps reduce the overhead of segmenting. The 512k size matches the size of coll/han. Signed-off-by: Joseph Schuchart <schuchart@icl.utk.edu>

devreal added the Target: main label Jan 31, 2023

devreal requested a review from bosilca January 31, 2023 15:04
