Point-to-point communications in NCCL? #212

Open
ktnyt opened this issue Apr 24, 2019 · 13 comments

Comments

@ktnyt

ktnyt commented Apr 24, 2019

At GTC last month I attended the session by Mr. J. Kraus on multi-GPU programming, and I heard from him that there were plans for point-to-point communication support in NCCL, and that nudging the development team with an issue might help get their attention.
While I did feel the idea was somewhat controversial (since this is a collective communication library), it would be great if point-to-point communication were indeed supported. However, I also feel this is a niche request, so I wouldn't expect it to roll out anytime soon.
Has there been any discussion on supporting point-to-point communication? And if so, is there a roadmap for it? Any response would be very helpful.
Thanks in advance.

@sjeaugey
Member

We've had plans to implement point-to-point communication for some time now, in the form of two new primitives: ncclSend and ncclRecv. Then, by combining ncclSend and ncclRecv with ncclGroupStart and ncclGroupEnd, users could implement any Alltoall, Scatter, Gather, or neighbor collective.

So it would look like point-to-point, but with the idea of implementing a collective alltoallv operation within the NCCL communicator -- which would follow the same rules as the current collective operations, i.e. operations are serialized on the communicator.

The main difference with MPI is that this is still a blocking call on the GPU side; there is no Isend/Irecv.

We are indeed interested in hearing about use cases, to determine precisely whether users need blocking send/receive operations (alltoallv) or send/receive that is asynchronous with respect to collective operations (which NCCL cannot provide due to CUDA kernel semantics).
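
To make that concrete, here is a minimal sketch of an alltoall composed from the proposed primitives, assuming ncclSend/ncclRecv end up taking the same (buffer, count, datatype, peer, comm, stream) arguments as the existing collectives:

```c
// Minimal sketch of an alltoall composed from the proposed primitives.
// Assumption: ncclSend/ncclRecv take (buffer, count, datatype, peer, comm, stream)
// like the existing collectives, and grouping fuses them into one blocking op.
#include <nccl.h>
#include <cuda_runtime.h>

void alltoall(const float *sendbuf, float *recvbuf, size_t count,
              int nranks, ncclComm_t comm, cudaStream_t stream) {
  ncclGroupStart();
  for (int peer = 0; peer < nranks; peer++) {
    // Send the slice destined for `peer` and receive the slice coming from it.
    ncclSend(sendbuf + peer * count, count, ncclFloat, peer, comm, stream);
    ncclRecv(recvbuf + peer * count, count, ncclFloat, peer, comm, stream);
  }
  ncclGroupEnd();  // all sends/recvs complete as one operation on the stream
}
```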

@ktnyt
Author

ktnyt commented May 24, 2019

Thank you for the feedback! And apologies for not replying sooner.

The current plan sounds feasible for our use case, since we do not need non-blocking operations for the time being.
We have been developing an algorithm that uses MPI's blocking point-to-point communication and broadcast functionality, and we wanted to see whether we could transition it to multiple GPUs.

@Tixxx

Tixxx commented Jun 14, 2019

Hi, I'm not sure whether the work has already started, since this thread was opened more than a month ago. But we are also looking for ways to do direct point-to-point communication using the NCCL library, to support one of our distributed training algorithms. The algorithm is a pair-wise binary tree reduction, which we have already implemented using blocking MPI send/recv; we believe NCCL would deliver an even better performance boost, and blocking ncclSend and ncclRecv would be sufficient for our use case. So I'm really looking forward to hearing about the roadmap for this feature. Please let me know whether this has been planned. Thanks in advance!
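
For reference, here is a rough sketch of the kind of pair-wise binary tree reduction described above, written with blocking MPI calls; the power-of-two rank count and the buffer handling are simplifying assumptions for illustration:

```c
// Rough sketch of a pair-wise binary tree reduction using blocking MPI calls.
// Assumes the number of ranks is a power of two; the buffer length `n` and
// the use of summation are placeholders.
#include <mpi.h>

void tree_reduce(float *buf, float *tmp, int n, int rank, int nranks) {
  for (int step = 1; step < nranks; step <<= 1) {
    if (rank & step) {
      // Hand the partial result to the partner and drop out of the tree.
      MPI_Send(buf, n, MPI_FLOAT, rank - step, 0, MPI_COMM_WORLD);
      return;
    } else if (rank + step < nranks) {
      // Receive the partner's partial result and accumulate it locally.
      MPI_Recv(tmp, n, MPI_FLOAT, rank + step, 0, MPI_COMM_WORLD,
               MPI_STATUS_IGNORE);
      for (int i = 0; i < n; i++) buf[i] += tmp[i];
    }
  }
  // Rank 0 ends up holding the fully reduced result.
}
```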

@Tixxx

Tixxx commented Jun 25, 2019

Hi @sjeaugey Could you provide any insight on the plan to support blocking NCCL send and recv? Looking forward to having some collaboration with NCCL devs. Thanks!

@sjeaugey
Member

Hi @Tixxx. I don't see Send/Recv coming in the near future, as we are still focusing on allreduce and its variants, and this is a large feature that needs a significant amount of work, with a lot of preparation and refactoring to be done first. For example (and among other things), we are trying to rewrite the topology detection and ring/tree creation to make it less ring-focused and more general, which is one of the steps needed before we can start on point-to-point.

@nevion

nevion commented Feb 24, 2020

hi @sjeaugey - the lack of point-to-point communication in a high-level library like NCCL affects applications I work on that don't need collectives, just integrated, efficient data transfer (glorified memcpy) across GPUs, from the inter-thread to the inter-node case (with GPUDirect support). Messaging semantics would be nice at times as well, but the need there is lesser than for an RMA-like operation.

Is there any sort of timeline, or actively worked-on items, in support of the point-to-point communication pattern? Your messages here and in #270 indicate significant redesigns are needed first, which makes me think it's a good year or more out - and that doesn't work for me. Is there anything that can be done to make it work in the next few months? The worst part is that I really want to give NCCL a try, but none of the existing operations seem like a workable fit for my problem: simulation halo exchanges, a fairly common application.
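
For example, a 1D halo exchange along those lines could look something like the sketch below, assuming grouped ncclSend/ncclRecv with collective-style signatures; the neighbor ranks and the field layout are placeholders:

```c
// Illustrative 1D halo exchange using the hoped-for grouped send/recv.
// Assumptions: ncclSend/ncclRecv with collective-style signatures; `left` and
// `right` are neighbor ranks (negative means no neighbor); the field layout is
// [left ghost | interior | right ghost], all sizes in elements (placeholders).
#include <nccl.h>
#include <cuda_runtime.h>

void halo_exchange(float *field, size_t interior, size_t halo,
                   int left, int right, ncclComm_t comm, cudaStream_t stream) {
  ncclGroupStart();
  if (left >= 0) {
    ncclSend(field + halo, halo, ncclFloat, left, comm, stream);      // first interior slab
    ncclRecv(field, halo, ncclFloat, left, comm, stream);             // fill left ghost cells
  }
  if (right >= 0) {
    ncclSend(field + interior, halo, ncclFloat, right, comm, stream); // last interior slab
    ncclRecv(field + interior + halo, halo, ncclFloat, right,
             comm, stream);                                           // fill right ghost cells
  }
  ncclGroupEnd();  // the whole exchange runs as one blocking operation
}
```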

@sjeaugey
Member

Hi @nevion, hopefully this will arrive sooner than that. We are actively working on it now; the goal is to post a preview branch late next month, so that users can give it a try and provide feedback. Would that work for you?

@nevion

nevion commented Feb 24, 2020

@sjeaugey yes, that does indeed work for me.

@victoryang00

Looking forward to using the P2P functionality to make my project more capable!

@2sin18

2sin18 commented Mar 16, 2020

Any progress on this issue?

gather/scatter/alltoall are important to recommendation models (e.g. https://github.com/facebookresearch/dlrm/blob/master/dlrm_s_pytorch.py#L426), which still cannot utilize the GPU very well today.

@sjeaugey
Member

The p2p preview has been posted to the "p2p" branch. And PR #316 has been created for discussion / feedback.

@victoryang00

e "p2p" branch. And PR #316 has been created for discussion / feedback.

Thanks, that helps a lot.

sjeaugey linked a pull request Mar 31, 2020 that will close this issue
@2sin18

2sin18 commented Apr 3, 2020

The p2p preview has been posted to the "p2p" branch. And PR #316 has been created for discussion / feedback.

Great job!
