Optimize communication within the same rank #134
Conversation
```cpp
auto const receive_buffer_ptr =
    src_buffer.data() + _src_offsets[i] * num_packets;
requests.emplace_back();
MPI_Irecv(receive_buffer_ptr, message_size, MPI_BYTE, _sources[i], 123,
```
Is it just considered good practice to hard-code the tag here?
I am always wary of hard-coding an MPI tag, because you never know what a user will pick, and things may hang if the tags happen to be the same. Typically I try to expose the tag in the interface of any code that does MPI communication, with a default value provided.
> typically I try to expose this to the user in an interface for code that does MPI communication with a default value provided
How do you do that? Do all your communications use the same tag?
In this case I would expose it in either the distributed tree constructor or the send-across-network function, basically any function that does MPI. If you have a single function that does multiple MPI communications which need different tags, then you could ask for two or more tags, or a range of tags that the library is allowed to use. See this for example: https://github.com/ECP-copa/Cabana/blob/89240fe1a605589d7c0d925b41b3b57bc5586459/core/src/Cabana_Distributor.hpp#L100
Isn't it a much better and cleaner solution to do `MPI_Comm_dup`? That creates a new tag namespace, so any tag you use won't intersect with user communication. In fact, I think this should be done in ArborX.
As long as the object is long-lived that sounds reasonable. Either way we shouldn't use a fixed tag with a user-provided communicator. Either the approach I suggested or the approach @aprokop suggested will resolve this.
Comm duplication was merged in #135.
```cpp
auto const position = it - _sources.begin();
auto const receive_buffer_ptr =
    src_buffer.data() + _src_offsets[position] * num_packets;
std::memcpy(receive_buffer_ptr, send_buffer_ptr, message_size);
```
Are you guys using GPU-aware MPI? If you are, this won't work. If you aren't, we should have a discussion about that.
Good point.
But `src_buffer` is a `std::vector` in the current implementation, so the proposed changes are valid.
Fair enough. Let's make a note to discuss a GPU-aware strategy in the future, then.
ArborX/src/details/ArborX_DetailsDistributor.hpp, lines 147 to 149 in b6b7d7e:

```cpp
// TODO
// * apply permutation on the device in a parallel for
// * switch to MPI with CUDA support (do not copy to host)
```
We typically have large message sizes for communication within the same rank. Hence, this also fixes problems with message sizes that MPI can't handle within one MPI communication call. Also, see #131.