New intra-process communication design #239
Conversation
Please format with one sentence per line and no line breaks within a sentence so that changes made in response to comments are easier to track.
The choice of the buffer data type is controlled through an additional field in the `SubscriptionOptions`.
The default value for this option is named `CallbackDefault`, which corresponds to selecting the type between `shared_ptr<const MessageT>` and `unique_ptr<MessageT>` that best fits the callback type.
This is deduced by looking at the output of `AnySubscriptionCallback::use_take_shared_method()`.
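As a rough illustration of that deduction (a sketch only; the trait below is not the actual `AnySubscriptionCallback` code, and `StringMsg` is a made-up stand-in), the choice can be modeled with `std::is_invocable`: a callback that accepts `shared_ptr<const MessageT>` selects the shared buffer, otherwise the unique one.

```cpp
#include <memory>
#include <type_traits>

struct StringMsg {};  // hypothetical stand-in for a generated message type

// Illustrative model of use_take_shared_method(): true when the callback
// can be invoked with a shared_ptr<const MessageT>.
template <typename CallbackT, typename MessageT>
constexpr bool use_take_shared_method_v =
  std::is_invocable_v<CallbackT, std::shared_ptr<const MessageT>>;

// Callback taking shared_ptr<const MessageT>: a shared_ptr buffer fits best.
void shared_cb(std::shared_ptr<const StringMsg>) {}

// Callback taking unique_ptr<MessageT>: a unique_ptr buffer fits best.
void unique_cb(std::unique_ptr<StringMsg>) {}

static_assert(use_take_shared_method_v<decltype(&shared_cb), StringMsg>);
static_assert(!use_take_shared_method_v<decltype(&unique_cb), StringMsg>);
```

The `static_assert`s show the two outcomes: only the first callback can consume a shared instance without taking ownership.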
If the history QoS is set to `keep all`, the buffers are dynamically allocated.
On the other hand, if the history QoS is set to `keep last`, the buffers have a size equal to the depth of the history and they act as ring buffers (overwriting the …
This is different from how these QoS settings work in DDS. Not having the same semantics for QoS settings between inter-process and intra-process would make it hard to reuse a node.
We meant to say "buffers are dynamically adjusted in size up to some max limit specified as part of KEEP_ALL QoS". Then the semantics are the same, right?
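A minimal sketch of the buffer semantics being discussed (illustrative names only, not the actual rclcpp classes): `keep_last` overwrites the oldest entry once the buffer holds `depth` elements, while `keep_all` accepts messages only up to a maximum limit.

```cpp
#include <cstddef>
#include <deque>

// Illustrative sketch only: models the two history policies discussed
// above, not the actual rclcpp buffer implementation.
template <typename T>
class IntraProcessBufferSketch {
public:
  // keep_last: capacity == history depth, oldest entry is overwritten.
  // keep_all:  capacity == max limit, push is rejected when full.
  IntraProcessBufferSketch(bool keep_last, std::size_t capacity)
  : keep_last_(keep_last), capacity_(capacity) {}

  bool push(T value) {
    if (queue_.size() == capacity_) {
      if (keep_last_) {
        queue_.pop_front();  // ring-buffer behavior: drop the oldest
      } else {
        return false;        // keep_all at its max limit: reject
      }
    }
    queue_.push_back(std::move(value));
    return true;
  }

  bool pop(T & out) {
    if (queue_.empty()) {
      return false;
    }
    out = queue_.front();
    queue_.pop_front();
    return true;
  }

  std::size_t size() const { return queue_.size(); }

private:
  bool keep_last_;
  std::size_t capacity_;
  std::deque<T> queue_;
};
```

With this model the semantics match the DDS interpretation described in the reply: `keep_all` still has a bound, it simply rejects (or would block) instead of overwriting.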
### Publishing only intra-process

#### Publishing unique_ptr
Sometimes there's something to be said for using UML diagrams.
Sorry, but I don't get what you mean.
@alsora I am guessing the comment is recommending UML-based diagrams to visually describe some of the sections of this PR. For example, Creating a publisher, Creating a subscription, Publishing only intra-process, etc. could have sequence diagrams in addition to the description.
This is a tangential comment, but I wonder if we could achieve the same zero-copies-when-same-process result by reducing the number of copies required for going into and out of the …
The current prototype implementation and the design focus on …
I am not sure about the status of …

While I agree about maintaining feature parity between the two supported clients, this design proposal is an improvement to the existing intra-process support, which is currently supported only in …

If we continue to have the intra-process manager live in …
How about moving the …

Support for zero copies is an important objective, but it's not the only one. It has been observed that creating DDS participants is pretty resource-heavy in terms of net memory required (at least for FastRTPS & OpenSplice) and the discovery process is CPU-intensive (due to multicast).

I don't want to derail this PR as this is a totally tangential topic and needs more thought and planning. Should we move this discussion to a Discourse post?
The aim of this PR is to improve the current intra-process communication, which is only supported in …
As @raghaprasad says, I think that moving the intra-process manager to a different place from …

However, I think that once it has been finalized and implemented, there shouldn't be any major blockers preventing moving it to a common …

While working on this PR I also explored the idea of implementing the intra-process manager logic at the RMW layer (I have a very rough proof of concept for RMW DPS).
But it is possible to do the …

I feel that having intra-process only available in …
The ROS 2 architecture [1] depicts very explicitly that intra-process communication (along with type masquerading optimizations) belongs in the language-specific client library. Also, intra-process communication is merely an optimization, and not a "feature" per se. All the features like services, parameters, etc. are rightly implemented in …

[1] http://docs.ros2.org/dashing/developer_overview.html#internal-ros-interfaces
The proposal looks quite good to me.
I left some minimal comments. I will do a second read before jumping to the PR.
By setting the buffer type to `shared_ptr`, no copies are needed when the `Publisher` pushes messages into the buffers.
Eventually, the `Subscription`s will copy the data only when they are ready to process it.
On the other hand, if the published data are very small, it can be advantageous not to use C++ smart pointers, but to store the data directly in the buffers.
Does this improve performance? I don't see much benefit of this vs using a unique ptr. If memory allocation has to be avoided, an allocator can be passed.
I think it's fine to have this option too, but I'm just curious if there is another reason for this.
I think this only makes sense if you also avoid the necessity that all intra-process starts as a `unique_ptr`.
Once you've created or were given a unique pointer, you can either just deliver that (in the case of a single owning subscription) and/or make a single copy into a shared pointer until you're ready to deliver each (in the case of more subscriptions), and only when they're taken make another copy.
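The delivery policy described in this comment could be sketched as follows (hypothetical helpers, not rclcpp code): a single owning subscription receives the moved `unique_ptr` with zero copies, while with several subscriptions the message is promoted once to a `shared_ptr` that all of them read.

```cpp
#include <cstddef>
#include <memory>
#include <vector>

// Hypothetical sketch (not rclcpp code) of the delivery policy above.
struct Msg { int data = 0; };

// Single owning subscription: ownership is moved through, zero copies.
std::unique_ptr<Msg> deliver_to_single(std::unique_ptr<Msg> msg)
{
  return msg;  // the subscription now owns the original instance
}

// Several subscriptions: promote once to shared_ptr (no copy of the
// payload); all readers see the same instance. A subscription that
// needs its own mutable copy would copy only when it actually takes.
std::vector<std::shared_ptr<const Msg>> deliver_to_many(
  std::unique_ptr<Msg> msg, std::size_t n)
{
  std::shared_ptr<const Msg> shared = std::move(msg);  // one promotion
  return std::vector<std::shared_ptr<const Msg>>(n, shared);
}
```

The key point is that no payload copy happens until a taker explicitly asks for ownership.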
Some thoughts about moving intraprocess communication to rmw:
That's mostly "historical". I think that if we have a good rationale for implementing it at the rmw level, we should.
I think something like that could be possible, but quite complicated. I would like to see something mimicking Connext's Zero Copy Transfer Over Shared Memory semantics (by default Connext uses shared memory, but it doesn't use zero copy transfer, which has its own specific semantics). Basically, instead of creating a unique pointer and then publishing it:

```cpp
auto msg = std::make_unique<MSG_TYPE>();
/* Fill the message here */
publisher->publish(std::move(msg));
```

you ask the publisher for a piece of memory, fill it, and then publish:

```cpp
auto msg = publisher->new_message();
/* Fill the message here */
publisher->publish(std::move(msg));  // Move semantics, because the message will be undefined after calling publish.
```

But how we wrap the msg for this is an implementation detail. For DDS vendors that have implemented zero copy transport, this could just wrap it. I also think this idea will look idiomatic in other languages (for example, in Python), and performance should be quite similar.
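A toy model of that loan-style API (purely hypothetical names such as `LoaningPublisher` and `new_message()`, not any vendor's actual interface): the publisher hands out storage for the caller to fill and reclaims ownership on publish.

```cpp
#include <cstddef>
#include <memory>
#include <vector>

// Purely hypothetical sketch of a loan-style publisher: the publisher
// owns the storage lifecycle, hands out a slot, and reclaims it on publish.
struct Msg { int data = 0; };

class LoaningPublisher {
public:
  // Hand out a piece of memory for the caller to fill. A real zero-copy
  // implementation would return a slot from shared memory instead.
  std::unique_ptr<Msg> new_message() { return std::make_unique<Msg>(); }

  // Take the message back; after this call the caller no longer owns it.
  void publish(std::unique_ptr<Msg> msg)
  {
    published_.push_back(std::move(msg));
  }

  std::size_t published_count() const { return published_.size(); }

private:
  std::vector<std::unique_ptr<Msg>> published_;
};
```

Usage mirrors the comment above: `auto msg = pub.new_message(); msg->data = 42; pub.publish(std::move(msg));` and the caller's pointer is empty afterwards.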
I agree, I think this implementation will improve performance compared with the current intra-process implementation. We could later decide about moving intra-process comm to the …
@ivanpauno I agree with the above discussion that we should evaluate this PR for what it is (an improvement to the existing scenario of intra-process existing only in rclcpp), but I'm also very interested in making this feature more globally accessible. I was sold that one of ROS 2's goals was to stop reimplementing features for each language and to make language clients as thin as possible, so functionality like this should live in …

My question is: where can we start and maintain a focused discussion for this topic today, rather than just saying "we can decide later"? That way we can take this discussion off of this PR, but still know that we have actually begun to focus on the bigger-picture architectural discussion somewhere. Perhaps a new …
I fully agree about making the feature globally available. Actually, it's currently available in rclcpp only for publish/subscribe communication, but for example, not for services.
Maybe a PR/issue here, and post the link in Discourse too. As the topic is complex, if we want to move ahead with it fast, I think it would be good to organize a meeting to discuss the topic. I would be interested in participating.
I've just finished going over this again. Some of my comments may no longer apply after reading further in the document, but I tried to remove those that I could.
I'm going through the implementation PR simultaneously and will continue that in the next few days.
- If the history is set to `keep_last`, then the depth of the history corresponds to the size of the ring buffer.
On the other hand, if the history is set to `keep_all`, the buffer becomes a standard FIFO queue with an unbounded size.
- The reliability is only checked by the `IntraProcessManager` in order to understand whether a `Publisher` and a `Subscription` are compatible.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I mentioned this above, but this may also affect how a publisher should block when subscription buffers are full.
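The compatibility check mentioned in the excerpt could look roughly like this (an assumption-laden sketch, not the actual `IntraProcessManager` code); following DDS request-vs-offered semantics, the only incompatible pairing is a reliable `Subscription` with a best-effort `Publisher`.

```cpp
// Illustrative sketch only: models the reliability compatibility check
// described above, not the actual IntraProcessManager implementation.
struct QoSSketch {
  bool reliable;  // true = RELIABLE, false = BEST_EFFORT
};

// A reliable Subscription cannot be matched with a best-effort
// Publisher; every other combination is allowed.
bool intra_process_compatible(const QoSSketch & pub, const QoSSketch & sub)
{
  return pub.reliable || !sub.reliable;
}
```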
Signed-off-by: Soragna, Alberto <alberto.soragna@gmail.com>
GitHub does not allow me to reply to the last comment #239 (comment) ...
@alsora Now that ros2/rclcpp#778 is merged, is there anything we need to update here?
No, I think this is fine.
All right, thanks. @wjwwood @ivanpauno You two had the earlier comments and still some unresolved conversations. Are you satisfied with this article now, or do we still need to resolve the open conversations?
Should we merge this?
Merged. If there are any further comments, they can be addressed in a follow-up.
This PR adds a description of the current intra-process communication mechanism, together with a design proposal for improving its performance and supporting more QoS settings.
The new design is supported by experimental evaluations.
An initial discussion about this design can be found here.
The implementation of this design can be found here.
@dgoel @raghaprasad