Callback after cancel #2281

jmachowinski · 2023-08-21T11:30:08Z

Note: this commit fixes a bug. As a result, the current version of the tutorial
no longer works, because the code now behaves as documented.

rclcpp_action/include/rclcpp_action/client.hpp

alsora · 2024-04-01T18:44:52Z

Some more thoughts on this PR (also discussed in the last Client Library WG meeting).

I think that the expected behavior should be that the action client keeps the goals alive unless explicitly instructed to drop them.
As @wjwwood mentioned, it's ok that this behavior is different from what we do when creating ROS entities (e.g. if you create a subscription, you are responsible for keeping the subscription shared_ptr in scope)

This means that the documentation is wrong, but the demo code is correct.

Having to hold the future even if you don't do anything with it is a very strange user experience. std::future is a standard C++ class with some expected usage and behavior, and we shouldn't overload it with ROS-specific expectations.

I proposed to fix the bug by:

- Have the client class store a shared_ptr rather than a weak_ptr here: https://github.com/ros2/rclcpp/blob/rolling/rclcpp_action/include/rclcpp_action/client.hpp#L757
- Do not capture the goal_handle as a shared_ptr in the lambdas, but only as a weak_ptr

This should ensure proper destruction of things
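
The ownership scheme proposed above can be sketched with plain standard-library types. This is only an illustration: `GoalHandle`, `set_weak_result_callback` and the single `owner` pointer are hypothetical stand-ins, not the rclcpp_action classes. It assumes one strong owner (playing the role of the client) and a callback that captures the handle only as a weak_ptr.

```cpp
#include <cassert>
#include <functional>
#include <memory>

// Hypothetical stand-in for a client goal handle; not the rclcpp_action type.
struct GoalHandle {
  std::function<void()> result_callback;
  int results_seen = 0;
};

// Installs a result callback that captures the handle only weakly, so the
// callback stored inside the handle does not keep the handle itself alive.
inline void set_weak_result_callback(const std::shared_ptr<GoalHandle> & handle)
{
  std::weak_ptr<GoalHandle> weak = handle;
  handle->result_callback = [weak]() {
      // lock() yields nullptr once every strong owner has dropped the handle.
      if (auto locked = weak.lock()) {
        ++locked->results_seen;
      }
    };
}

// Returns true if the handle is destroyed as soon as its single owner drops it.
inline bool handle_is_released_when_owner_drops_it()
{
  auto owner = std::make_shared<GoalHandle>();  // plays the role of the client
  set_weak_result_callback(owner);
  owner->result_callback();
  assert(owner->results_seen == 1);

  std::weak_ptr<GoalHandle> observer = owner;
  owner.reset();  // the only strong reference goes away
  return observer.expired();
}
```

Because the lambda holds no strong reference, the handle's lifetime is decided by the owner alone, which is the property the proposal relies on.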

jmachowinski · 2024-04-02T11:03:16Z

> I think that the expected behavior should be that the action client keeps the goals alive unless explicitly instructed to drop them.

Note, this will break the code of users not setting the callback.

Just to make sure that we are all on the same page, the current behavior is:
If you give a callback to the result:

The goal will keep itself alive, as the shared_ptr is captured in the lambda for the result callback

If you don't set a result callback:

It works as documented, and the goal will terminate the second you drop the shared_ptr from the future
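
The self-reference in the result-callback case can be reproduced in isolation. The sketch below uses a hypothetical `GoalHandle` struct rather than the real rclcpp_action type, and assumes the callback is stored inside the handle it captures. It breaks the cycle again before returning so that the example itself does not leak.

```cpp
#include <cassert>
#include <functional>
#include <memory>

// Hypothetical stand-in for a client goal handle; not the rclcpp_action type.
struct GoalHandle {
  std::function<void()> result_callback;
};

struct CycleReport {
  long owners_with_callback;  // strong references while the callback is installed
  bool alive_after_drop;      // did the handle survive dropping the user's pointer?
};

inline CycleReport demonstrate_self_reference()
{
  auto handle = std::make_shared<GoalHandle>();
  // The lambda copies the shared_ptr and is stored inside the handle itself.
  handle->result_callback = [handle]() {};

  CycleReport report{};
  report.owners_with_callback = handle.use_count();

  std::weak_ptr<GoalHandle> observer = handle;
  handle.reset();  // the user "drops" the handle

  auto revived = observer.lock();  // still succeeds: the lambda owns the handle
  report.alive_after_drop = (revived != nullptr);

  // Break the cycle so this example does not leak.
  if (revived) {
    revived->result_callback = nullptr;
  }
  return report;
}
```

In this sketch `owners_with_callback` is 2 (the user's pointer plus the copy inside the lambda), and the handle is still alive after the user's pointer is reset.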

Whatever we decide, we will break the code of someone...

alsora · 2024-04-02T15:39:28Z

> Whatever we decide, we will break the code of someone...

This is true, that's why I think we should rather focus on what we think is the correct behavior.

Current rolling code:
- inconsistent behavior depending on whether you set a result callback or not described in Callback after cancel #2281 (comment)
- has the bug described in Action Client: Feeback callback called after cancel #2265
- does not respect the documentation
- the demo examples are ok
Proposal 1: require the users to keep the std::shared_future (and goal handle) in scope
- this is the current state of this PR f574591
- abuses the meaning of a std::future, which not only is used for waiting, but also to keep the goal alive
- fixes the bug
- respects the documentation
- breaks the demo examples (where the future is ignored)
- the [[nodiscard]] attribute can produce a warning (in most compilers) if users don't hold the future, but this can easily be ignored.
- breaks the code for users who ignore the warning and do not keep the future alive. Goals will be unexpectedly cancelled under the users' feet.
Proposal 2: have the action client automatically keep goals alive and use a dedicated API to drop them
- proposed in the Client Library WG meeting and summarized in Callback after cancel #2281 (comment)
- fixes the bug
- does not respect the documentation
- the demo examples are ok
- breaks the code for users that were relying on dropping the future / goal handle to terminate the goal. These users now need to use the dedicated API
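
The [[nodiscard]] point under Proposal 1 can be illustrated with a small sketch. `GoalToken` and `send_goal_sketch` are hypothetical names; the token simply marks a goal as cancelled when it is destroyed, which is an assumption standing in for "the goal ends when the returned object goes out of scope".

```cpp
#include <cassert>
#include <memory>

// Hypothetical token: the goal counts as cancelled once the token is destroyed.
struct GoalToken {
  explicit GoalToken(bool * cancelled) : cancelled_(cancelled) {}
  ~GoalToken() {*cancelled_ = true;}
  bool * cancelled_;
};

// Compilers typically warn when the return value of this call is ignored.
[[nodiscard]] inline std::shared_ptr<GoalToken> send_goal_sketch(bool * cancelled)
{
  *cancelled = false;
  return std::make_shared<GoalToken>(cancelled);
}

inline bool goal_cancelled_when_token_discarded()
{
  bool cancelled = false;
  // The cast silences the diagnostic; the temporary token dies at the semicolon.
  static_cast<void>(send_goal_sketch(&cancelled));
  return cancelled;
}

inline bool goal_cancelled_while_token_held()
{
  bool cancelled = false;
  auto token = send_goal_sketch(&cancelled);
  assert(token != nullptr);
  return cancelled;  // evaluated while the token is still in scope
}
```

The warning is only advisory: a cast to void (or a compiler flag) suppresses it, and the goal is then cancelled immediately, which is the failure mode described in the list above.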

I think that this is a good summary of the current state and the two proposals.

My vote is for "proposal 2"

@clalancette @fujitatomoya @wjwwood @mjcarroll

mjcarroll · 2024-04-02T16:03:03Z

I think @ros2/team is worth pinging here.

jmachowinski · 2024-04-02T16:27:45Z

I vote for 1, as users at least get a compile warning.

Additionally, I would propose that the async_send_goal method return the goal handle directly, as this would also give us a way to drop the goal before it is accepted. The lack of such a way is currently a second design flaw in the API.

clalancette · 2024-04-03T19:29:43Z

> > I think that the expected behavior should be that the action client keeps the goals alive unless explicitly instructed to drop them.

> Note, this will break the code of users not setting the callback.

Can you explain more of your thinking around this?

Today, if users are not setting the result callback, then either:

1. They are holding onto the std::future, and thus keeping it alive, or
2. They are forgetting to hold onto the handle, and getting UB (though they may get lucky and this may work by accident)

If we change it so that the action client keeps the goals alive, then in case 1., they are unnecessarily holding the std::future, and may delay it from being destroyed, but things should still work. We will fix case 2 for them, in that we'll hold onto the handle. In both cases, I don't think we make things worse for users who are not setting the result callback.

But it is entirely possible I'm missing something here.

jmachowinski · 2024-04-03T19:43:10Z

> Can you explain more of your thinking around this?

Users that don't set the callback might rely on the fact, that the goal gets canceled the second they drop the handle.
If we change this to proposal 2, their code will silently break, as goals do not terminate any more after the handle gets dropped.

clalancette · 2024-04-03T20:19:04Z

> Users that don't set the callback might rely on the fact, that the goal gets canceled the second they drop the handle.
> If we change this to proposal 2, their code will silently break, as goals do not terminate any more after the handle gets dropped.

Ah, right. Thanks.

So I have a proposal, and then an opinion.

The proposal is option 3, where we add a completely new API for sending goals (async_send_goal2, for lack of a better idea). This API would implement behavior 2 as specified by @alsora, i.e. it would hold onto the handle and have a dedicated API to drop it. Then we leave async_send_goal as it is, with a deprecation warning. In that case, we would have a viable path for telling users that we are going to break their semantics, along with a "new" behavior that users have to explicitly opt into. It is entirely possible that it is difficult to implement both of these behaviors internally, but I think it at least deserves consideration.

My opinion is this: I like option 3 the best, as it is clear to users that changes to these APIs are coming. If we don't want to do option 3 today, then I think we should go for option 1, as it is the least surprising for users. In the future we can always decide to go for option 3 if we think that is better.

Thoughts?

jmachowinski · 2024-04-04T07:16:43Z

I don't think we can implement option 3 before the feature freeze next week. I would also lean toward a whole API redesign, introducing two new client classes that solve the common 'I called wait on the future and deadlocked' issue.

ros-discourse · 2024-04-04T20:20:30Z

This pull request has been mentioned on ROS Discourse. There might be relevant details there:

https://discourse.ros.org/t/client-library-wg-meeting-april-5-2024/36998/1

fujitatomoya · 2024-04-05T05:06:11Z

Edit: Sorry my vote is proposal-1.

I second proposal-1. (from #2281 (comment)) I am okay with current implementation.

(Note: I do not think I can make it to tomorrow's meeting; it depends on traffic.)

fujitatomoya

still under discussion but lgtm with green CI atm.

jmachowinski · 2024-04-05T16:10:51Z

Summary of the discussion:
We'll add

async_send_goal_unowned()

This version will work as described in the documentation.
We will keep async_send_goal with the current broken behavior, but update the documentation.
Open for discussion: will we add a [[deprecated("Deprecated, as the behavior bla bla bla")]] attribute prior to jazzy?

My vote is for adding the deprecation prior to jazzy, as users might then recognize a bug they were experiencing.
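
For reference, the standard C++14 attribute takes its message in parentheses. A minimal sketch of keeping an old entry point next to a new one follows; both function names and the message text are hypothetical placeholders, not the rclcpp_action API.

```cpp
#include <cassert>
#include <string>

// Hypothetical old entry point, kept for compatibility but flagged at compile time.
[[deprecated("hypothetical message: ownership semantics of this call will change")]]
inline std::string send_goal_old_sketch()
{
  return "old";
}

// Hypothetical replacement that callers opt into explicitly.
inline std::string send_goal_unowned_sketch()
{
  return "unowned";
}

inline void check_new_entry_point()
{
  // Calling send_goal_old_sketch() here would compile, but with a deprecation warning.
  assert(send_goal_unowned_sketch() == "unowned");
}
```

Existing callers keep compiling and see the message as a warning, which is what would let users notice the behavior they might have been bitten by.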

fujitatomoya · 2024-04-05T16:25:04Z

> Open for discussion: will we add a [[deprecated("Deprecated, as the behavior bla bla bla")]] attribute prior to jazzy?

Just checking: so the decision is to keep async_send_goal for now with its broken behavior, but eventually remove it and replace it with async_send_goal_unowned?
Or do we keep both of them?

alsora · 2024-04-05T18:27:27Z

> Or do we keep both of them?

Jazzy will have both of them.
The future is less set in stone =)

jmachowinski · 2024-04-08T10:21:45Z

While implementing the proposed solution, I rechecked the code and found that the problem is different from what I described earlier in this thread.

The current behavior of the action client library is :

async_send_goal

Sends a goal that is fire & forget. It will be executed regardless of whether you hold the handle.
The given handle only controls whether you will receive callbacks.
The need to keep the handle is inconsistent:
- If you only set the feedback callback, you will need to hold the handle, or you will receive no feedback.
- As soon as you set the result callback the handle is self referencing.

Using the current implementation of async_send_goal will create a memory leak, as the goal handle will not be deleted.

This can be patched for nominal cases, as we could delete the result callback after the result callback was called.
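
The patch for the nominal case can be sketched as follows. `GoalHandle` and `deliver_result` are hypothetical names, not the rclcpp_action implementation. The sketch assumes the result callback is invoked at most once and is removed as part of delivering the result.

```cpp
#include <cassert>
#include <functional>
#include <memory>
#include <utility>

// Hypothetical stand-in for a client goal handle; not the rclcpp_action type.
struct GoalHandle {
  std::function<void()> result_callback;
};

// Runs the result callback once and removes it from the handle afterwards.
inline void deliver_result(GoalHandle & handle)
{
  // Move the callable out first so it is not destroyed while it is running.
  auto callback = std::move(handle.result_callback);
  handle.result_callback = nullptr;
  if (callback) {
    callback();
  }
  // `callback` is destroyed here, releasing whatever it captured.
}

inline bool handle_freed_after_result()
{
  auto handle = std::make_shared<GoalHandle>();
  handle->result_callback = [handle]() {};  // self reference
  std::weak_ptr<GoalHandle> observer = handle;

  deliver_result(*handle);
  assert(handle.use_count() == 1);  // the lambda's copy is gone

  handle.reset();
  return observer.expired();
}
```

If no result is ever delivered, the callback is never cleared and the cycle remains, so this approach only covers goals that actually receive a result.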

I am undecided on what to do with the async_send_goal function: should we still migrate to a new "owned" version?

I would propose adding a new function

void stop_callbacks(typename GoalHandle::SharedPtr goal_handle)

instead of

void drop_goal_handle(typename GoalHandle::SharedPtr goal_handle)

I would also update the documentation.

Opinions? @clalancette @fujitatomoya @alsora @mjcarroll @wjwwood

jmachowinski · 2024-04-12T16:24:15Z

@alsora updated docs

mjcarroll · 2024-04-12T16:28:21Z

Linux
Linux-aarch64
Windows

alsora · 2024-04-12T16:37:42Z

Thank you!
Looks good to me with green CI

This function allows us to drop the handle in a locked context. If we do not do this within a lock, there will be a race condition between the deletion of the shared_ptr of the handle and the result / feedback callbacks. Signed-off-by: Janosch Machowinski <J.Machowinski@cellumation.com>

This fixes deadlocks due to release of goal handles in callbacks etc. Signed-off-by: Janosch Machowinski <j.machowinski@cellumation.com>
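
The two commit messages above describe dropping handles under a lock and making that lock re-entrant. A minimal sketch of the idea, under stated assumptions: `GoalRegistry` and its methods are hypothetical and only show why a std::recursive_mutex lets a callback that runs under the lock release a handle without deadlocking. It is not the actual client code.

```cpp
#include <cassert>
#include <cstddef>
#include <functional>
#include <map>
#include <memory>
#include <mutex>

// Hypothetical stand-in for a client goal handle; not the rclcpp_action type.
struct GoalHandle {
  int id;
};

class GoalRegistry {
public:
  void add(int id)
  {
    std::lock_guard<std::recursive_mutex> lock(mutex_);
    goals_[id] = std::make_shared<GoalHandle>(GoalHandle{id});
  }

  // Drops the handle while holding the lock, so it cannot race with dispatch().
  void drop(int id)
  {
    std::lock_guard<std::recursive_mutex> lock(mutex_);
    goals_.erase(id);
  }

  // Invokes the callback under the lock; the callback may call drop() re-entrantly.
  void dispatch(int id, const std::function<void(GoalRegistry &, int)> & callback)
  {
    std::lock_guard<std::recursive_mutex> lock(mutex_);
    auto it = goals_.find(id);
    if (it == goals_.end()) {
      return;
    }
    auto keep_alive = it->second;  // the handle outlives a drop() in the callback
    callback(*this, keep_alive->id);
  }

  std::size_t size()
  {
    std::lock_guard<std::recursive_mutex> lock(mutex_);
    return goals_.size();
  }

private:
  std::recursive_mutex mutex_;
  std::map<int, std::shared_ptr<GoalHandle>> goals_;
};

inline std::size_t goals_left_after_reentrant_drop()
{
  GoalRegistry registry;
  registry.add(1);
  registry.add(2);
  // With a plain std::mutex, this re-entrant drop() would deadlock.
  registry.dispatch(1, [](GoalRegistry & r, int id) {r.drop(id);});
  return registry.size();
}
```

With a non-recursive mutex the same thread would try to lock twice, which is undefined behavior and in practice usually a deadlock.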

This fixes a memory leak due to a self reference in the ClientGoalHandle. Note, this fix will only work, if the ClientGoalHandle ever receives a result callback. Signed-off-by: Janosch Machowinski <j.machowinski@cellumation.com>

Signed-off-by: Janosch Machowinski <j.machowinski@cellumation.com>

Signed-off-by: Janosch Machowinski <J.Machowinski@cellumation.com>

jmachowinski · 2024-04-13T13:11:00Z

rebased to rolling.

CI seems to hang, can someone restart it ?

wjwwood · 2024-04-14T08:12:21Z

I don't see where it's hung, but I restarted CI:

Linux
Linux-aarch64
Windows

jmachowinski · 2024-04-15T14:21:25Z

Ready for merge ?

fujitatomoya

@jmachowinski lgtm anyway, but one question for doc section.

rclcpp_action/include/rclcpp_action/client.hpp

jmachowinski requested review from ivanpauno, hidmic and wjwwood as code owners August 21, 2023 11:30

fujitatomoya reviewed Aug 21, 2023

fujitatomoya self-assigned this Aug 25, 2023

fujitatomoya reviewed Aug 28, 2023

CursedRock17 reviewed Aug 29, 2023

jmachowinski force-pushed the callback_after_cancel branch from b70093d to 061201a Compare September 5, 2023 15:53

jmachowinski mentioned this pull request Sep 6, 2023

!fix(Client): Do not hold goal_handle ptr if result cb is set. #2296

Closed

jmachowinski force-pushed the callback_after_cancel branch 2 times, most recently from 8e4ec02 to 371bf4e Compare February 22, 2024 08:55

jmachowinski requested a review from fujitatomoya February 22, 2024 09:58

jmachowinski force-pushed the callback_after_cancel branch from 371bf4e to f574591 Compare March 25, 2024 13:41

jmachowinski mentioned this pull request Mar 25, 2024

fix(action_tutorials_cpp): Do not drop future returned by async_send_… ros2/demos#649

Closed

fujitatomoya approved these changes Apr 5, 2024

jmachowinski force-pushed the callback_after_cancel branch 3 times, most recently from d419b35 to f366e26 Compare April 12, 2024 16:15

alsora approved these changes Apr 12, 2024

jmachowinski force-pushed the callback_after_cancel branch from f366e26 to b475151 Compare April 12, 2024 17:12

Janosch Machowinski and others added 5 commits April 13, 2024 13:13

fix: make Client goal handle recursive

27e8927

This fixes deadlocks due to release of goal handles in callbacks etc. Signed-off-by: Janosch Machowinski <j.machowinski@cellumation.com>

fix(ActionGoalClient): Fixed memory leak for nominal case

0fcec47

This fixes a memory leak due to a self reference in the ClientGoalHandle. Note, this fix will only work, if the ClientGoalHandle ever receives a result callback. Signed-off-by: Janosch Machowinski <j.machowinski@cellumation.com>

doc: Updated documentation of rclcpp_action::Client::async_send_goal

9dfd8be

Signed-off-by: Janosch Machowinski <j.machowinski@cellumation.com>

docs: Made the async_send_goal documentation more explicit

60b5b1a

Signed-off-by: Janosch Machowinski <J.Machowinski@cellumation.com>

jmachowinski force-pushed the callback_after_cancel branch from b475151 to 60b5b1a Compare April 13, 2024 11:14

jmachowinski pushed a commit to jmachowinski/build_desc that referenced this pull request Apr 13, 2024

PR ros2/rclcpp#2281

de082cd

alsora approved these changes Apr 15, 2024

mjcarroll approved these changes Apr 15, 2024

mjcarroll merged commit 839348c into ros2:rolling Apr 15, 2024
3 checks passed

fujitatomoya reviewed Apr 15, 2024
