Fix JTC from immediately returning success #565

MarqRazz · 2023-04-07T20:17:51Z

When using the JTC action server we have found a race condition where it can return SUCCESS as soon as the trajectory goal is accepted. I have confirmed that after the first goal is given to the controller the update() method maintains access to the last traj_point_active_ptr_ causing it continuously command the hardware to the end of the last goal's trajectory while also setting:

auto res = std::make_shared<FollowJTrajAction::Result>();
res->set__error_code(FollowJTrajAction::Result::SUCCESSFUL);
active_goal->setSucceeded(res);

If a new goal is received while processing update() there is a chance for it to immediately return SUCCESS but continue to execute the new goal.

This PR removes the ability for the update() method to command the arm when it does not have an active goal by setting traj_point_active_ptr_ to a nullptr after it has completed/aborted. When a new goal is received it is copied over to the traj_point_active_ptr_ allowing it to execute again.

Background on our problem:

This causes problems with one of our pipeline where additional motion planning is staged up after each trajectory is completed. When the goal returns and the arm is still in motion the next plan grabs the start_state from the robot while it's moving and then generates and sends the new plan for execution. The controller happly accepts the new goal which will cause the arm to quickly jump back to where it grabbed the start_state and then execute the new trajectory. This jump in the joint commands frequently causes our hardware to throw a fault. Below is a timeline of the happening on our hardware along with a graph of the joint_commands.

mechwiz · 2023-04-10T01:47:35Z

I think I noticed a similar if not the same issue a while back and made a PR that handles the issue here - #410. However your implementation may be better. Will review

MarqRazz · 2023-04-10T14:03:27Z

I think I noticed a similar if not the same issue a while back and made a PR that handles the issue here - #410. However your implementation may be better. Will review

I'm not sure if your PR will fix the whole issue. I think it might keep the second trajectory from being executed but will still immediately return SUCCESS for the action result if this section of code is being executed when a new goal is received.

mechwiz · 2023-04-10T19:10:13Z

I would agree with you if the active-goal is queried within that section of code. However, my PR specifically removes that query (auto active_goal = *rt_active_goal_.readFromRT();) from occurring in that section of code for the reason you are describing and instead does it earlier when checking when a new external message has been received which, in my opinion, is where that query should exist.

MarqRazz · 2023-04-10T19:57:50Z

I'm open to either fix. If you have changes you would like made here or we can continue to work on your PR just let me know.

I do think that this section of code should not be executed unless we have an active goal though which means it should be checked in your if statement...

if (active_goal && traj_point_active_ptr_ && (*traj_point_active_ptr_)->has_trajectory_msg() && !mismatch )

mechwiz · 2023-04-14T22:53:26Z

I worked through the logic between the 2 PRs and I think I prefer your implementation since yours will definitely ensure that that section of code won't execute after the previous goal finished until a new goal has successfully been obtained externally. (I think mine solves the issue in a less elegant way). I would +1 but I don't have any approval rights

MarqRazz · 2023-05-01T14:29:33Z

Any feedback from the ros2_control maintainer team?

bmagyar

Thanks for the insightful fix! The double accounting we do with the pointer Vs the buffer is the root of the issue which we may be able to refactor so we can get rid of this... Or maybe not! For now I'm happy with the change

moriarty · 2023-05-02T17:14:58Z

I believe this PR is failing on Rolling because some of the github ci jobs do pull in current dependencies to the upstream_workspace and a change to control_msgs has renamed some msg fields ros-controls/control_msgs#86

bmagyar · 2023-05-02T17:40:00Z

Aye I'm aware of those, that's why we have all these variants for the CI ;) eventually pending PRs on the releases will be merged and synced

MarqRazz · 2023-05-02T20:06:40Z

Do we need to do anything special to get this back ported to humble?

bmagyar · 2023-05-02T20:13:22Z

Nothing hopefully, but to remind me which you've already done!

bmagyar · 2023-05-02T20:13:37Z

@Mergifyio backport humble

mergify · 2023-05-02T20:13:43Z

backport humble

✅ Backports have been created

#592 Fix JTC from immediately returning success (backport #565) has been created for branch humble

Co-authored-by: Bence Magyar <bence.magyar.robotics@gmail.com> (cherry picked from commit 634e6fe)

egordon · 2023-06-14T23:26:57Z

Unfortunately, this has introduced a bug on our end (realized with 2.20.0-1jammy.20230522.072811, but just noticed this week) with the use of a velocity hardware interface.

If we are within the goal tolerance (but not yet commanding identically 0 velocity because we aren't exactly at the goal), the controller stops commanding the hardware, and this means that the hardware interface is left with a non-0 velocity at the end of the trajectory, causing the robot to drift from the goal position over time.

I've confirmed that the bug is fixed by just commenting out L336 in the PR (L339 in the current humble build). I'll make a bug report.

This reverts commit 634e6fe.

Fix JTC from immediately returning success

30aa978

github-actions bot requested review from bmagyar, destogl, mcbed, peterdavidfagan, progtologist, rosterloh and Serafadam April 7, 2023 20:18

DatSpace added a commit to vtikha/ros2_controllers that referenced this pull request Apr 26, 2023

Fixe race condition from ros-controls#565

1e8e2db

bmagyar mentioned this pull request May 1, 2023

[JTC] Fix race condition & interpolation bug #410

Closed

bmagyar approved these changes May 1, 2023

View reviewed changes

Merge branch 'master' into pr-fix_immediate_jtc_return

4cac618

bmagyar merged commit 634e6fe into ros-controls:master May 2, 2023
9 of 12 checks passed

mergify bot pushed a commit that referenced this pull request May 2, 2023

Fix JTC from immediately returning success (#565)

663d9b6

Co-authored-by: Bence Magyar <bence.magyar.robotics@gmail.com> (cherry picked from commit 634e6fe)

mergify bot mentioned this pull request May 2, 2023

Fix JTC from immediately returning success (backport #565) #592

Merged

bmagyar pushed a commit that referenced this pull request May 2, 2023

Fix JTC from immediately returning success (#565) (#592)

2f54606

MarqRazz deleted the pr-fix_immediate_jtc_return branch May 2, 2023 21:17

moriarty mentioned this pull request May 3, 2023

Cartesian twist controller #300

Open

destogl mentioned this pull request May 16, 2023

[Parameters] Avoid deprecation warnings." #616

Merged

schornakj mentioned this pull request May 26, 2023

Apply fix for trajectory execution race condition to ScaledJointTrajectoryController UniversalRobots/Universal_Robots_ROS2_Driver#697

Closed

panagelak mentioned this pull request Jun 5, 2023

Fix SJTC from immediately returning success (#697) UniversalRobots/Universal_Robots_ROS2_Driver#710

Closed

egordon mentioned this pull request Jun 14, 2023

[JTC] Non-0 velocity commanded after closed-loop execution #671

Closed

mechwiz pushed a commit to mechwiz/ros2_controllers that referenced this pull request Jul 3, 2023

Revert "Fix JTC from immediately returning success (ros-controls#565)"

3edfe5e

This reverts commit 634e6fe.

egordon mentioned this pull request Jul 7, 2023

[JTC] Segmentation fault with action tests #688

Closed

christophfroehlich mentioned this pull request Jul 18, 2023

[JTC] Explicitly set hold position #558

Merged

12 tasks

mechwiz pushed a commit to mechwiz/ros2_controllers that referenced this pull request Aug 3, 2023

Revert "Fix JTC from immediately returning success (ros-controls#565)"

ac45c7a

This reverts commit 634e6fe.

mechwiz pushed a commit to mechwiz/ros2_controllers that referenced this pull request Aug 8, 2023

Revert "Fix JTC from immediately returning success (ros-controls#565)"

4b72d12

This reverts commit 634e6fe.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix JTC from immediately returning success #565

Fix JTC from immediately returning success #565

MarqRazz commented Apr 7, 2023

mechwiz commented Apr 10, 2023

MarqRazz commented Apr 10, 2023

mechwiz commented Apr 10, 2023

MarqRazz commented Apr 10, 2023

mechwiz commented Apr 14, 2023 •

edited

MarqRazz commented May 1, 2023

bmagyar left a comment

moriarty commented May 2, 2023

bmagyar commented May 2, 2023

MarqRazz commented May 2, 2023

bmagyar commented May 2, 2023

bmagyar commented May 2, 2023

mergify bot commented May 2, 2023 •

edited

egordon commented Jun 14, 2023

Fix JTC from immediately returning success #565

Fix JTC from immediately returning success #565

Conversation

MarqRazz commented Apr 7, 2023

Background on our problem:

mechwiz commented Apr 10, 2023

MarqRazz commented Apr 10, 2023

mechwiz commented Apr 10, 2023

MarqRazz commented Apr 10, 2023

mechwiz commented Apr 14, 2023 • edited

MarqRazz commented May 1, 2023

bmagyar left a comment

Choose a reason for hiding this comment

moriarty commented May 2, 2023

bmagyar commented May 2, 2023

MarqRazz commented May 2, 2023

bmagyar commented May 2, 2023

bmagyar commented May 2, 2023

mergify bot commented May 2, 2023 • edited

✅ Backports have been created

egordon commented Jun 14, 2023

mechwiz commented Apr 14, 2023 •

edited

mergify bot commented May 2, 2023 •

edited