fix the issue (#198) that loop closures can rotate the map 180° if the lidar is mounted backwards #326

WLwind · 2021-02-03T10:42:44Z

Set the edges (constraints) based on the robot base frame (base_footprint) instead of sensor frame (laser).

Basic Info

Info	Please fill out this column
Ticket(s) this addresses	(#198 )
Primary OS tested on	(Ubuntu 20.04 ROS Noetic and Ubuntu 18.04 ROS Melodic)
Robotic platform tested on	(Handsfree mini with RPLIDAR A2 mounted backwards)

Description of contribution in a few bullet points

The reason of weird results of loop closures when lidars are mounted backwards is that the vertices and the edges of the solver are not based on the same frame. The original code sets the pose of the base link (base_footprint) as the vertex pose, but the edges are constraints of 2 poses of sensor frame (laser). That's really unreasonable because the tf between base poses and the tf between their corresponding sensor poses are not the same! The differences are significantly big especially when the sensor is at the opposite orientation of the robot base. So this uncommon loop closure will always happen (more or less) unless the sensor frame is at the exact pose of the base. This is regardless of whether the lidar is 360° or whether it's an RPLIDAR.

I made the edges and vertices based on the same frame. I simply set the edges (constraints) based on the robot base frame (base_footprint) instead of sensor frame (laser).
It works well with my backwards mounted RPLIDAR A2.

Description of documentation updates required from your changes

Future work that may be required in bullet points

I don't have a ROS2 robot yet. Is anyone there be kind to test this modification on a ROS2 robot?

…ap 180° if the lidar is mounted backwards Set the edges (constraints) based on the robot base frame (base_footprint) instead of sensor frame (laser).

SteveMacenski · 2021-02-03T17:19:16Z

So by backwards, you do not mean up-side down, correct? By the way, just going to chat on this PR since all the others are the same, I'll just merge them all at once but use this as the proxy.

#198 (comment) <-- Can you show that you can work on this dataset and have it work? They give a pretty good backwards-lidar dataset here. We resolved this by changing a loss function, so please make sure you don't use that one for your testing to show that it also fixes the issue (and does so more reliably).

So I think we're getting more to the root of all these rplidar problems then. It sounds like there were CPU (fixed), backwards (or rather, non-directionally aligned), and 360 param read in wrong (fixed). There's also this "flipping" problem only 360 lidar users report (#281) that I also wonder if this would fix if this indeed is a functional solution... (also makes me wonder if this was my issue with the multi-lidar support testing... mhm...). It would be mighty convenient if this fixed that too. I've never been able to replicate on professional 270 lidars so its been hard for me to be able to spend the time to fix it. Something along these lines could be the root cause or at least a new way of thinking I can use to see if I can spot other locations in the Karto code this is wrong.

I'll review in a moment. I super appreciate you digging into this code. It's been hard to fix all these issues without someone else to bounce ideas off of that's dug into the core SLAM code. 👏

slam_toolbox/lib/karto_sdk/src/Mapper.cpp

WLwind · 2021-02-04T16:11:35Z

So by backwards, you do not mean up-side down, correct? By the way, just going to chat on this PR since all the others are the same, I'll just merge them all at once but use this as the proxy.

#198 (comment) <-- Can you show that you can work on this dataset and have it work? They give a pretty good backwards-lidar dataset here. We resolved this by changing a loss function, so please make sure you don't use that one for your testing to show that it also fixes the issue (and does so more reliably).

So I think we're getting more to the root of all these rplidar problems then. It sounds like there were CPU (fixed), backwards (or rather, non-directionally aligned), and 360 param read in wrong (fixed). There's also this "flipping" problem only 360 lidar users report (#281) that I also wonder if this would fix if this indeed is a functional solution... (also makes me wonder if this was my issue with the multi-lidar support testing... mhm...). It would be mighty convenient if this fixed that too. I've never been able to replicate on professional 270 lidars so its been hard for me to be able to spend the time to fix it. Something along these lines could be the root cause or at least a new way of thinking I can use to see if I can spot other locations in the Karto code this is wrong.

I'll review in a moment. I super appreciate you digging into this code. It's been hard to fix all these issues without someone else to bounce ideas off of that's dug into the core SLAM code. clap

I've recorded a video of mapping process using #198 dataset.
https://user-images.githubusercontent.com/46443331/106920506-1ef9ef80-6746-11eb-9145-7b2a9c26ff76.mp4
It works pretty well with 18 loop closures.

SteveMacenski · 2021-02-04T17:27:14Z

Thanks for the verification. @coderkarl also verified which is good to have an external thumbs up. I agree this is an issue and now I agree that your solution is workable. Now we're just on the details of making sure this is the right place to put it / if there's a better way to accomplish the same task.

WLwind · 2021-02-05T15:41:40Z

When I try localization mode with new code, every time loop closure comes, there comes a warning of ceres CeresSolver: Ceres could not find a usable solution to optimize.. So I think there is something I didn't consider.

SteveMacenski · 2021-02-05T17:53:38Z

The only difference I can tell is that old L1611 SetCorrectedPose() sets m_CorrectedPose but did not call Update() so from your pose-swapping, the values might be different? Also in SetCorrectedPose() it sets m_IsDirty = true but I don't think that should cause something. Both of those would serialize though for a localization session.

Did you check the inputs / outputs of the new function to make sure its working properly? If mapping works OK and just localization, I feel like it does, but just checking. I highly doubt its the issue, but also try removing inline. Maybe after serialization that's making something unhappy.

WLwind · 2021-02-05T18:40:30Z

The only difference I can tell is that old L1611 SetCorrectedPose() sets m_CorrectedPose but did not call Update() so from your pose-swapping, the values might be different? Also in SetCorrectedPose() it sets m_IsDirty = true but I don't think that should cause something. Both of those would serialize though for a localization session.

Did you check the inputs / outputs of the new function to make sure its working properly? If mapping works OK and just localization, I feel like it does, but just checking. I highly doubt its the issue, but also try removing inline. Maybe after serialization that's making something unhappy.

I mean b0800ed, not just 5cb6690.
Both of them show warning CeresSolver: Ceres could not find a usable solution to optimize. in localization mode. But 5cb6690 shows more:
W0206 02:13:16.236030 10238 parameter_block.h:349] Local parameterization Jacobian computation returnedan invalid matrix for x: -nan Jacobian matrix : -nan
I think the problem is from deserialization. Because when I tried to make that "1 line change" to make sensor poses as ceres nodes, the deserialization process threw an exception terminate called after throwing an instance of 'karto::Exception'. For this reason I switched to another way, making constraints between 2 base poses. Although it doesn't threw exceptions during deserialization now, I still think that there might be some faults.

SteveMacenski · 2021-02-05T23:03:01Z

To repeat that back to you, you think this change makes the serialized data wrong so things are failing? that would make sense, we're messing with things at a somewhat low level. ABI breaking changes like this might impact any data you've previously recorded, but its necessary to fix the bug.

What happens if you run it over a newly generated serialized file done with the code in this PR after the change?

WLwind · 2021-02-07T04:46:22Z

Firstly I tried to find the problem with deserialization when I use "1 line change" solution, and (after adding lots of print function) I found that the problem is from addNode. The function need to GetCorrectedPose and it calls GetLaserRangeFinder. But I don't think the LaserRangeFinder is correctly set before deserialization in localization mode. So I think that is what makes the program throw the exception.
Let's go back to "all base pose" strategy. I've carefully compared the differences between 5cb6690 and the original code 44c9f84 and the main difference is in MapperGraph::CorrectPoses(). When it calls SetSensorPose it does an Update() after setting the pose, but SetCorrectedPose doesn't do that updating. I added the update and test with the dataset with mapping and localization mode. There was nothing new with mapping (18 loop closures). The localization mode made 9 loop closures but only 1 popped the warning CeresSolver: Ceres could not find a usable solution to optimize.. I can clearly see some loop closures optimizing the "walls", so I think this new commit solves the problem with localization mode mostly. It's workable now.

SteveMacenski · 2021-02-08T20:28:03Z

I don't understand why calling Update would change Ceres's ability to find a solution. Update is messing with the point values and resetting m_isDirty. It seems like is dirty is the major issue but every time we try to access the dirty info, there's a check and update is called anyhow in the object

slam_toolbox/slam_toolbox/lib/karto_sdk/include/karto_sdk/Karto.h

Line 5503 in 171f282

class LocalizedRangeScan : public LaserRangeScan

Does CeresSolver: Ceres could not find a usable solution to optimize. pop up when using a new serialized file from this current set of code? I suspect that it now has nothing to do with Update() if it happens on occasion and more do with this change pEdge->SetLabel(new LinkInfo(pFromScan->GetCorrectedPose(), pToScan->GetCorrectedAt(rMean), rCovariance)); since the graph now would have data in it assuming rMean and then new data that is pToScan->GetCorrectedAt(rMean). If it goes away from new serialized data, then I think we're good. If it still appears, that's still a problem we need to solve.

A single small dataset of 9 loop closures having it appear once might not be the worst-case situation. I don't think it was happening at all prior. If we have new serialized files with the new code, then it should be acting at least as good as before in localization mode if we did everything correctly, yes?

SteveMacenski · 2021-02-16T19:01:10Z

@WLwind any update? Just trying to iterate on the best solution to this issue so that it doesn't come back and bite you

WLwind · 2021-02-17T07:54:58Z

@WLwind any update? Just trying to iterate on the best solution to this issue so that it doesn't come back and bite you

I'm busy these days and have little time to consider about this issue, sorry.

SteveMacenski · 2021-03-10T00:50:43Z

Closed the other 2 PRs since we're still working on getting this one passing completely

WLwind · 2021-03-24T02:26:04Z

Thank you for testing! I really don't have time to test it these weeks.

SteveMacenski · 2021-03-24T02:29:13Z

Thanks for identifying this issue so we can finally put this issue to bed!

fix the issue (SteveMacenski#198) that loop closures can rotate the m…

b0800ed

…ap 180° if the lidar is mounted backwards Set the edges (constraints) based on the robot base frame (base_footprint) instead of sensor frame (laser).

SteveMacenski requested changes Feb 3, 2021

View reviewed changes

slam_toolbox/lib/karto_sdk/src/Mapper.cpp Show resolved Hide resolved

slam_toolbox/lib/karto_sdk/src/Mapper.cpp Outdated Show resolved Hide resolved

slam_toolbox/lib/karto_sdk/src/Mapper.cpp Outdated Show resolved Hide resolved

SteveMacenski mentioned this pull request Feb 3, 2021

Map temporarily rotates or flips [rplidar] #281

Closed

linting things

23902e3

add function GetCorrectedAt to simplify getting robot pose

5cb6690

add update() when setting corrected pose after optimizing

171f282

SteveMacenski mentioned this pull request Mar 4, 2021

Release updated Noetic/Dashing/Foxy binaries #350

Closed

5 tasks

SteveMacenski linked an issue Mar 12, 2021 that may be closed by this pull request

[rplidar users] Potential errors with drivers + 360 lidar support #198

Closed

This was referenced Mar 12, 2021

fixing 360 lidar issue for ros2 branch #360

Merged

fixing 360 lidar issue for dashing branch #361

Merged

fixing 360 lidar issue for foxy branch #362

Merged

SteveMacenski merged commit cf99c83 into SteveMacenski:melodic-devel Mar 23, 2021

PGotzmann mentioned this pull request May 31, 2021

Irregular "Ceres could not find a usable solution" warnings in localization mode #398

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix the issue (#198) that loop closures can rotate the map 180° if the lidar is mounted backwards #326

fix the issue (#198) that loop closures can rotate the map 180° if the lidar is mounted backwards #326

WLwind commented Feb 3, 2021

SteveMacenski commented Feb 3, 2021 •

edited

Loading

WLwind commented Feb 4, 2021

SteveMacenski commented Feb 4, 2021

WLwind commented Feb 5, 2021

SteveMacenski commented Feb 5, 2021 •

edited

Loading

WLwind commented Feb 5, 2021

SteveMacenski commented Feb 5, 2021 •

edited

Loading

WLwind commented Feb 7, 2021

SteveMacenski commented Feb 8, 2021

SteveMacenski commented Feb 16, 2021 •

edited

Loading

WLwind commented Feb 17, 2021 •

edited

Loading

SteveMacenski commented Mar 10, 2021

WLwind commented Mar 24, 2021

SteveMacenski commented Mar 24, 2021

fix the issue (#198) that loop closures can rotate the map 180° if the lidar is mounted backwards #326

fix the issue (#198) that loop closures can rotate the map 180° if the lidar is mounted backwards #326

Conversation

WLwind commented Feb 3, 2021

Basic Info

Description of contribution in a few bullet points

Description of documentation updates required from your changes

Future work that may be required in bullet points

SteveMacenski commented Feb 3, 2021 • edited Loading

WLwind commented Feb 4, 2021

SteveMacenski commented Feb 4, 2021

WLwind commented Feb 5, 2021

SteveMacenski commented Feb 5, 2021 • edited Loading

WLwind commented Feb 5, 2021

SteveMacenski commented Feb 5, 2021 • edited Loading

WLwind commented Feb 7, 2021

SteveMacenski commented Feb 8, 2021

SteveMacenski commented Feb 16, 2021 • edited Loading

WLwind commented Feb 17, 2021 • edited Loading

SteveMacenski commented Mar 10, 2021

WLwind commented Mar 24, 2021

SteveMacenski commented Mar 24, 2021

SteveMacenski commented Feb 3, 2021 •

edited

Loading

SteveMacenski commented Feb 5, 2021 •

edited

Loading

SteveMacenski commented Feb 5, 2021 •

edited

Loading

SteveMacenski commented Feb 16, 2021 •

edited

Loading

WLwind commented Feb 17, 2021 •

edited

Loading