Optimize beam sensor model runtime performance #200

glpuga · 2023-05-28T15:54:42Z

Proposed changes

Minor optimizations to the tightest execution loops in the beam sensor model.

While these changes do improve performance a bit, they barely reduce the performance gap we have against Nav2 AMCL. We are still missing something much bigger than these optimizations.

Type of change

🐛 Bugfix (change which fixes an issue)
🚀 Feature (change which adds functionality)
📚 Documentation (change which fixes or extends documentation)

Checklist

Put an x in the boxes that apply. This is simply a reminder of what we will require before merging your code.

Lint and unit tests (if any) pass locally with my changes
I have added tests that prove my fix is effective or that my feature works
I have added necessary documentation (if appropriate)
All commits have been signed for DCO

Additional comments

These changes change the performance profile, as seen through perf from this:

to this:

Note: These flamegraphs assume the changes in #199 have already been merged, since there are grid optimizations that are common to both Likelihood and Beam.

Notice that, relative to the beluga::Bresenham2i::Line block (which was not modified, and whose cost per execution and total number of executions are unchanged), the overall time spent in the importance_weight function seems to have dropped significantly.

Notice also the removal of the second stack call tower on the right, which appears to be related to queuing in the MessageFilter.

However, these changes barely affected the CPU usage profile:

And the change is barely noticeable in the difference against Nav2 AMCL.

While it's still possible that our implementation of Bresenham is less performant than Nav2's, even if we somehow reduced the tracing runtime cost to zero with some magic implementation, that would only get us to basically the same level as Nav2 AMCL. To me that indicates that the problem is somewhere else.

I suspect we are actually doing more work than Nav2, but I haven't been able to find any proof of that.

Further work is still needed.
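For readers unfamiliar with the tracing step discussed above, here is a minimal sketch of integer Bresenham line traversal on a grid. It is not the code in beluga::Bresenham2i::Line or in Nav2; the function name `trace_line` and its signature are hypothetical, chosen only for illustration.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <cstdlib>
#include <utility>
#include <vector>

// Hypothetical helper (not part of beluga): returns every grid cell visited
// by an integer Bresenham line from (x0, y0) to (x1, y1), endpoints included.
std::vector<std::pair<int, int>> trace_line(int x0, int y0, int x1, int y1) {
  const int dx = std::abs(x1 - x0);
  const int dy = -std::abs(y1 - y0);
  const int sx = x0 < x1 ? 1 : -1;  // step direction along x
  const int sy = y0 < y1 ? 1 : -1;  // step direction along y
  int err = dx + dy;                // combined error term for both axes

  std::vector<std::pair<int, int>> cells;
  cells.reserve(static_cast<std::size_t>(std::max(dx, -dy)) + 1);

  while (true) {
    cells.emplace_back(x0, y0);
    if (x0 == x1 && y0 == y1) {
      break;  // reached the end cell
    }
    const int e2 = 2 * err;
    if (e2 >= dy) {  // error allows a step along x
      err += dy;
      x0 += sx;
    }
    if (e2 <= dx) {  // error allows a step along y
      err += dx;
      y0 += sy;
    }
  }
  assert(!cells.empty());  // the start cell is always visited
  return cells;
}
```

A beam model would typically stop such a traversal at the first occupied cell instead of collecting the whole line; the sketch only shows the per-cell work that the flamegraphs attribute to the tracing loop.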

hidmic

LGTM

beluga/include/beluga/sensor/beam_model.hpp

nahueespinosa

@glpuga Left two super minor comments. It is also worth noting that RelWithDebInfo reduces the optimization level to -O2 instead of -O3, so flamegraphs might not be as good at comparing micro-optimization changes like these.

Microbenchmarks using googlebenchmark compiled in Release mode may be better at detecting true progress.
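As a rough illustration of the idea, the sketch below times a callable with `std::chrono` only. It is a stand-in, not Google Benchmark: the helper name `mean_ns_per_call` is hypothetical, and it does none of the warm-up, automatic iteration selection, or statistics that a real benchmark library provides. It should be built with optimizations enabled (for example in Release mode) for the numbers to mean anything.

```cpp
#include <cassert>
#include <chrono>
#include <cstdint>

// Hypothetical helper: average wall-clock nanoseconds per call of `fn`,
// measured over `iterations` back-to-back calls.
template <class Fn>
double mean_ns_per_call(Fn&& fn, std::int64_t iterations) {
  assert(iterations > 0);
  const auto start = std::chrono::steady_clock::now();
  for (std::int64_t i = 0; i < iterations; ++i) {
    fn();
  }
  const auto stop = std::chrono::steady_clock::now();
  // Convert the elapsed time to floating-point nanoseconds.
  const std::chrono::duration<double, std::nano> elapsed = stop - start;
  return elapsed.count() / static_cast<double>(iterations);
}
```

The workload passed in should write its result to something the compiler cannot discard (a `volatile` sink, for instance), otherwise an optimizing build may remove the very code being measured.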

beluga/include/beluga/algorithm/raycasting.hpp

beluga/include/beluga/sensor/data/regular_grid.hpp

Signed-off-by: Gerardo Puga <glpuga@ekumenlabs.com>

Adds an updated report including the following changes from the last:
- Includes the changes merged in #195 #199 #200 #207
- Measured using the 1x replay speed to prevent distortions to the CPU results
- Fixes typos in configuration files, found during review

Signed-off-by: Gerardo Puga <glpuga@ekumenlabs.com>

glpuga force-pushed the glpuga/speed_up_beam branch from 1df6f39 to c56cb0a Compare May 28, 2023 15:56

glpuga requested review from nahueespinosa and hidmic May 28, 2023 16:00

glpuga added the enhancement and cpp labels May 28, 2023

hidmic approved these changes May 29, 2023

beluga/include/beluga/sensor/beam_model.hpp

beluga/include/beluga/sensor/beam_model.hpp

nahueespinosa reviewed May 29, 2023

beluga/include/beluga/algorithm/raycasting.hpp Outdated

beluga/include/beluga/sensor/data/regular_grid.hpp Outdated

nahueespinosa assigned glpuga May 29, 2023

glpuga force-pushed the glpuga/speed_up_beam branch from d776da1 to 16083dc Compare May 30, 2023 00:15

Optimize Beam sensor model runtime performance

4bf3ac6

Signed-off-by: Gerardo Puga <glpuga@ekumenlabs.com>

glpuga force-pushed the glpuga/speed_up_beam branch from 16083dc to 4bf3ac6 Compare May 30, 2023 00:45

nahueespinosa changed the title ~~Optimize Beam sensor model runtime performance~~ Optimize beam sensor model runtime performance May 30, 2023

nahueespinosa approved these changes May 30, 2023

glpuga merged commit 74ed2f5 into main May 30, 2023
5 checks passed

glpuga deleted the glpuga/speed_up_beam branch May 30, 2023 14:22

glpuga mentioned this pull request Jun 3, 2023

Generate new report after recent performance updates #208

Merged

7 tasks
