Dima/covar speedup #399

Merged: 3 commits merged into ros2:ros2 on May 13, 2021

@dorezyuk dorezyuk commented Apr 18, 2021

Hey everyone,
here is a small contribution to speed up the conversion of the flat array into a nested array. The new implementation is branchless and, under GCC 9.3, runs 6 times (with -O2) to 8 times (with -O3) faster:

https://quick-bench.com/q/rShFc-mEK290Dcms2cUx1XcwuMs

Play around with the library versions and settings (don't pick clang, since clang will optimize the operations away entirely).
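
For readers without the diff at hand, here is a minimal sketch of what a branchless flat-to-nested conversion could look like. It is not the code from this pull request: the function name `flatToNested`, the type aliases, and the 6x6 row-major layout are assumptions made for illustration. The idea is to copy one full row at a time with `std::copy_n`, so no per-element check for "end of row" is needed and the compiler is free to unroll or vectorize the fixed-size copies.

```cpp
#include <algorithm>
#include <array>
#include <cstddef>

// Hypothetical aliases: a flat row-major 6x6 matrix and its nested form.
using FlatCovariance = std::array<double, 36>;
using NestedCovariance = std::array<std::array<double, 6>, 6>;

// Hypothetical helper (not the library's API): converts the flat row-major
// array into a nested array by copying one row per iteration.
NestedCovariance flatToNested(const FlatCovariance & flat)
{
  NestedCovariance nested{};
  auto in = flat.begin();
  for (auto & row : nested) {
    // Copy row.size() consecutive values; no per-element conditional
    // is needed to detect the end of a row.
    std::copy_n(in, row.size(), row.begin());
    in += row.size();
  }
  return nested;
}
```

With this layout, element `i` of the flat array ends up at `nested[i / 6][i % 6]`. The only loop left has a fixed trip count of six, which is what makes the copy easy for an optimizer to flatten.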

Best,
Dima

@clalancette clalancette self-assigned this May 6, 2021
Signed-off-by: Chris Lalancette <clalancette@openrobotics.org>

@clalancette clalancette left a comment

I made a couple of changes to remove auto where I could. Otherwise, this looks good to me; I'll run CI on it next. Thanks for the contribution!

@clalancette

CI:

  • Linux Build Status
  • Linux-aarch64 Build Status
  • macOS Build Status
  • Windows Build Status

@clalancette clalancette merged commit 8df50cd into ros2:ros2 May 13, 2021