perf: Optimize Eigen usage in covariance engine #1183

stephenswat · 2022-03-03T15:25:01Z

This commit optimizes some of the Eigen usage in the covariance engine, specifically on the critical path of the propagation examples. The first optimization introduces a tiled matrix multiplication method, which takes 2i×2j matrices and performs four i×j multiplications instead, which Eigen can optimize far more easily. Second, we reduce the number of floating-point operations by working with smaller submatrices wherever possible.
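To illustrate the tiling idea, here is a minimal sketch. It is not the code from this PR: it uses plain `std::array` instead of Eigen so that it stays self-contained, and the names `Matrix`, `naiveMult`, `block`, and `tiledMult` are hypothetical. It assumes all dimensions are even, splits each operand into a 2×2 grid of half-size tiles, and accumulates each of the four output tiles from two half-size products.

```cpp
#include <array>
#include <cstddef>

// Hypothetical helper type for this sketch; not part of ACTS or Eigen.
template <std::size_t R, std::size_t C>
using Matrix = std::array<std::array<double, C>, R>;

// Reference product C = A * B using a plain triple loop.
template <std::size_t M, std::size_t N, std::size_t P>
Matrix<M, P> naiveMult(const Matrix<M, N>& a, const Matrix<N, P>& b) {
  Matrix<M, P> c{};  // value-initialized, so every entry starts at zero
  for (std::size_t i = 0; i < M; ++i) {
    for (std::size_t k = 0; k < N; ++k) {
      for (std::size_t j = 0; j < P; ++j) {
        c[i][j] += a[i][k] * b[k][j];
      }
    }
  }
  return c;
}

// Copy the H x W tile of `m` whose top-left corner is at (r0, c0).
template <std::size_t H, std::size_t W, std::size_t R, std::size_t C>
Matrix<H, W> block(const Matrix<R, C>& m, std::size_t r0, std::size_t c0) {
  Matrix<H, W> out{};
  for (std::size_t i = 0; i < H; ++i) {
    for (std::size_t j = 0; j < W; ++j) {
      out[i][j] = m[r0 + i][c0 + j];
    }
  }
  return out;
}

// Tiled product: split A and B into 2 x 2 grids of half-size tiles and
// compute each output tile as C_ij = A_i1 * B_1j + A_i2 * B_2j.
template <std::size_t M, std::size_t N, std::size_t P>
Matrix<M, P> tiledMult(const Matrix<M, N>& a, const Matrix<N, P>& b) {
  static_assert(M % 2 == 0 && N % 2 == 0 && P % 2 == 0,
                "this sketch assumes even dimensions");
  constexpr std::size_t m = M / 2;
  constexpr std::size_t n = N / 2;
  constexpr std::size_t p = P / 2;

  Matrix<M, P> c{};
  for (std::size_t bi = 0; bi < 2; ++bi) {
    for (std::size_t bj = 0; bj < 2; ++bj) {
      for (std::size_t bk = 0; bk < 2; ++bk) {
        // One half-size product; the small fixed size is what makes it
        // easy for a compiler or library to unroll and vectorize.
        const auto prod = naiveMult(block<m, n>(a, bi * m, bk * n),
                                    block<n, p>(b, bk * n, bj * p));
        // Accumulate the partial product into output tile (bi, bj).
        for (std::size_t r = 0; r < m; ++r) {
          for (std::size_t col = 0; col < p; ++col) {
            c[bi * m + r][bj * p + col] += prod[r][col];
          }
        }
      }
    }
  }
  return c;
}
```

For 8×8 inputs, `tiledMult(a, b)` only ever multiplies 4×4 tiles. With Eigen, the tiles would normally be fixed-size block expressions such as `a.block<4, 4>(0, 0)` rather than copies.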

On my machine, the propagation example takes 53.555595 ms/event before this patch and 43.750143 ms/event after it, a reduction of roughly 18%. This gain is independent of the performance gain from #1181.

codecov · 2022-03-03T16:09:20Z

Codecov Report

Merging #1183 (057000a) into main (3b264d7) will not change coverage.
The diff coverage is 0.00%.

@@           Coverage Diff           @@
##             main    #1183   +/-   ##
=======================================
  Coverage   47.81%   47.81%           
=======================================
  Files         360      360           
  Lines       18591    18591           
  Branches     8769     8769           
=======================================
  Hits         8890     8890           
  Misses       3649     3649           
  Partials     6052     6052

Impacted Files	Coverage Δ
Core/src/Propagator/detail/CovarianceEngine.cpp	`52.06% <0.00%> (ø)`

paulgessinger · 2022-03-04T09:31:57Z

On my end I see wall-time for the generic propagation example go from ~26s to 22s for 500 events. Good stuff!

stephenswat · 2022-03-15T22:20:52Z

This is now ready to go in.

stephenswat requested a review from paulgessinger March 3, 2022 15:25

stephenswat added the Component - Core, Impact - Minor, and Improvement labels Mar 3, 2022

stephenswat added this to the next milestone Mar 3, 2022

stephenswat mentioned this pull request Mar 3, 2022

perf: Tile 8×8 covariance matrix multiplication #1181

Merged

paulgessinger reviewed Mar 4, 2022

Core/src/Propagator/detail/CovarianceEngine.cpp (outdated, resolved)

paulgessinger reviewed Mar 4, 2022

Core/src/Propagator/detail/CovarianceEngine.cpp (outdated, resolved)

paulgessinger reviewed Mar 4, 2022

Core/src/Propagator/detail/CovarianceEngine.cpp (resolved)

stephenswat added the 🚧 WIP Work-in-progress label Mar 4, 2022

stephenswat force-pushed the perf/covariance_engine_gemm branch from 9f8f38c to cf24e2d Compare March 15, 2022 22:19

stephenswat removed the 🚧 WIP Work-in-progress label Mar 15, 2022

stephenswat requested a review from paulgessinger March 15, 2022 22:20

paulgessinger approved these changes Mar 16, 2022

paulgessinger added the automerge label Mar 16, 2022

paulgessinger self-assigned this Mar 18, 2022

Merge branch 'main' into perf/covariance_engine_gemm

057000a

kodiakhq bot merged commit 6969a87 into acts-project:main Mar 18, 2022

paulgessinger modified the milestones: next, v18.0.0 Apr 4, 2022

andiwand mentioned this pull request Apr 19, 2024

refactor: Remove blockMult from boundToCurvilinearTransportJacobian #3127

Merged
