Speed-up RBF matrix assembly #1320

davidscn · 2022-06-08T13:33:12Z

Main changes of this PR

Speed up matrix assembly of the RBF matrices by a factor of ~2.2 for compute intensive RBFs and ~6 for simple RBFs (measured for matrix A).

Motivation and additional information

The matrix assembly is a considerable factor in the computeMapping step (depending on the system size). For small systems, the assembly step is dominant, for big system, the decomposition is dominant. These changes lead for the computeMapping step in a speedup of ~2.2 - 6.2 (for 150 vertices) to ~1.13 - 1.06 (for 13.300 vertices). Therefore, the interesting region for partition of unity mapping is accelerated considerably. In addition, there are several considerations to improve the decomposition strategies as well, which will make the overall relevance of an improved assembly more significant (see eg davidscn#3 ).

Depends on #1319.

Author's checklist

I added a changelog file with make changelog if there are user-observable changes since the last release.
I ran make format to ensure everything is formatted correctly.
I sticked to C++14 features.
I sticked to CMake version 3.16.3.
I squashed / am about to squash all commits that should be seen as one.

Reviewers' checklist

Does the changelog entry make sense? Is it formatted correctly?
Do you understand the code changes?

src/mapping/RadialBasisFctSolver.h

fsimonis · 2022-06-15T14:39:37Z

We should consider using Eigen::Index for the iteration instead of hard coding the type.
Alternatively, we could use auto boost irange to hide both the type and the iteration.

davidscn · 2022-06-20T11:50:44Z

We should consider using Eigen::Index for the iteration instead of hard coding the type.
Alternatively, we could use auto boost irange to hide both the type and the iteration.

Thinking about it: how about using auto (which should in principle resolve to Eigen::Index) ?

fsimonis · 2022-06-20T11:56:21Z

The problem with using auto in for loops is that normally the end value has the desired type, but you need the type in the initializer of the for construct.

Defining the end outside the loop and then using decltype is one option.
Using some abstraction that captures the type like boost::irange is another possibility.

davidscn · 2022-06-20T12:09:07Z

Ah I see (and didn't know that). Then I will use the boost solution, thanks!

davidscn · 2022-06-22T12:54:08Z

We should consider using Eigen::Index for the iteration instead of hard coding the type.
Alternatively, we could use auto boost irange to hide both the type and the iteration.

For the sake of completeness let me add that we cannot exceed int in the current implementation though, because that's what we use in many places in preCICE, e.g., the mesh data structures.

davidscn · 2022-06-22T22:00:05Z

@fsimonis had a brief look at it.

davidscn added the enhancement A new feature, a new functionality of preCICE (from user perspective) label Jun 8, 2022

davidscn self-assigned this Jun 8, 2022

davidscn marked this pull request as draft June 8, 2022 13:33

davidscn commented Jun 8, 2022

View reviewed changes

src/mapping/RadialBasisFctSolver.h Outdated Show resolved Hide resolved

davidscn force-pushed the speedup-asm branch from 2ebef0e to 6b9f3e7 Compare June 8, 2022 14:23

davidscn mentioned this pull request Jun 14, 2022

Extract RBF solver into a separate class #1319

Merged

7 tasks

davidscn added 6 commits June 20, 2022 12:56

Extract RBF solver to a separate class

0e13c2b

More assertions

7e17793

Fix error message

526e397

Simplify code and make things const

28e767b

Initial draft

de46e7c

Rebase onto develop part 2

b6fa69b

davidscn force-pushed the speedup-asm branch from 6b9f3e7 to b6fa69b Compare June 20, 2022 11:13

Replace type specification by auto

25e3909

davidscn marked this pull request as ready for review June 20, 2022 11:50

davidscn requested a review from fsimonis June 20, 2022 11:50

Replace auto loop initialization by boost::irange

8f0a3f7

davidscn mentioned this pull request Jun 22, 2022

Add all polynomial options to RBF Eigen mappings #1335

Merged

8 tasks

Add changelog entry

680131f

davidscn merged commit f6c4d97 into precice:develop Jun 22, 2022

davidscn deleted the speedup-asm branch June 22, 2022 22:00

davidscn mentioned this pull request Jun 23, 2022

Improve basis function implementation #1338

Merged

7 tasks

davidscn mentioned this pull request Jul 19, 2022

Apply Cholesky decomposition for s.p.d. mapping matrices #1372

Merged

7 tasks

davidscn added this to the Version 2.5.0 milestone Aug 4, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Speed-up RBF matrix assembly #1320

Speed-up RBF matrix assembly #1320

davidscn commented Jun 8, 2022 •

edited

fsimonis commented Jun 15, 2022

davidscn commented Jun 20, 2022

fsimonis commented Jun 20, 2022

davidscn commented Jun 20, 2022

davidscn commented Jun 22, 2022

davidscn commented Jun 22, 2022

Speed-up RBF matrix assembly #1320

Speed-up RBF matrix assembly #1320

Conversation

davidscn commented Jun 8, 2022 • edited

Main changes of this PR

Motivation and additional information

Author's checklist

Reviewers' checklist

fsimonis commented Jun 15, 2022

davidscn commented Jun 20, 2022

fsimonis commented Jun 20, 2022

davidscn commented Jun 20, 2022

davidscn commented Jun 22, 2022

davidscn commented Jun 22, 2022

davidscn commented Jun 8, 2022 •

edited