Copy scalars in vector operations to make compiler optimize more #14253

kronbichler · 2022-09-12T08:53:36Z

If we keep the scalar variables, say a in an update a . x + y, as class variables, the compiler cannot prove that the variable a does not alias with one of the vector arrays. As a result, it keeps re-loading the variable over and over again, even though we know that it won't change. While the compiler can do that for simple code, it does not here because the functors get passed around between multiple functions and call variants, including TBB parallelization. In this PR, it is solved in two different way for the two use cases:

For the typical vector update loops, we simply create a local copy inside the part that runs the loop on a subrange, which will then signal to the compiler that its value is indeed constant throughout the loop. (Note that the compiler does not rely on the function being marked const, it really needs the local scope.)
For the reduction operations, we instead copy the functor (and with it, the value of the scalar) as we move into the worker routine. As the functors are small with only some pointer variables, this is cheap enough.

To make sure we do not lose these optimizations and their reason, I added some comments.

drwells

Nice.

Could you add your explanation of why some functors are passed by value and others by reference to the header?

kronbichler · 2022-09-12T14:41:36Z

Could you add your explanation of why some functors are passed by value and others by reference to the header?

I will: In general, the rule is to copy the functors that act on individual elements (reductions), but not the ones still defining loops (vector-add-style functions).

Copy scalars in vector operations to make compiler optimize more

e98a0a3

kronbichler added Linear Algebra ready to test labels Sep 12, 2022

drwells approved these changes Sep 12, 2022

View reviewed changes

tamiko approved these changes Sep 12, 2022

View reviewed changes

Add additional comments regarding aliasing

0e9f7eb

drwells approved these changes Sep 12, 2022

View reviewed changes

peterrum approved these changes Sep 12, 2022

View reviewed changes

drwells merged commit fb2242d into dealii:master Sep 12, 2022

kronbichler mentioned this pull request Sep 13, 2022

Avoid warning about function that might not be inlinable #14264

Merged

kronbichler deleted the avoid_aliasing branch August 10, 2023 16:41

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Copy scalars in vector operations to make compiler optimize more #14253

Copy scalars in vector operations to make compiler optimize more #14253

kronbichler commented Sep 12, 2022

drwells left a comment

kronbichler commented Sep 12, 2022

Copy scalars in vector operations to make compiler optimize more #14253

Copy scalars in vector operations to make compiler optimize more #14253

Conversation

kronbichler commented Sep 12, 2022

drwells left a comment

Choose a reason for hiding this comment

kronbichler commented Sep 12, 2022