Tensor product kernels: specialize a function for complex numbers #16754

kronbichler · 2024-03-15T09:46:12Z

I realized that the code compilers generate for the matrix-free tensor product kernels is rather poor when they are used with complex numbers: it does not exploit fused multiply-add (FMA) instructions in the inner reduction loops and thus performs unnecessary work. This is easy to fix, especially with `if constexpr`: we write the code in a way that lets the compiler straightforwardly use FMA operations on both the real and imaginary parts.
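To illustrate the idea, here is a minimal sketch of such an inner reduction loop. It is not the code from this PR: the names `is_std_complex` and `reduce_row` are hypothetical, and the sketch assumes that the 1D matrix entries are real while the data may be `std::complex`. With the real and imaginary parts accumulated separately, each update has the form `a += b * c`, which a compiler can contract into an FMA.

```cpp
#include <cassert>
#include <complex>
#include <cstddef>
#include <type_traits>

// Hypothetical helper trait (not a deal.II class): detects std::complex<T>.
template <typename T>
struct is_std_complex : std::false_type
{};

template <typename T>
struct is_std_complex<std::complex<T>> : std::true_type
{};

// Hypothetical inner reduction: returns sum_i matrix[i] * in[i].
// Assumption: the matrix entries are real, the data may be complex.
template <typename Number, typename Real>
Number
reduce_row(const Real *matrix, const Number *in, const std::size_t n)
{
  assert(n > 0);
  if constexpr (is_std_complex<Number>::value)
    {
      // Accumulate the real and imaginary parts in separate scalars, so
      // that every update is a plain multiply-add on real numbers.
      Real re = matrix[0] * in[0].real();
      Real im = matrix[0] * in[0].imag();
      for (std::size_t i = 1; i < n; ++i)
        {
          re += matrix[i] * in[i].real();
          im += matrix[i] * in[i].imag();
        }
      return Number(re, im);
    }
  else
    {
      // Real-valued data: the generic loop is already a multiply-add.
      Number sum = matrix[0] * in[0];
      for (std::size_t i = 1; i < n; ++i)
        sum += matrix[i] * in[i];
      return sum;
    }
}
```

Calling `reduce_row(matrix, in, n)` with `in` pointing to `std::complex<double>` data selects the first branch at compile time; real-valued data takes the second branch. Whether the compiler actually emits FMA instructions depends on the target architecture and the floating-point contraction settings.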

kronbichler added 2 commits March 15, 2024 10:42

Tensor product kernels: specialize a function for complex numbers

2ad3412

New test cases

21d8639

kronbichler added Matrix-free ready to test labels Mar 15, 2024

masterleinad approved these changes Mar 15, 2024


kronbichler merged commit 4ca6ffc into dealii:master Mar 16, 2024
16 checks passed

kronbichler deleted the fix_complex_numbers branch March 18, 2024 08:29
