Update CPU Architectures, Update YATeTo #962

davschneller · 2023-09-22T10:22:52Z

Adds several new CPU architectures, as shown in YATeTo right now (but not yet apple-m2). Also adds dummy architectures for SVE and NEON.
Splits between ALIGNMENT and VECTORSIZE. The latter, VECTORSIZE should equal the YaTeTo value; it's from now on used in calculating any aligned reals or basis functions. The former value, ALIGNMENT may be larger (e.g. as large as a cache line), but mostly, it's still the same value.
Updates YATeTo. That also includes Remove some deepcop[ies] to speed up strengthReduction yateto#66 , and by that a significant speed improvement for the kernel generation.
Reworks the GEMM_TOOLS_LIST=auto option to include LIBXSMM_JIT before LIBXSMM, and the new architectures. Also, all of these programs need to be found by CMake before they are considered for auto. Eigen is now always added at the end in this case.
Redzone is now disabled by default (especially for noarch), and enabled for most X86_64 CPUs again.

davschneller · 2023-09-22T10:25:15Z

Maybe also related to #960 . (i.e. the split between ALIGNMENT and VECTORSIZE)

codecov-commenter · 2023-09-22T10:58:23Z

Codecov Report

Merging #962 (c83b8d9) into master (1579dfa) will not change coverage.
Report is 2 commits behind head on master.
The diff coverage is 0.00%.

❗ Current head c83b8d9 differs from pull request most recent head 6a1eb0e. Consider uploading reports for the commit 6a1eb0e to get more accurate results

❗ Your organization needs to install the Codecov GitHub app to enable full functionality.

@@           Coverage Diff           @@
##           master     #962   +/-   ##
=======================================
  Coverage   14.39%   14.39%           
=======================================
  Files         253      253           
  Lines       14223    14223           
=======================================
  Hits         2047     2047           
  Misses      12176    12176

Files	Coverage Δ
src/Kernels/common.hpp	`0.00% <0.00%> (ø)`

📣 We’re building smart automated test selection to slash your CI/CD build times. Learn more

sebwolf-de · 2023-09-22T11:19:59Z

src/Kernels/common.hpp

@@ -126,7 +126,7 @@ constexpr unsigned int
 * @return aligned number of reals.
 **/
 constexpr unsigned int getNumberOfAlignedReals(unsigned int i_numberOfReals,
-                                               unsigned int i_alignment = ALIGNMENT) {
+                                               unsigned int i_alignment = VECTORSIZE) {


Should we then also change the variable name to i_vectorsize?

IMO no, because it is an alignment (to vectorsize)

(same below)

Ok, I've left it that way for now—since even though it's VECTORSIZE, we're aligning everything. Instead, we may have to rename the ALIGNMENT variable soon, I'd guess. And also, make both of them a C++ constexpr maybe.

I've also cleaned up the file a bit (i.e. remove i_ etc.) and templated the functions with the size of the reals used. ... That could also become a parameter instead.

sebwolf-de · 2023-09-22T11:20:15Z

src/Kernels/common.hpp

@@ -145,7 +145,7 @@ constexpr unsigned int getNumberOfAlignedReals(unsigned int i_numberOfReals,
 **/
 constexpr unsigned int
    getNumberOfAlignedBasisFunctions(unsigned int i_convergenceOrder = CONVERGENCE_ORDER,
-                                     unsigned int i_alignment = ALIGNMENT) {
+                                     unsigned int i_alignment = VECTORSIZE) {


Change variable name?

sebwolf-de · 2023-09-22T11:20:19Z

src/Kernels/common.hpp

@@ -161,7 +161,7 @@ constexpr unsigned int
 **/
 constexpr unsigned
    getNumberOfAlignedDerivativeBasisFunctions(unsigned int i_convergenceOrder = CONVERGENCE_ORDER,
-                                               unsigned int i_alignment = ALIGNMENT) {
+                                               unsigned int i_alignment = VECTORSIZE) {


Change variable name?

…issol into davschneller/update-cpu-archs

davschneller added 5 commits September 21, 2023 10:47

Update Yateto

772e6eb

Add more CPU architectures, better auto GEMM_TOOLS

8ecfbe2

Switch ALIGNMENT to VECTORSIZE for aligned reals

9f526c3

Update GEMM Tools finder

1f1ef9e

Add Zen 1, 2, 4

2dbb4f9

davschneller changed the title ~~Update CPU Architectures, update YATeto~~ Update CPU Architectures, update YATeTo Sep 22, 2023

davschneller changed the title ~~Update CPU Architectures, update YATeTo~~ Update CPU Architectures, Update YATeTo Sep 22, 2023

Remove dummy arch flags

4b1701a

sebwolf-de reviewed Sep 22, 2023

View reviewed changes

davschneller added 3 commits September 26, 2023 10:45

Eigen only when no other generator is available

47e2c82

Code cleanup, templating

3e03fdb

Merge branch 'davschneller/update-cpu-archs' of github.com:seissol/se…

c83b8d9

…issol into davschneller/update-cpu-archs

davschneller mentioned this pull request Sep 26, 2023

Add support for MacOS & M1/M2 #963

Merged

Concat GEMM tools finding again

6a1eb0e

sebwolf-de approved these changes Sep 27, 2023

View reviewed changes

sebwolf-de added this pull request to the merge queue Sep 27, 2023

Merged via the queue into master with commit 7901b7c Sep 27, 2023
5 checks passed

krenzland deleted the davschneller/update-cpu-archs branch September 27, 2023 09:29

davschneller mentioned this pull request Nov 9, 2023

Update CMakeLists.txt #989

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Update CPU Architectures, Update YATeTo #962

Update CPU Architectures, Update YATeTo #962

davschneller commented Sep 22, 2023 •

edited

davschneller commented Sep 22, 2023

codecov-commenter commented Sep 22, 2023 •

edited

sebwolf-de Sep 22, 2023

krenzland Sep 22, 2023

krenzland Sep 22, 2023

davschneller Sep 26, 2023

sebwolf-de Sep 22, 2023

sebwolf-de Sep 22, 2023

Update CPU Architectures, Update YATeTo #962

Update CPU Architectures, Update YATeTo #962

Conversation

davschneller commented Sep 22, 2023 • edited

davschneller commented Sep 22, 2023

codecov-commenter commented Sep 22, 2023 • edited

Codecov Report

sebwolf-de Sep 22, 2023

Choose a reason for hiding this comment

krenzland Sep 22, 2023

Choose a reason for hiding this comment

krenzland Sep 22, 2023

Choose a reason for hiding this comment

davschneller Sep 26, 2023

Choose a reason for hiding this comment

sebwolf-de Sep 22, 2023

Choose a reason for hiding this comment

sebwolf-de Sep 22, 2023

Choose a reason for hiding this comment

davschneller commented Sep 22, 2023 •

edited

codecov-commenter commented Sep 22, 2023 •

edited