Fix vectorization pragmas for icx compiler #3246

Vika-F · 2025-06-03T09:55:34Z

Description

Vectorization pragmas are now defined for the icx/icpx compilers. Previously they were defined only for icc.
Vectorization pragmas were redefined for all compilers in an attempt to use OpenMP 5 #pragma omp simd for vectorization where possible:
cpp/daal/src/services/service_defines.h
Compilation warnings that appeared after the pragma redefinition were fixed.

Note: #pragma omp simd is not used for MSVC, because supporting it requires linking with OpenMP, which would add a dependency to the Windows build.

PR should start as a draft, then move to ready for review state after CI is passed and all applicable checkboxes are closed.
This approach ensures that reviewers don't spend extra time asking for regular requirements.

You can remove a checkbox as not applicable only if it doesn't relate to this PR in any way.
For example, PR with docs update doesn't require checkboxes for performance while PR with any change in actual code should have checkboxes and justify how this code change is expected to affect performance (or justification should be self-evident).

Checklist to comply with before moving PR from draft:

PR completeness and readability

I have reviewed my changes thoroughly before submitting this pull request.
I have commented my code, particularly in hard-to-understand areas.
Git commit message contains an appropriate signed-off-by string (see CONTRIBUTING.md for details).
I have added a respective label(s) to PR if I have a permission for that.
I have resolved any merge conflicts that might occur with the base branch.

Testing

I have run it locally and tested the changes extensively.
All CI jobs are green or I have provided justification why they aren't.

Performance

I have measured performance for affected algorithms using scikit-learn_bench and provided at least summary table with measured data, if performance change is expected.
I have provided justification why performance has changed or why changes are not expected.
I have provided justification why quality metrics have changed or why changes are not expected.
I have extended benchmarking suite and provided corresponding scikit-learn_bench PR if new measurable functionality was introduced in this PR.

david-cortes-intel · 2025-06-03T10:56:17Z

cpp/daal/src/services/service_defines.h

@@ -44,59 +44,49 @@ DAAL_EXPORT bool daal_check_is_intel_cpu();

 #define DAAL_CHECK_CPU_ENVIRONMENT (daal_check_is_intel_cpu())

-#if defined(__INTEL_COMPILER)
+#if defined(__INTEL_COMPILER) || defined(__INTEL_LLVM_COMPILER)
    #define PRAGMA_FORCE_SIMD       _Pragma("ivdep")
    #define PRAGMA_NOVECTOR         _Pragma("novector")
    #define PRAGMA_VECTOR_ALIGNED   _Pragma("vector aligned")


Does this play along with omp simd? How about passing the alignment to the OMP pragma? It also supports #pragma omp simd aligned(pointer_name:64).

The idea is to pass any arguments to #pragma omp simd through the PRAGMA_OMP_SIMD() macro.
For example:
https://github.com/uxlfoundation/oneDAL/pull/3246/files#diff-61b267e32558b19a2dba159ff0128be35c0c8eafe9b140791a7073e262af17f7R1201

Alignment options can be passed similarly.
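To illustrate the clause forwarding described above, here is a minimal sketch. The two macros are copied from the diff in this PR, but their compiler guards are omitted, and `sumWithSimd` is a hypothetical helper written only for this example. It assumes a compiler that accepts `_Pragma` and, for the hint to take effect, a flag such as `-fopenmp-simd`; without it the pragma is simply ignored.

```cpp
#include <cstddef>

// Stand-ins for the macros added to service_defines.h in this PR.
// The surrounding compiler checks from the real header are omitted here.
#define PRAGMA_TO_STR(ARGS)   _Pragma(#ARGS)
#define PRAGMA_OMP_SIMD(ARGS) PRAGMA_TO_STR(omp simd ARGS)

// Hypothetical helper: sums n floats. The reduction clause is forwarded
// verbatim, so the loop is preceded by "#pragma omp simd reduction(+ : s)".
inline float sumWithSimd(const float * x, std::size_t n)
{
    float s = 0.0f;
    PRAGMA_OMP_SIMD(reduction(+ : s))
    for (std::size_t i = 0; i < n; ++i)
    {
        s += x[i];
    }
    return s;
}
```

An `aligned(x : 64)` clause would be passed the same way, provided the pointer really is 64-byte aligned; promising an alignment the data does not have is undefined behavior.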

But does the compiler actually use the hint if put before omp simd?

david-cortes-intel · 2025-06-03T10:57:20Z

cpp/daal/src/services/service_defines.h

    #define DAAL_TYPENAME typename
 #elif defined(_MSC_VER)
-    #define PRAGMA_FORCE_SIMD
-    #define PRAGMA_NOVECTOR
+    #define PRAGMA_FORCE_SIMD _Pragma("loop(ivdep)")


MSVC supports omp simd if enabling experimental mode:
https://learn.microsoft.com/en-us/cpp/parallel/openmp/openmp-simd?view=msvc-170

david-cortes-intel · 2025-06-03T10:57:51Z

cpp/daal/src/services/service_defines.h

-        #define PRAGMA_FORCE_SIMD _Pragma("omp simd")
+        #define PRAGMA_FORCE_SIMD     _Pragma("omp simd")
+        #define PRAGMA_TO_STR(ARGS)   _Pragma(#ARGS)
+        #define PRAGMA_OMP_SIMD(ARGS) PRAGMA_TO_STR(omp simd ARGS)
    #else
        #define PRAGMA_FORCE_SIMD


omp simd is supported by GCC on all platforms as far as I am aware.

cpp/oneapi/dal/backend/common.hpp

david-cortes-intel · 2025-06-05T13:31:37Z

cpp/daal/src/algorithms/cordistance/cordistance_impl.i

+/// \param[in] x                Pointer to the input matrix x of size nRows * nColumns
+/// \param[out] sum              Pointer to the output array of size nRows, where the sum of each row of x will be stored
+template <typename algorithmFPType, CpuType cpu>
+void sumByRows(const size_t nRows, const size_t nColumns, const algorithmFPType * x, algorithmFPType * sum)


Why not call MKL here? https://www.intel.com/content/www/us/en/docs/onemkl/developer-reference-c/2025-1/vslsseditsums.html

I think it is out of the scope of this PR.
Here I am just trying to enable the pragmas for ICX. The refactoring was made just to reduce code duplication and not to do the same modifications in 3 files.
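For context, a row-sum helper of the kind discussed here might look roughly like the sketch below. This is not the code from the PR: the `CpuType` template parameter is dropped, the name `sumByRowsSketch` is hypothetical, and row-major storage of `x` is an assumption.

```cpp
#include <cstddef>

// Hypothetical sketch, not the oneDAL implementation.
// Assumes x is stored row-major with nRows * nColumns elements.
template <typename FPType>
void sumByRowsSketch(std::size_t nRows, std::size_t nColumns, const FPType * x, FPType * sum)
{
    for (std::size_t i = 0; i < nRows; ++i)
    {
        const FPType * row = x + i * nColumns;
        FPType s           = FPType(0);
        // Vectorization hint for the inner reduction; ignored by compilers
        // that are not asked to honor OpenMP SIMD directives.
        _Pragma("omp simd reduction(+ : s)")
        for (std::size_t j = 0; j < nColumns; ++j)
        {
            s += row[j];
        }
        sum[i] = s;
    }
}
```

Keeping the helper in one place means the pragma only has to be adjusted once, which matches the stated goal of avoiding the same modification in three files.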

david-cortes-intel · 2025-06-05T13:33:00Z

cpp/daal/src/algorithms/cordistance/cordistance_impl.i

+    {
+        if (block[i * blockSize + i] > (algorithmFPType)0.0)
+        {
+            block[i * blockSize + i] = (algorithmFPType)1.0 / daal::internal::MathInst<algorithmFPType, cpu>::sSqrt(block[i * blockSize + i]);


There is a PR adding a function for this from MKL: #3227

Perhaps that other one could be merged first.

I am OK with merging that one first and reusing that functionality, but I think the performance has to be measured, since that PR may well affect the algorithm's performance.
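For reference, the diagonal update in the hunk above can be sketched in isolation as follows. This is an illustration only: `invertSqrtOfDiagonal` is a hypothetical name, and `std::sqrt` stands in for `daal::internal::MathInst<algorithmFPType, cpu>::sSqrt`.

```cpp
#include <cmath>
#include <cstddef>

// Hypothetical sketch: replaces each positive diagonal element d of a
// blockSize x blockSize block with 1 / sqrt(d). Other elements, including
// non-positive diagonal entries, are left unchanged.
template <typename FPType>
void invertSqrtOfDiagonal(FPType * block, std::size_t blockSize)
{
    for (std::size_t i = 0; i < blockSize; ++i)
    {
        FPType & d = block[i * blockSize + i];
        if (d > FPType(0))
        {
            d = FPType(1) / std::sqrt(d);
        }
    }
}
```

Any replacement based on a library routine would need to preserve the guard against non-positive diagonal values, which is one reason measuring both correctness and performance is worthwhile.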

Vika-F · 2025-06-17T12:59:43Z

/intelci: run

Vika-F added 4 commits May 19, 2025 06:28

Initial commit

376d071

Fix compiler warnings in correlation distance, remove code duplications

40713d4

Fix vectorization

cac8198

Vectorization fixes

66f1e17

Vika-F added the enhancement label Jun 3, 2025

david-cortes-intel reviewed Jun 3, 2025

Refactoring

8b0cba2

david-cortes-intel reviewed Jun 5, 2025

Vika-F added 11 commits June 6, 2025 15:54

Replace PRAGMA_FORSE_SIMD with PRAGMA_OMP_SIMD

2767476

Fix no argument error in macros

420947a

Add -fopenmp-simd flag to GCC build

6d54505

clang-format

1a5b2f8

Fix CI failures

bf15254

Remove unneeded include

82396e7

Fix MSVS build

b9136ca

Fix MSVC build

e69b8ec

Remove OMP dependency on Windows

8df6709

Remove -openmp:experimental compiler flag from Windows build

ef56204

Merge pull request #55 from uxlfoundation/main

66a39ad

Vika-F mentioned this pull request Jun 18, 2025

Add cosine distance algorithm #3248

Draft

9 tasks
