[LLVM][GPU][+refactoring] Replacement of math intrinsics with library calls #835

georgemitenkov · 2022-03-30T14:42:26Z

This PR adds an LLVM pass that replaces math intrinsics
with calls to a math library. In particular:

* The replacement with SIMD functions is factored out into a separate file, and LLVM version dependencies are dropped (we already use LLVM 13 anyway).
* A pass that replaces intrinsics with libdevice calls when targeting CUDA platforms has been added. So far only exp is supported (single and double precision).
* A test that checks the replacement has been added.

Note: factoring the replacement functionality into a separate file
allows us to completely drop the dependency on target information
inside LLVMCodegenVisitor 😊

bbpbuildbot · 2022-03-30T15:01:16Z

Logfiles from GitLab pipeline #44990 (:white_check_mark:) have been uploaded here!

Status and direct links:

iomaganaris

LGTM
Just a few small suggestions

iomaganaris · 2022-04-05T14:58:07Z

src/codegen/llvm/replace_with_lib_functions.cpp

+            DISPATCH("llvm.pow.f64", "_ZGVeN8vv_pow", FIXED(8))
+            // clang-format on
+        };
+#undef DISPATCH


FIXED should also be undefined?

Suggested change

-#undef DISPATCH
+#undef DISPATCH
+#undef FIXED
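For context, a minimal sketch of the macro hygiene this suggestion is about. The `VecEntry` struct, the `make_table` function, and both macro bodies are hypothetical stand-ins (the real definitions of `DISPATCH` and `FIXED` are not shown in this thread); only the `llvm.pow.f64` row is taken from the diff above.

```cpp
#include <string>
#include <vector>

// Hypothetical table row: scalar intrinsic, vector function, vector width.
struct VecEntry {
    std::string scalar_name;
    std::string vector_name;
    unsigned width;
};

std::vector<VecEntry> make_table() {
// Helper macros that are only meaningful while the table is being built.
// These bodies are assumptions made for illustration.
#define FIXED(w) (w)
#define DISPATCH(scalar, vec, width) VecEntry{scalar, vec, width},
    std::vector<VecEntry> table = {
        DISPATCH("llvm.pow.f64", "_ZGVeN8vv_pow", FIXED(8))
    };
// Undefine both helpers so that neither leaks into the rest of the file.
#undef DISPATCH
#undef FIXED
    return table;
}

// Compile-time check that no helper macro survived the table definition.
#if defined(DISPATCH) || defined(FIXED)
#error "helper macros leaked out of the table definition"
#endif
```

If only `DISPATCH` were undefined, the `#error` check would fire because `FIXED` would still be visible to any code that follows.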

iomaganaris · 2022-04-05T15:10:54Z

src/codegen/llvm/replace_with_lib_functions.cpp

+        // Add vectorizable functions to the target library info.
+        switch (library->second) {
+        case VecLib::LIBMVEC_X86:
+            if (!triple.isX86() || !triple.isArch64Bit())
+                break;
+        default:
+            tli.addVectorizableFunctionsFromVecLib(library->second);
+            break;
+        }


Just a personal opinion: I'm not sure what the proper way to do this is, or what the benefit of the switch would be, but I think it would be more understandable to write it like this:

Suggested change

-        // Add vectorizable functions to the target library info.
-        switch (library->second) {
-        case VecLib::LIBMVEC_X86:
-            if (!triple.isX86() || !triple.isArch64Bit())
-                break;
-        default:
-            tli.addVectorizableFunctionsFromVecLib(library->second);
-            break;
-        }
+        if (library->second != VecLib::LIBMVEC_X86 || (triple.isX86() && triple.isArch64Bit())) {
+            tli.addVectorizableFunctionsFromVecLib(library->second);
+        }

Feel free to keep this as you prefer
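A quick way to check that the two forms behave the same is to model them with stand-in types. In the sketch below, the `VecLib` enum and the `Triple` struct are simplified, hypothetical stubs (not the LLVM classes), and the returned `bool` stands in for "the vectorizable functions were registered".

```cpp
// Hypothetical stand-ins for the LLVM types used in the diff.
enum class VecLib { Accelerate, LIBMVEC_X86, MASSV, SVML };

struct Triple {
    bool x86;
    bool arch64;
    bool isX86() const { return x86; }
    bool isArch64Bit() const { return arch64; }
};

// Original form: the LIBMVEC_X86 case breaks out early on unsupported
// targets and otherwise falls through to the default branch.
bool registers_with_switch(VecLib lib, const Triple& triple) {
    bool registered = false;
    switch (lib) {
    case VecLib::LIBMVEC_X86:
        if (!triple.isX86() || !triple.isArch64Bit())
            break;
        // fall through
    default:
        registered = true;
        break;
    }
    return registered;
}

// Suggested form: a single condition with no fall-through.
bool registers_with_if(VecLib lib, const Triple& triple) {
    return lib != VecLib::LIBMVEC_X86 || (triple.isX86() && triple.isArch64Bit());
}
```

Under these assumptions both functions agree for every combination of library and target, so the choice between them is a matter of readability only.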

iomaganaris · 2022-04-05T15:15:24Z

src/codegen/llvm/replace_with_lib_functions.cpp

+
+    // Map of supported replacements. For now it is only exp.
+    static const std::map<std::string, std::string> libdevice_name = {
+            {"llvm.exp.f32", "__nv_expf"},


I think it would be good to also add pow, and maybe to look for any other math functions commonly used in the mod files and add them to the x86 and aarch64 maps as well. I can also look into this in the next few days.
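As a rough illustration of what the extended map could look like, here is a standalone sketch. The helper `libdevice_replacement` is hypothetical and not part of the PR. Only the `llvm.exp.f32` entry appears in the diff above; the other three rows are assumptions based on the libdevice naming scheme (`__nv_exp`, `__nv_powf`, `__nv_pow`).

```cpp
#include <map>
#include <string>

// Hypothetical helper: returns the libdevice function name for an LLVM
// intrinsic name, or an empty string if no replacement is known.
std::string libdevice_replacement(const std::string& intrinsic) {
    static const std::map<std::string, std::string> libdevice_name = {
        {"llvm.exp.f32", "__nv_expf"},  // from the diff above
        {"llvm.exp.f64", "__nv_exp"},   // assumed double-precision entry
        {"llvm.pow.f32", "__nv_powf"},  // assumed entry for pow
        {"llvm.pow.f64", "__nv_pow"},   // assumed entry for pow
    };
    const auto it = libdevice_name.find(intrinsic);
    if (it == libdevice_name.end())
        return std::string();  // unsupported intrinsic: leave the call as is
    return it->second;
}
```

Returning an empty string for unknown intrinsics lets a caller skip calls it cannot replace rather than fail on them.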

bbpbuildbot · 2022-04-07T14:02:26Z

Logfiles from GitLab pipeline #47044 (:no_entry:) have been uploaded here!

Status and direct links:

bbpbuildbot · 2022-04-07T14:39:47Z

Logfiles from GitLab pipeline #47063 (:white_check_mark:) have been uploaded here!

Status and direct links:

bbpbuildbot · 2022-04-07T14:39:58Z

Logfiles from GitLab pipeline #47062 (:white_check_mark:) have been uploaded here!

Status and direct links:

bbpbuildbot · 2022-04-07T17:49:25Z

Logfiles from GitLab pipeline #47154 (:white_check_mark:) have been uploaded here!

Status and direct links:

bbpbuildbot · 2022-04-07T17:49:30Z

Logfiles from GitLab pipeline #47153 (:white_check_mark:) have been uploaded here!

Status and direct links:

… calls (#835)

Added an LLVM pass that replaces math intrinsics with calls to math library. In particular:

* Functionality of replacement with SIMD functions is factored out into a separate file and LLVM version dependencies are dropped (LLVM 13 is already used anyway).
* A pass to replace intrinsics with libdevice calls when targeting CUDA platforms has been added. So far only `exp` and `pow` are supported (single and double precision).
* Added a test to check the replacement

Co-authored-by: Ioannis Magkanaris <iomagkanaris@gmail.com>

georgemitenkov added 3 commits March 29, 2022 23:44

Implemented replacement with SIMD libcalls as LLVM pass

0cf0eae

Implemented replacement with libdevice calls as LLVM pass

612027f

Added a test and comments

060d234

georgemitenkov added the llvm label Mar 30, 2022

georgemitenkov requested review from pramodk and iomaganaris March 30, 2022 14:43

iomaganaris mentioned this pull request Apr 1, 2022

[LLVM][GPU] Added CUDADriver to execute benchmark on GPU #829

Merged

8 tasks

iomaganaris approved these changes Apr 5, 2022


Merge branch 'llvm' into georgemitenkov/llvm-math-library-replacement

c08a22e

Fix merge

c2157e2

georgemitenkov added 2 commits April 7, 2022 19:30

Addressed comments

f25522d

clang-format

85d42c7

georgemitenkov merged commit 6c3fe22 into llvm Apr 8, 2022

georgemitenkov deleted the georgemitenkov/llvm-math-library-replacement branch April 8, 2022 07:30
