
Fix runTest segfault (remove cudaDeviceReset) and simplify googletest template usage #909

Merged
merged 84 commits into madgraph5:master on Jul 16, 2024

Conversation

valassi
Member

@valassi valassi commented Jul 12, 2024

This PR fixes #907, finally removing the issue that was blocking the implementation of two tests (#896). This also implied a simplification in the usage of googletest templates in our code. I made a successful proof of concept of adding the two tests, which I need for #896 in the work I am doing on master_june24 for channelids (previously this was blocked by bug #907).

@oliviermattelaer can you please review? Note, this PR sits on top of PR #908, which itself sits on top of #900 and #905. So I suggest reviewing and merging in this order

Thanks
Andrea

…n both CPU and GPU (prepare for madgraph5#896) - the C++ tests succeed but the CUDA tests segfaults madgraph5#903
…from release-1.11.0 to v1.14.0 to solve madgraph5#903, but the segfault remains - will revert
…ase-1.11.0

Revert "[gtest/june24] in CODEGEN cudacpp_test.mk, try to upgrade googletest from release-1.11.0 to v1.14.0 to solve madgraph5#903, but the segfault remains - will revert"
This reverts commit 34cd623.
…cc build in CUDA while debugging madgraph5#903

With testmisc.cc, valgrind gives a confusing error

==2887713== Stack overflow in thread #1: can't grow stack to 0x1ffe801000
==2887713==
==2887713== Process terminating with default action of signal 11 (SIGSEGV): dumping core
==2887713==  Access not within mapped region at address 0x1FFE801FF8
==2887713== Stack overflow in thread #1: can't grow stack to 0x1ffe801000
==2887713==    at 0x449C06: mg5amcGpu::constexpr_sin_quad(long double, bool) (constexpr_math.h:156)
==2887713==  If you believe this happened as a result of a stack
==2887713==  overflow in your program's main thread (unlikely but
==2887713==  possible), you can try to increase the size of the
==2887713==  main thread stack using the --main-stacksize= flag.
==2887713==  The main thread stack size used in this run was 8388608.
==2887713==
==2887713== HEAP SUMMARY:
==2887713==     in use at exit: 21,309,363 bytes in 13,995 blocks
==2887713==   total heap usage: 18,083 allocs, 4,088 frees, 51,971,780 bytes allocated
==2887713==
==2887713== LEAK SUMMARY:
==2887713==    definitely lost: 0 bytes in 0 blocks
==2887713==    indirectly lost: 0 bytes in 0 blocks
==2887713==      possibly lost: 2,599,608 bytes in 825 blocks
==2887713==    still reachable: 18,709,755 bytes in 13,170 blocks
==2887713==         suppressed: 0 bytes in 0 blocks
==2887713== Rerun with --leak-check=full to see details of leaked memory
==2887713==
==2887713== For lists of detected and suppressed errors, rerun with: -s
==2887713== ERROR SUMMARY: 0 errors from 0 contexts (suppressed: 0 from 0)
Segmentation fault (core dumped)

Without testmisc.cc instead

[ RUN      ] SIGMA_SM_GG_TTX_GPU2/MadgraphTest.CompareMomentaAndME/0
INFO: Opening reference file ../../test/ref/dump_CPUTest.Sigma_sm_gg_ttx.txt
==2889432== Invalid write of size 8
==2889432==    at 0x484E2DB: memmove (vg_replace_strmem.c:1385)
==2889432==    by 0x41A6EA: double* std::__copy_move<false, true, std::random_access_iterator_tag>::__copy_m<double>(double const*, double const*, double*) (stl_algobase.h:431)
==2889432==    by 0x41A49B: double* std::__copy_move_a2<false, double*, double*>(double*, double*, double*) (stl_algobase.h:494)
==2889432==    by 0x41A1A5: double* std::__copy_move_a1<false, double*, double*>(double*, double*, double*) (stl_algobase.h:522)
==2889432==    by 0x419F4D: double* std::__copy_move_a<false, __gnu_cxx::__normal_iterator<double*, std::vector<double, std::allocator<double> > >, double*>(__gnu_cxx::__normal_iterator<double*, std::vector<double, std::allocator<double> > >, __gnu_cxx::__normal_iterator<double*, std::vector<double, std::allocator<double> > >, double*) (stl_algobase.h:529)
==2889432==    by 0x419D0C: double* std::copy<__gnu_cxx::__normal_iterator<double*, std::vector<double, std::allocator<double> > >, double*>(__gnu_cxx::__normal_iterator<double*, std::vector<double, std::allocator<double> > >, __gnu_cxx::__normal_iterator<double*, std::vector<double, std::allocator<double> > >, double*) (stl_algobase.h:619)
==2889432==    by 0x419950: mg5amcGpu::CommonRandomNumberKernel::generateRnarray() (CommonRandomNumberKernel.cc:34)
==2889432==    by 0x44443D: CUDATest::prepareRandomNumbers(unsigned int) (runTest.cc:202)
==2889432==    by 0x440D98: MadgraphTest_CompareMomentaAndME_Test::TestBody() (MadgraphTest.h:253)
==2889432==    by 0x48790F: void testing::internal::HandleSehExceptionsInMethodIfSupported<testing::Test, void>(testing::Test*, void (testing::Test::*)(), char const*) (gtest.cc:2607)
==2889432==    by 0x480EF8: void testing::internal::HandleExceptionsInMethodIfSupported<testing::Test, void>(testing::Test*, void (testing::Test::*)(), char const*) (gtest.cc:2643)
==2889432==    by 0x459587: testing::Test::Run() (gtest.cc:2682)
==2889432==  Address 0x2fc0f200 is not stack'd, malloc'd or (recently) free'd
==2889432==
==2889432==
==2889432== Process terminating with default action of signal 11 (SIGSEGV): dumping core
==2889432==  Access not within mapped region at address 0x2FC0F200
==2889432==    at 0x484E2DB: memmove (vg_replace_strmem.c:1385)
...
Segmentation fault (core dumped)
…cc build while debugging madgraph5#903 also for C++

The test does not segfault without valgrind, but it does segfault in valgrind!
(NB this is all related to debug builds, in C++ and in CUDA)

And with testmisc.cc, valgrind gives a confusing error for C++ (cppnone here) as in CUDA:

==2893804== Process terminating with default action of signal 11 (SIGSEGV): dumping core
==2893804==  Access not within mapped region at address 0x1FFE801FF8
==2893804== Stack overflow in thread #1: can't grow stack to 0x1ffe801000
==2893804==    at 0x431835: mg5amcCpu::constexpr_sin_quad(long double, bool) (in /data/avalassi/GPU2023/madgraph4gpuX/epochX/cudacpp/gg_tt.mad/SubProcesses/P1_gg_ttx/runTest_cpp.exe)

So I disabled testmisc, but now the C++ test (cppnone here) no longer segfaults...?!
…6 builds madgraph5#904 (disabling OMP only for clang16; add -no-pie for fcheck_cpp.exe)
…6 builds madgraph5#904 (disabling OMP only for clang16; add -no-pie for fcheck_cpp.exe)
Revert "[gtest/june24] in gg_tt.mad cudacpp.mk, TEMPORARELY disable testmisc.cc build while debugging madgraph5#903 also for C++"
This reverts commit 944caab.

Will now test with clang16 (after recent fixes) and valgrind (after upgrading to 3.23)
…ster for easier merging

git checkout upstream/master $(git ls-tree --name-only HEAD */CODEGEN*txt)
…ster for easier merging

git checkout upstream/master $(git ls-tree --name-only HEAD */CODEGEN*txt)
…ph5#904: remove link-time -no-pie, add compiler-time -fPIC to fortran
…5#904: remove link-time -no-pie, add compiler-time -fPIC to fortran
…ODEGEN logs from the latest upstream/master for easier merging

git checkout upstream/master $(git ls-tree --name-only HEAD */CODEGEN*txt)
…g constexpr_sin: now valgrind on c++ runTest succeeds again?!

However cuda still fails (even without valgrind) madgraph5#903
… now valgrind runTest_cpp.exe will fail

Revert "[gtest/june24] in gg_tt.mad testmisc.cc, comment out the section using constexpr_sin: now valgrind on c++ runTest succeeds again?!"
This reverts commit 975f7aacb8661807a329ec1f51b2d7d8dba45167.
…ph5#904: remove link-time -no-pie, add compiler-time -fPIC to fortran
…5#904: remove link-time -no-pie, add compiler-time -fPIC to fortran
…et() at the end, but an abort reappears

INFO: The following Floating Point Exceptions will cause SIGFPE program aborts: FE_DIVBYZERO, FE_INVALID, FE_OVERFLOW
[==========] Running 4 tests from 4 test suites.
[----------] Global test environment set-up.
[----------] 1 test from SIGMA_SM_GG_TTX_GPU_XXX
[ RUN      ] SIGMA_SM_GG_TTX_GPU_XXX.testxxx
[       OK ] SIGMA_SM_GG_TTX_GPU_XXX.testxxx (1 ms)
[----------] 1 test from SIGMA_SM_GG_TTX_GPU_XXX (1 ms total)

[----------] 1 test from SIGMA_SM_GG_TTX_GPU_MISC
[ RUN      ] SIGMA_SM_GG_TTX_GPU_MISC.testmisc
[       OK ] SIGMA_SM_GG_TTX_GPU_MISC.testmisc (14 ms)
[----------] 1 test from SIGMA_SM_GG_TTX_GPU_MISC (14 ms total)

[----------] 1 test from SIGMA_SM_GG_TTX_GPU_MADGRAPH1
[ RUN      ] SIGMA_SM_GG_TTX_GPU_MADGRAPH1.compareMomAndME
INFO: Opening reference file ../../test/ref/dump_CPUTest.Sigma_sm_gg_ttx.txt
[       OK ] SIGMA_SM_GG_TTX_GPU_MADGRAPH1.compareMomAndME (194 ms)
[----------] 1 test from SIGMA_SM_GG_TTX_GPU_MADGRAPH1 (194 ms total)

[----------] 1 test from SIGMA_SM_GG_TTX_GPU_MADGRAPH2
[ RUN      ] SIGMA_SM_GG_TTX_GPU_MADGRAPH2.compareMomAndME
INFO: Opening reference file ../../test/ref/dump_CPUTest.Sigma_sm_gg_ttx.txt
[       OK ] SIGMA_SM_GG_TTX_GPU_MADGRAPH2.compareMomAndME (174 ms)
[----------] 1 test from SIGMA_SM_GG_TTX_GPU_MADGRAPH2 (174 ms total)

[----------] Global test environment tear-down
[==========] 4 tests from 4 test suites ran. (384 ms total)
[  PASSED  ] 4 tests.
INFO: No Floating Point Exceptions have been reported
ERROR! assertGpu: 'invalid argument' (1) in MemoryBuffers.h:155
runTest_cuda.exe: GpuRuntime.h:26: void assertGpu(cudaError_t, const char*, int, bool): Assertion `code == gpuSuccess' failed.
Aborted (core dumped)
…st.cc to the main in testxxx.cc, but an abort reappears

INFO: The following Floating Point Exceptions will cause SIGFPE program aborts: FE_DIVBYZERO, FE_INVALID, FE_OVERFLOW
[==========] Running 4 tests from 4 test suites.
[----------] Global test environment set-up.
[----------] 1 test from SIGMA_SM_GG_TTX_GPU_XXX
[ RUN      ] SIGMA_SM_GG_TTX_GPU_XXX.testxxx
[       OK ] SIGMA_SM_GG_TTX_GPU_XXX.testxxx (1 ms)
[----------] 1 test from SIGMA_SM_GG_TTX_GPU_XXX (1 ms total)

[----------] 1 test from SIGMA_SM_GG_TTX_GPU_MISC
[ RUN      ] SIGMA_SM_GG_TTX_GPU_MISC.testmisc
[       OK ] SIGMA_SM_GG_TTX_GPU_MISC.testmisc (14 ms)
[----------] 1 test from SIGMA_SM_GG_TTX_GPU_MISC (14 ms total)

[----------] 1 test from SIGMA_SM_GG_TTX_GPU_MADGRAPH1
[ RUN      ] SIGMA_SM_GG_TTX_GPU_MADGRAPH1.compareMomAndME
INFO: Opening reference file ../../test/ref/dump_CPUTest.Sigma_sm_gg_ttx.txt
[       OK ] SIGMA_SM_GG_TTX_GPU_MADGRAPH1.compareMomAndME (198 ms)
[----------] 1 test from SIGMA_SM_GG_TTX_GPU_MADGRAPH1 (198 ms total)

[----------] 1 test from SIGMA_SM_GG_TTX_GPU_MADGRAPH2
[ RUN      ] SIGMA_SM_GG_TTX_GPU_MADGRAPH2.compareMomAndME
INFO: Opening reference file ../../test/ref/dump_CPUTest.Sigma_sm_gg_ttx.txt
[       OK ] SIGMA_SM_GG_TTX_GPU_MADGRAPH2.compareMomAndME (180 ms)
[----------] 1 test from SIGMA_SM_GG_TTX_GPU_MADGRAPH2 (180 ms total)

[----------] Global test environment tear-down
[==========] 4 tests from 4 test suites ran. (395 ms total)
[  PASSED  ] 4 tests.
INFO: No Floating Point Exceptions have been reported
ERROR! assertGpu: 'invalid argument' (1) in MemoryBuffers.h:155
runTest_cuda.exe: GpuRuntime.h:26: void assertGpu(cudaError_t, const char*, int, bool): Assertion `code == gpuSuccess' failed.
Aborted (core dumped)
… to the atexit function, but this STILL crashes! madgraph5#907

WILL THEREFORE COMMENT OUT THIS CALL...

INFO: The following Floating Point Exceptions will cause SIGFPE program aborts: FE_DIVBYZERO, FE_INVALID, FE_OVERFLOW
[==========] Running 4 tests from 4 test suites.
[----------] Global test environment set-up.
[----------] 1 test from SIGMA_SM_GG_TTX_GPU_XXX
[ RUN      ] SIGMA_SM_GG_TTX_GPU_XXX.testxxx
[       OK ] SIGMA_SM_GG_TTX_GPU_XXX.testxxx (1 ms)
[----------] 1 test from SIGMA_SM_GG_TTX_GPU_XXX (1 ms total)

[----------] 1 test from SIGMA_SM_GG_TTX_GPU_MISC
[ RUN      ] SIGMA_SM_GG_TTX_GPU_MISC.testmisc
[       OK ] SIGMA_SM_GG_TTX_GPU_MISC.testmisc (14 ms)
[----------] 1 test from SIGMA_SM_GG_TTX_GPU_MISC (14 ms total)

[----------] 1 test from SIGMA_SM_GG_TTX_GPU_MADGRAPH1
[ RUN      ] SIGMA_SM_GG_TTX_GPU_MADGRAPH1.compareMomAndME
INFO: Opening reference file ../../test/ref/dump_CPUTest.Sigma_sm_gg_ttx.txt
[       OK ] SIGMA_SM_GG_TTX_GPU_MADGRAPH1.compareMomAndME (198 ms)
[----------] 1 test from SIGMA_SM_GG_TTX_GPU_MADGRAPH1 (198 ms total)

[----------] 1 test from SIGMA_SM_GG_TTX_GPU_MADGRAPH2
[ RUN      ] SIGMA_SM_GG_TTX_GPU_MADGRAPH2.compareMomAndME
INFO: Opening reference file ../../test/ref/dump_CPUTest.Sigma_sm_gg_ttx.txt
[       OK ] SIGMA_SM_GG_TTX_GPU_MADGRAPH2.compareMomAndME (179 ms)
[----------] 1 test from SIGMA_SM_GG_TTX_GPU_MADGRAPH2 (179 ms total)

[----------] Global test environment tear-down
[==========] 4 tests from 4 test suites ran. (393 ms total)
[  PASSED  ] 4 tests.
INFO: No Floating Point Exceptions have been reported
ERROR! assertGpu: 'invalid argument' (1) in MemoryBuffers.h:155
runTest_cuda.exe: GpuRuntime.h:26: void assertGpu(cudaError_t, const char*, int, bool): Assertion `code == gpuSuccess' failed.
Aborted (core dumped)
… to avoid all crashes madgraph5#907 (FIXME? avoid cuda api calls in dtors?)

INFO: The following Floating Point Exceptions will cause SIGFPE program aborts: FE_DIVBYZERO, FE_INVALID, FE_OVERFLOW
[==========] Running 4 tests from 4 test suites.
[----------] Global test environment set-up.
[----------] 1 test from SIGMA_SM_GG_TTX_GPU_XXX
[ RUN      ] SIGMA_SM_GG_TTX_GPU_XXX.testxxx
[       OK ] SIGMA_SM_GG_TTX_GPU_XXX.testxxx (1 ms)
[----------] 1 test from SIGMA_SM_GG_TTX_GPU_XXX (1 ms total)

[----------] 1 test from SIGMA_SM_GG_TTX_GPU_MISC
[ RUN      ] SIGMA_SM_GG_TTX_GPU_MISC.testmisc
[       OK ] SIGMA_SM_GG_TTX_GPU_MISC.testmisc (14 ms)
[----------] 1 test from SIGMA_SM_GG_TTX_GPU_MISC (14 ms total)

[----------] 1 test from SIGMA_SM_GG_TTX_GPU_MADGRAPH1
[ RUN      ] SIGMA_SM_GG_TTX_GPU_MADGRAPH1.compareMomAndME
INFO: Opening reference file ../../test/ref/dump_CPUTest.Sigma_sm_gg_ttx.txt
[       OK ] SIGMA_SM_GG_TTX_GPU_MADGRAPH1.compareMomAndME (199 ms)
[----------] 1 test from SIGMA_SM_GG_TTX_GPU_MADGRAPH1 (199 ms total)

[----------] 1 test from SIGMA_SM_GG_TTX_GPU_MADGRAPH2
[ RUN      ] SIGMA_SM_GG_TTX_GPU_MADGRAPH2.compareMomAndME
INFO: Opening reference file ../../test/ref/dump_CPUTest.Sigma_sm_gg_ttx.txt
[       OK ] SIGMA_SM_GG_TTX_GPU_MADGRAPH2.compareMomAndME (181 ms)
[----------] 1 test from SIGMA_SM_GG_TTX_GPU_MADGRAPH2 (181 ms total)

[----------] Global test environment tear-down
[==========] 4 tests from 4 test suites ran. (396 ms total)
[  PASSED  ] 4 tests.
INFO: No Floating Point Exceptions have been reported
INFO: No Floating Point Exceptions have been reported
…ting::Test argument to the compareME function, to allow the use of HasFailure

This essentially COMPLETES the fixes for madgraph5#907 and preparatory work for madgraph5#896
…pare to comment out test2 (preparatory work for madgraph5#896)

All tests succeed on cuda and all simd
…ry work for madgraph5#896)

All tests succeed on cuda and all simd - will backport to CODEGEN now
…st.cc, testxxx.cc: simplify gtest templates, remove cudaDeviceReset to fix madgraph5#907, complete preparation of two-test infrastructure madgraph5#896

More in detail:
- move to the simplest "TEST(" use case of Google tests in MadgraphTest.h and runTest.cc (remove unnecessary levels of templating)
- move gpuDeviceReset() to an atexit function of main in testxxx and comment it out anyway, to fix the segfaults madgraph5#907
  (eventually it may be necessary to remove all CUDA API calls from destructors, if ever we need to put this back in)
- in runTest.cc, complete a proof of concept for adding two separate tests (without/with multichannel madgraph5#896)

Fix some clang formatting issues with respect to the last gg_tt.mad
… the latest upstream/master for easier merging

git checkout upstream/master $(git ls-tree --name-only HEAD */CODEGEN*txt)
Member

@oliviermattelaer oliviermattelaer left a comment


Hi Andrea,

Thanks for this, this sounds good and I see no issues.
Obviously, this is another PR based on #905, which means that the status of this one relies on it (like #908). As in #908, I approve this PR, but we need to agree on what to do for #905 first (which should be quite easy).

Thanks,

Olivier

…r if OpenMP builds are attempted on clang16/17 (as discussed with Olivier in madgraph5#905)
…s from the latest upstream/master for easier merging

git checkout upstream/master $(git ls-tree --name-only HEAD */CODEGEN*txt)
…aster with OMP madgraph5#900 and submod madgraph5#897) into gtest

Fix conflicts in epochX/cudacpp/gg_tt.mad/CODEGEN_mad_gg_tt_log.txt
  git checkout clang gg_tt.mad/CODEGEN_mad_gg_tt_log.txt

Note: MG5AMC has been updated including mg5amcnlo#107
…s from the latest upstream/master for easier merging

git checkout upstream/master $(git ls-tree --name-only HEAD */CODEGEN*txt)
…aster with clang madgraph5#905, OMP madgraph5#900 and submod madgraph5#897) into gtest

Fix conflicts in epochX/cudacpp/gg_tt.mad/CODEGEN_mad_gg_tt_log.txt
  git checkout clang gg_tt.mad/CODEGEN_mad_gg_tt_log.txt

Note: MG5AMC has been updated including mg5amcnlo#107
…ogs from the latest upstream/master for easier merging

git checkout upstream/master $(git ls-tree --name-only HEAD */CODEGEN*txt)
@valassi
Member Author

valassi commented Jul 16, 2024

Hi Andrea,

Thanks for this, this sounds good and I see no issues. Obviously, this is another PR based on #905, which means that the status of this one relies on it (like #908). As in #908, I approve this PR, but we need to agree on what to do for #905 first (which should be quite easy).

Thanks,

Olivier

Thanks Olivier :-)

I again updated this and regenerated as a check. Will run the CI then merge.

Andrea

@valassi
Member Author

valassi commented Jul 16, 2024

The CI completed with 163 successful and 6 failing checks. This is as expected.

Merging now

@valassi valassi merged commit 606ee3b into madgraph5:master Jul 16, 2024
163 of 169 checks passed
valassi added a commit to valassi/madgraph4gpu that referenced this pull request Jul 17, 2024
…ion, removing the attempts to add two tests madgraph5#896

My last commit was showing the segfault issue madgraph5#907 solved in upcoming PR madgraph5#909 (and bits of madgraph5#908).
I will cherry pick the CODEGEN from madgraph5#909 (and madgraph5#908) first and try again.

git checkout 3eb4c29 gg_tt.mad/SubProcesses/runTest.cc
valassi added a commit to valassi/madgraph4gpu that referenced this pull request Jul 17, 2024
…ng PR madgraph5#905, constexpr_math.h PR madgraph5#908 and runTest/cudaDeviceReset PR madgraph5#909

Add valgrind.h and its symlink in the repo for gg_tt.mad

The new runTest.cc template now has a (commented out) proof of concept for including two tests (with/without multichannel, madgraph5#896); I will resume from there

After building bldall, the following succeeds
for bck in none sse4 avx2 512y 512z cuda; do echo $bck; ./build.${bck}_d_inl0_hrd0/runTest_*.exe; done

This instead is crashing (again?) for some AVX values
for bck in none sse4 avx2 512y 512z cuda; do echo $bck; valgrind ./build.${bck}_d_inl0_hrd0/runTest_*.exe; done
On closer inspection, this is because valgrind does not support AVX512, so this is ok
Successfully merging this pull request may close these issues.

segfault in CommonRandomNumberKernel for cuda when adding a second gtest