Upgrade to Olivier's latest gpucpp branch upstream ("split_nonidentical_grouping") #619

valassi · 2023-03-31T09:18:41Z

Upgrade to Olivier's latest gpucpp branch upstream

The only change is that in eemumu.mad the P1_ll_ll subdirectory is now P1_epem_mupmum

Thisis related to #272, see #272 (comment) . Th elast change by @oliviermattelaer now prevents directories with nprocesses>1. See the discussion at the meeting earlier this week https://indico.cern.ch/event/1263518/

valassi · 2023-04-03T15:01:13Z

This PR in itself is complete and could be merged, but I would like to do a few things before moving there. I keep this in draft for the moment.

Olivier's changes I refer to are those that introduce the "split_nonidentical_grouping" mode: mg5amcnlo/mg5amcnlo@cd272f8 and mg5amcnlo/mg5amcnlo@aa37739

…l ok with no change This completes the first "susy" patch: now susy_gg_tt can be generated correctly (but it does not build). In practice, the main (only?) issue it addresses is madgraph5#622 Further patches (susy2 and possibly more) will attempt to fix these builds. NB: At this stage, CODEGEN is still using the upstream mg5amcnlo without "split_nonidentical_grouping" (PR madgraph5#619 and madgraph5#272)

…(revert to previous 11 codegen logs for easier rebasing) Revert "[susy] ** COMPLETE SUSY (PART 1) ** regenerate five processes mad, all ok with no change" This reverts commit acbe689. Revert "[susy] regenerate 6 processes SA, all ok with no change" This reverts commit 806e7d7.

…o change

…w gpucpp usptream This will make it easier to check if there are any other differences... git mv ee_mumu.mad/SubProcesses/P1_ll_ll/ ee_mumu.mad/SubProcesses/P1_epem_mupmum/

…ing of the P1 subdirectory

…t and tmad

STARTED AT Fri Mar 31 10:10:01 AM CEST 2023 ./tput/teeThroughputX.sh -mix -hrd -makej -eemumu -ggtt -ggttg -ggttgg -ggttggg -makeclean ENDED(1) AT Fri Mar 31 10:26:06 AM CEST 2023 [Status=0] ./tput/teeThroughputX.sh -flt -hrd -makej -eemumu -ggtt -ggttgg -inlonly -makeclean ENDED(2) AT Fri Mar 31 10:30:57 AM CEST 2023 [Status=0] ./tput/teeThroughputX.sh -makej -eemumu -ggtt -ggttg -ggttgg -ggttggg -flt -bridge -makeclean ENDED(3) AT Fri Mar 31 10:32:03 AM CEST 2023 [Status=0] ./tput/teeThroughputX.sh -eemumu -ggtt -ggttgg -flt -rmbhst ENDED(4) AT Fri Mar 31 10:32:52 AM CEST 2023 [Status=0] ./tput/teeThroughputX.sh -eemumu -ggtt -ggttgg -flt -curhst ENDED(5) AT Fri Mar 31 10:33:40 AM CEST 2023 [Status=0] (Later rerun the tests as the GPU was not configured correctly)

…nly for eemumu Revert "[gpucpp] TEMPORARY hack to only run eemumu testst in allTees from tput and tmad" This reverts commit 6634d0f.

valassi · 2023-04-03T15:37:37Z

I have rebased to after the first susy patch #622

NB this has a single P1 subdirectory gq_ttxq! (but nprocesses=1 in each see madgraph5#272) The Fortran code has DSIG1 and also DSIG2. Clearly the DSIG2 code is not correctly interfaced to our cudacpp The code fails to build: ... ccache g++ -O3 -std=c++17 -I. -I../../src -I../../../../../tools -DUSE_NVTX -Wall -Wshadow -Wextra -ffast-math -fopenmp -march=skylake-avx512 -mprefer-vector-width=256 -DMGONGPU_FPTYPE_DOUBLE -DMGONGPU_FPTYPE2_DOUBLE -I/usr/local/cuda-12.0/include/ -fPIC -c CPPProcess.cc -o CPPProcess.o In file included from CPPProcess.cc:25: coloramps.h:18:3: error: too many initializers for ‘const bool [5][4]’ 18 | }; | ^ ... ccache /usr/local/cuda-12.0/bin/nvcc -O3 -lineinfo -I. -I../../src -I../../../../../tools -I/usr/local/cuda-12.0/include/ -DUSE_NVTX -gencode arch=compute_70,code=compute_70 -gencode arch=compute_70,code=sm_70 -use_fast_math -std=c++17 -ccbin /usr/lib64/ccache/g++ -DMGONGPU_FPTYPE_DOUBLE -DMGONGPU_FPTYPE2_DOUBLE -Xcompiler -fPIC -c gCPPProcess.cu -o gCPPProcess.o coloramps.h(13): error: too many initializer values NB this is still using the old upstream mg5amcnlo before Olivier's split_nonidentical_grouping (see madgraph5#619) I have previewed that after merging that, there will be two separate P1 subdirectories also in .mad (as in .sa)

valassi · 2023-04-05T05:42:19Z

I have completed several tests for the effect of this MR on a process with DSIG2 (gq to qqt, in #626 which will include this as a merge half-way). This can now be merged.

Self-merging.

cc: @oliviermattelaer @roiser @zeniheisser

NB this has a single P1 subdirectory gq_ttxq! (but nprocesses=1 in each see madgraph5#272) The Fortran code has DSIG1 and also DSIG2. Clearly the DSIG2 code is not correctly interfaced to our cudacpp The code fails to build: ... ccache g++ -O3 -std=c++17 -I. -I../../src -I../../../../../tools -DUSE_NVTX -Wall -Wshadow -Wextra -ffast-math -fopenmp -march=skylake-avx512 -mprefer-vector-width=256 -DMGONGPU_FPTYPE_DOUBLE -DMGONGPU_FPTYPE2_DOUBLE -I/usr/local/cuda-12.0/include/ -fPIC -c CPPProcess.cc -o CPPProcess.o In file included from CPPProcess.cc:25: coloramps.h:18:3: error: too many initializers for ‘const bool [5][4]’ 18 | }; | ^ ... ccache /usr/local/cuda-12.0/bin/nvcc -O3 -lineinfo -I. -I../../src -I../../../../../tools -I/usr/local/cuda-12.0/include/ -DUSE_NVTX -gencode arch=compute_70,code=compute_70 -gencode arch=compute_70,code=sm_70 -use_fast_math -std=c++17 -ccbin /usr/lib64/ccache/g++ -DMGONGPU_FPTYPE_DOUBLE -DMGONGPU_FPTYPE2_DOUBLE -Xcompiler -fPIC -c gCPPProcess.cu -o gCPPProcess.o coloramps.h(13): error: too many initializer values NB this is still using the old upstream mg5amcnlo before Olivier's split_nonidentical_grouping (see madgraph5#619) I have previewed that after merging that, there will be two separate P1 subdirectories also in .mad (as in .sa)

…proc and codegen_procid (for sanity checks only madgraph5#272 and madgraph5#619)

…oc and codegen_procid (for sanity checks only madgraph5#272 and madgraph5#619)

… (prepare to regenerate with Olivier's madgraph5#619 patch that splits in in two) git mv gq_ttq.mad/SubProcesses/P1_gq_ttxq gq_ttq.mad/SubProcesses/P1_gu_ttxu cp -dpr gq_ttq.mad/SubProcesses/P1_gu_ttxu gq_ttq.mad/SubProcesses/P1_gux_ttxux/ git add gq_ttq.mad/SubProcesses/P1_gux_ttxux/

This merges the contents of MR madgraph5#619 "split_nonidentical_grouping" I will now regenerate gq_ttq with the new upstream code by Olivier, which removes DSIG2 and fixes the issues

…tical_grouping madgraph5#619 Two (different) P1 subdirectories are now generated, each with only one DSIG1 (i.e. with MAXSPROC=1). Previously a single P1 was generated, with DSIG1 and DSIG2 (i.e. with MAXSPROC=2). Note also that a single LOGICAL ICOLAMP(4,5,2) is now replaced by two separate LOGICAL ICOLAMP(4,5,1) Note however that nprocesses is always 1 in the cudacpp code in all P1 before and after. The code builds and check.exe runs successfully in the two P1 directories (previously the build was failing)

…1_gux_ttxux to P1_gu_ttxu The gqttq tests fail anyway and will need to be fixed (madgraph5#630). However, this completes the addition of gq_ttq as a new process to the repo. In particular it includes proof that Olivier's "split_nonidentical_grouping" madgraph5#619 fixes the gqttq builds. It also includes a lot of cleanup for "nprocesses" (madgraph5#272 and madgraph5#343) Revert "[gqttq] retry the tmad gqttq test with the P1_gu_ttxu directory - the test continues to fail (madgraph5#630)" This reverts commit 2dea1f7. Revert "[gqttq] temporarely use P1_gu_ttxu instead of P1_gux_ttxux for gqttq tmad tests" This reverts commit ea23a9a.

…dgraph5#272 and madgraph5#343 (see also PRs madgraph5#619, madgraph5#626, madgraph5#360 and madgraph5#396)

…proc and codegen_procid (for sanity checks only madgraph5/madgraph4gpu#272 and madgraph5/madgraph4gpu#619)

…oc and codegen_procid (for sanity checks only madgraph5/madgraph4gpu#272 and madgraph5/madgraph4gpu#619)

…dgraph5/madgraph4gpu#272 and madgraph5/madgraph4gpu#343 (see also PRs madgraph5/madgraph4gpu#619, madgraph5/madgraph4gpu#626, madgraph5/madgraph4gpu#360 and madgraph5/madgraph4gpu#396)

valassi mentioned this pull request Mar 31, 2023

Add an example of a calculation with nprocesses>1 #272

Closed

valassi marked this pull request as draft April 3, 2023 14:52

valassi changed the title ~~Upgrade to Olivier's latest gpucpp branch upstream~~ WIP: Upgrade to Olivier's latest gpucpp branch upstream Apr 3, 2023

valassi changed the title ~~WIP: Upgrade to Olivier's latest gpucpp branch upstream~~ WIP: Upgrade to Olivier's latest gpucpp branch upstream ("split_nonidentical_grouping") Apr 3, 2023

This was referenced Apr 3, 2023

Fixes for SUSY code generation of gg_tt #624

Merged

fix susy gg_tt builds and add it to the repo #625

Merged

valassi added 11 commits April 3, 2023 17:35

[gpucpp] Upgrade CODEGEN to Olivier's latest gpucpp upstream

31ca407

[gpucpp] regenerate siz processes SA with new gpucpp - no change

843109d

[gpucpp] regenerate 4 ggtt* mad processes with new gpucpp usptream, n…

7d0f181

…o change

[gpucpp] manually rename eemumu P1 subdirectory as expected by the ne…

5a11737

…w gpucpp usptream This will make it easier to check if there are any other differences... git mv ee_mumu.mad/SubProcesses/P1_ll_ll/ ee_mumu.mad/SubProcesses/P1_epem_mupmum/

[gpucpp] regenerate eemumu.mad - all is the same except for the renam…

a8287a7

…ing of the P1 subdirectory

[gpucpp] rename P1_ll_ll as P1_epem_mupmum in tput and tmad scripts

1738f82

[gpucpp] TEMPORARY hack to only run eemumu testst in allTees from tpu…

5cd187b

…t and tmad

[gpucpp] rerun tmad alltees for eemumu only

0483f02

[gpucpp] ** COMPLETE GPUCPP ** revert temporary hack to run allTees o…

422872b

…nly for eemumu Revert "[gpucpp] TEMPORARY hack to only run eemumu testst in allTees from tput and tmad" This reverts commit 6634d0f.

valassi force-pushed the gpucpp branch from d5ce77b to 422872b Compare April 3, 2023 15:37

This was referenced Apr 3, 2023

Add SM gq to ttq (and gq to ttllq) - example of a process with DSIG1 and DSIG2 #626

Merged

My "vecsize" patches have broken mg5amcnlo "launch" for the default Fortran madevent #629

Closed

valassi self-assigned this Apr 5, 2023

valassi marked this pull request as ready for review April 5, 2023 05:40

valassi merged commit 5f646a2 into madgraph5:master Apr 5, 2023

valassi added a commit to valassi/madgraph4gpu that referenced this pull request Apr 5, 2023

[gqttq] in CODEGEN sigmakin template, replace nprocesses by codegen_n…

090ee11

…proc and codegen_procid (for sanity checks only madgraph5#272 and madgraph5#619)

valassi added a commit to valassi/madgraph4gpu that referenced this pull request Apr 5, 2023

[gqttq] in CODEGEN sigmakin template, improve handling of codegen_npr…

0d32c1d

…oc and codegen_procid (for sanity checks only madgraph5#272 and madgraph5#619)

valassi mentioned this pull request Apr 5, 2023

xsec from fortran and cpp differ in gg_uu tmad tests (bug in getGoodHel implementation) #630

Closed

valassi changed the title ~~WIP: Upgrade to Olivier's latest gpucpp branch upstream ("split_nonidentical_grouping")~~ Upgrade to Olivier's latest gpucpp branch upstream ("split_nonidentical_grouping") Apr 6, 2023

valassi added a commit to valassi/madgraph4gpu that referenced this pull request Apr 7, 2023

[gqttq] in CODEGEN, improve the comment on nprocesses>2 for issues ma…

0033615

…dgraph5#272 and madgraph5#343 (see also PRs madgraph5#619, madgraph5#626, madgraph5#360 and madgraph5#396)

valassi mentioned this pull request Apr 7, 2023

processes for the paper #344

Open

valassi added a commit to mg5amcnlo/mg5amcnlo_cudacpp that referenced this pull request Aug 16, 2023

[gqttq] in CODEGEN sigmakin template, replace nprocesses by codegen_n…

c123859

…proc and codegen_procid (for sanity checks only madgraph5/madgraph4gpu#272 and madgraph5/madgraph4gpu#619)

valassi added a commit to mg5amcnlo/mg5amcnlo_cudacpp that referenced this pull request Aug 16, 2023

[gqttq] in CODEGEN sigmakin template, improve handling of codegen_npr…

bacc669

…oc and codegen_procid (for sanity checks only madgraph5/madgraph4gpu#272 and madgraph5/madgraph4gpu#619)

valassi mentioned this pull request Jul 24, 2024

WIP add pp_tt to repo (plus obsolete fixes for bug 872 via reset_cumulative_variable, now fixed by Olivier via a single helicity filter) #935

Draft

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Upgrade to Olivier's latest gpucpp branch upstream ("split_nonidentical_grouping") #619

Upgrade to Olivier's latest gpucpp branch upstream ("split_nonidentical_grouping") #619

valassi commented Mar 31, 2023

valassi commented Apr 3, 2023

valassi commented Apr 3, 2023

valassi commented Apr 5, 2023

Upgrade to Olivier's latest gpucpp branch upstream ("split_nonidentical_grouping") #619

Upgrade to Olivier's latest gpucpp branch upstream ("split_nonidentical_grouping") #619

Conversation

valassi commented Mar 31, 2023

valassi commented Apr 3, 2023

valassi commented Apr 3, 2023

valassi commented Apr 5, 2023