Jorgen's addition of HIP/CUDA abstraction to codegen and all processes (with extra fixes and tests: "jt774" replacing PR #774) #801
Conversation
…rc.mk and cudacpp.mk
…se HIP else neither" in CODEGEN cudacpp.mk
Fix conflicts:
epochX/cudacpp/CODEGEN/PLUGIN/CUDACPP_SA_OUTPUT/madgraph/iolibs/template_files/gpu/check_sa.cc
epochX/cudacpp/CODEGEN/PLUGIN/CUDACPP_SA_OUTPUT/madgraph/iolibs/template_files/gpu/cudacpp.mk
epochX/cudacpp/CODEGEN/PLUGIN/CUDACPP_SA_OUTPUT/madgraph/iolibs/template_files/gpu/cudacpp_src.mk
epochX/cudacpp/CODEGEN/PLUGIN/CUDACPP_SA_OUTPUT/madgraph/iolibs/template_files/gpu/mgOnGpuConfig.h
epochX/cudacpp/CODEGEN/PLUGIN/CUDACPP_SA_OUTPUT/output.py
…ce % by %% (code generation was failing)
…EN (code generation was failing clang formatting checks)
CUDA_HOME=none HIP_HOME=none make |& more
...
ccache g++ -O3 -std=c++17 -I. -fPIC -Wall -Wshadow -Wextra -ffast-math -fopenmp -march=skylake-avx512 -mprefer-vector-width=256 -DMGONGPU_FPTYPE_DOUBLE -DMGONGPU_FPTYPE2_DOUBLE -DMGONGPU_HAS_NO_CURAND -fPIC -c Parameters_sm.cc -o Parameters_sm.o
In file included from /usr/include/c++/11/locale:41,
                 from /usr/include/c++/11/iomanip:43,
                 from Parameters_sm.cc:17:
/usr/include/c++/11/bits/locale_facets_nonio.h:59:39: error: ‘locale’ has not been declared
   59 |     struct __timepunct_cache : public locale::facet
      |                                       ^~~~~~
This reverts commit def02b5.
…ors.h and process_matrix.inc as in branch jthip24. These are changes that in that branch I included in commit 6e90139 (Tue Jul 18 18:25:34 2023 +0200), which consisted of a backport to CODEGEN of earlier changes in ggttggg.mad.
…GONGPUCPP_GPUIMPL... not clear why this was not done yet. In branch jthip24, this is coming from Jorgen's commit 6741186 (Thu Jul 13 15:15:41 2023 +0200), which includes many such changes.
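For readers not familiar with this renaming, here is a minimal sketch (illustrative only, not code from this PR) of the kind of change being backported: CUDA-compiler-specific guards such as __CUDACC__ are replaced by a single MGONGPUCPP_GPUIMPL macro that can cover both the CUDA and the HIP builds. The kernel name below is hypothetical.

```cpp
// Illustrative sketch only (not code from this PR): replace CUDA-compiler
// guards by a single GPU-implementation guard that also covers HIP builds.
//
// Before: device code only compiled when nvcc defines __CUDACC__
//   #ifdef __CUDACC__
//   __global__ void someKernel();
//   #endif
//
// After: device code compiled whenever a GPU backend is selected
#ifdef MGONGPUCPP_GPUIMPL
__global__ void someKernelSketch() {} // built by nvcc or hipcc (hypothetical name)
#endif

int main() { return 0; } // placeholder so the sketch also compiles as plain C++
```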
…ggttgg.mad on Tue Jul 18 18:11:04 2023 +0200)
Fix conflicts:
epochX/cudacpp/CODEGEN/PLUGIN/CUDACPP_SA_OUTPUT/madgraph/iolibs/template_files/cpp_model_parameters_h.inc
epochX/cudacpp/CODEGEN/PLUGIN/CUDACPP_SA_OUTPUT/madgraph/iolibs/template_files/gpu/check_sa.cc
epochX/cudacpp/CODEGEN/PLUGIN/CUDACPP_SA_OUTPUT/madgraph/iolibs/template_files/gpu/mgOnGpuConfig.h
NB: this is very strange, because this same commit 1b5c0fd is already included in the jt774 branch earlier on...
So for me this simple madgraph test is not working anymore:
The crash is:
Does it work for you, Andrea?
I was using gpucpp_wrap; indeed gpucpp is working. Let me check with that one whether SIMD is now working on my Mac.
Hi @oliviermattelaer thanks, yes I am using the gpucpp branch. Or more precisely: I am using whatever mg5amcnlo commit we have in the module, which is a rather old version of gpucpp. If you want, I can also try to include the upgrade to the latest gpucpp here. But please first check with 23f61b9... again, I'd avoid mixing everything together if possible (it's ok if it works, but if it does not work I would rather debug the issues separately). Thanks!
PS: 23f61b9 is also what we have now in master.
I cannot test with that MG5aMC version since my Python version is too recent for that "old" version of MG5aMC...
Ok, it does not work for me actually.
Note for later: we will need to re-run the test on that commit, since the reported crash is a GitHub API issue (i.e. not related to the commit per se).
…ted before Olivier's commit]
STARTED AT Tue Jan 30 01:27:55 AM CET 2024
./tput/teeThroughputX.sh -mix -hrd -makej -eemumu -ggtt -ggttg -ggttgg -gqttq -ggttggg -makeclean
ENDED(1) AT Tue Jan 30 05:12:08 AM CET 2024 [Status=0]
./tput/teeThroughputX.sh -flt -hrd -makej -eemumu -ggtt -ggttgg -inlonly -makeclean
ENDED(2) AT Tue Jan 30 05:41:46 AM CET 2024 [Status=0]
./tput/teeThroughputX.sh -makej -eemumu -ggtt -ggttg -gqttq -ggttgg -ggttggg -flt -bridge -makeclean
ENDED(3) AT Tue Jan 30 05:52:05 AM CET 2024 [Status=0]
./tput/teeThroughputX.sh -eemumu -ggtt -ggttgg -flt -rmbhst
ENDED(4) AT Tue Jan 30 05:55:35 AM CET 2024 [Status=0]
./tput/teeThroughputX.sh -eemumu -ggtt -ggttgg -flt -curhst
ENDED(5) AT Tue Jan 30 05:59:00 AM CET 2024 [Status=0]
…ted before Olivier's commit]
STARTED AT Tue Jan 30 06:02:30 AM CET 2024
ENDED AT Tue Jan 30 10:37:48 AM CET 2024
Status=0
24 /data/avalassi/GPU2023/madgraph4gpuX/epochX/cudacpp/tmad/logs_eemumu_mad/log_eemumu_mad_d_inl0_hrd0.txt
24 /data/avalassi/GPU2023/madgraph4gpuX/epochX/cudacpp/tmad/logs_eemumu_mad/log_eemumu_mad_f_inl0_hrd0.txt
24 /data/avalassi/GPU2023/madgraph4gpuX/epochX/cudacpp/tmad/logs_eemumu_mad/log_eemumu_mad_m_inl0_hrd0.txt
24 /data/avalassi/GPU2023/madgraph4gpuX/epochX/cudacpp/tmad/logs_ggttggg_mad/log_ggttggg_mad_d_inl0_hrd0.txt
24 /data/avalassi/GPU2023/madgraph4gpuX/epochX/cudacpp/tmad/logs_ggttggg_mad/log_ggttggg_mad_f_inl0_hrd0.txt
24 /data/avalassi/GPU2023/madgraph4gpuX/epochX/cudacpp/tmad/logs_ggttggg_mad/log_ggttggg_mad_m_inl0_hrd0.txt
24 /data/avalassi/GPU2023/madgraph4gpuX/epochX/cudacpp/tmad/logs_ggttgg_mad/log_ggttgg_mad_d_inl0_hrd0.txt
24 /data/avalassi/GPU2023/madgraph4gpuX/epochX/cudacpp/tmad/logs_ggttgg_mad/log_ggttgg_mad_f_inl0_hrd0.txt
24 /data/avalassi/GPU2023/madgraph4gpuX/epochX/cudacpp/tmad/logs_ggttgg_mad/log_ggttgg_mad_m_inl0_hrd0.txt
24 /data/avalassi/GPU2023/madgraph4gpuX/epochX/cudacpp/tmad/logs_ggttg_mad/log_ggttg_mad_d_inl0_hrd0.txt
24 /data/avalassi/GPU2023/madgraph4gpuX/epochX/cudacpp/tmad/logs_ggttg_mad/log_ggttg_mad_f_inl0_hrd0.txt
24 /data/avalassi/GPU2023/madgraph4gpuX/epochX/cudacpp/tmad/logs_ggttg_mad/log_ggttg_mad_m_inl0_hrd0.txt
24 /data/avalassi/GPU2023/madgraph4gpuX/epochX/cudacpp/tmad/logs_ggtt_mad/log_ggtt_mad_d_inl0_hrd0.txt
24 /data/avalassi/GPU2023/madgraph4gpuX/epochX/cudacpp/tmad/logs_ggtt_mad/log_ggtt_mad_f_inl0_hrd0.txt
24 /data/avalassi/GPU2023/madgraph4gpuX/epochX/cudacpp/tmad/logs_ggtt_mad/log_ggtt_mad_m_inl0_hrd0.txt
24 /data/avalassi/GPU2023/madgraph4gpuX/epochX/cudacpp/tmad/logs_gqttq_mad/log_gqttq_mad_d_inl0_hrd0.txt
24 /data/avalassi/GPU2023/madgraph4gpuX/epochX/cudacpp/tmad/logs_gqttq_mad/log_gqttq_mad_f_inl0_hrd0.txt
24 /data/avalassi/GPU2023/madgraph4gpuX/epochX/cudacpp/tmad/logs_gqttq_mad/log_gqttq_mad_m_inl0_hrd0.txt
Hi @oliviermattelaer thanks for testing and for adding the fix 9ed3aaf. As you saw, I opened PR #811 in parallel about the update of mg5amcnlo to the latest gpucpp. But I suggest we postpone that and use the older gpucpp for this PR #801. And yes, we do too much for GitHub and its CI ;-)
….py changes, while c++/cuda/hip is unchanged
I have checked the plugin/interface part and this is good to merge for me. However, this would need adding a new backend within launch_plugin.py. I would say that our target here would be to be able to run via the following script:
(maybe a good time to change the name of that variable)
Let me change the flag to approve, since my "requested changes" are not strictly necessary for us to merge this (in that case, I will create a new branch that will implement such changes and ask you to check them).
…qttq (madgraph5#806)
(1) Step 1 - build on the login node (almost 24 hours!)
STARTED AT Tue 30 Jan 2024 02:27:18 AM EET
./tput/teeThroughputX.sh -mix -hrd -makej -eemumu -ggtt -ggttg -ggttgg -gqttq -ggttggg -makeclean -makeonly
ENDED(1) AT Wed 31 Jan 2024 12:32:21 AM EET [Status=0]
./tput/teeThroughputX.sh -flt -hrd -makej -eemumu -ggtt -ggttgg -inlonly -makeclean -makeonly
ENDED(2) AT Wed 31 Jan 2024 01:01:06 AM EET [Status=0]
./tput/teeThroughputX.sh -makej -eemumu -ggtt -ggttg -gqttq -ggttgg -ggttggg -flt -bridge -makeclean -makeonly
ENDED(3) AT Wed 31 Jan 2024 01:13:56 AM EET [Status=0]
./tput/teeThroughputX.sh -eemumu -ggtt -ggttgg -flt -rmbhst -makeonly
ENDED(4) AT Wed 31 Jan 2024 01:16:06 AM EET [Status=0]
./tput/teeThroughputX.sh -eemumu -ggtt -ggttgg -flt -rorhst -makeonly
ENDED(5) AT Wed 31 Jan 2024 01:18:44 AM EET [Status=0]
(2) Step 2 - run tests on the worker node (less than 2 hours)
NB this is "./tput/allTees.sh" WITHOUT the -hip flag (no "-rorhst" added)
STARTED AT Wed 31 Jan 2024 01:16:39 PM EET
./tput/teeThroughputX.sh -mix -hrd -makej -eemumu -ggtt -ggttg -ggttgg -gqttq -ggttggg -makeclean
ENDED(1) AT Wed 31 Jan 2024 02:09:05 PM EET [Status=2]
./tput/teeThroughputX.sh -flt -hrd -makej -eemumu -ggtt -ggttgg -inlonly -makeclean
ENDED(2) AT Wed 31 Jan 2024 02:26:12 PM EET [Status=0]
./tput/teeThroughputX.sh -makej -eemumu -ggtt -ggttg -gqttq -ggttgg -ggttggg -flt -bridge -makeclean
ENDED(3) AT Wed 31 Jan 2024 02:45:10 PM EET [Status=2]
./tput/teeThroughputX.sh -eemumu -ggtt -ggttgg -flt -rmbhst
ENDED(4) AT Wed 31 Jan 2024 02:48:54 PM EET [Status=0]
./tput/teeThroughputX.sh -eemumu -ggtt -ggttgg -flt -curhst
ENDED(5) AT Wed 31 Jan 2024 02:51:15 PM EET [Status=0]
./tput/logs_gqttq_mad/log_gqttq_mad_d_inl0_hrd0_bridge.txt:Backtrace for this error:
./tput/logs_gqttq_mad/log_gqttq_mad_d_inl0_hrd0_bridge.txt:ERROR! Fortran calculation (F77/CUDA) crashed
./tput/logs_gqttq_mad/log_gqttq_mad_m_inl0_hrd1.txt:Backtrace for this error:
./tput/logs_gqttq_mad/log_gqttq_mad_m_inl0_hrd1.txt:ERROR! Fortran calculation (F77/CUDA) crashed
./tput/logs_gqttq_mad/log_gqttq_mad_d_inl0_hrd1.txt:Backtrace for this error:
./tput/logs_gqttq_mad/log_gqttq_mad_d_inl0_hrd1.txt:ERROR! Fortran calculation (F77/CUDA) crashed
./tput/logs_gqttq_mad/log_gqttq_mad_d_inl0_hrd0.txt:Backtrace for this error:
./tput/logs_gqttq_mad/log_gqttq_mad_d_inl0_hrd0.txt:ERROR! Fortran calculation (F77/CUDA) crashed
./tput/logs_gqttq_mad/log_gqttq_mad_m_inl0_hrd0.txt:Backtrace for this error:
./tput/logs_gqttq_mad/log_gqttq_mad_m_inl0_hrd0.txt:ERROR! Fortran calculation (F77/CUDA) crashed
./tput/logs_gqttq_mad/log_gqttq_mad_f_inl0_hrd1.txt:Backtrace for this error:
./tput/logs_gqttq_mad/log_gqttq_mad_f_inl0_hrd1.txt:ERROR! Fortran calculation (F77/CUDA) crashed
./tput/logs_gqttq_mad/log_gqttq_mad_f_inl0_hrd0.txt:Backtrace for this error:
./tput/logs_gqttq_mad/log_gqttq_mad_f_inl0_hrd0.txt:ERROR! Fortran calculation (F77/CUDA) crashed
./tput/logs_gqttq_mad/log_gqttq_mad_f_inl0_hrd0_bridge.txt:Backtrace for this error:
./tput/logs_gqttq_mad/log_gqttq_mad_f_inl0_hrd0_bridge.txt:ERROR! Fortran calculation (F77/CUDA) crashed
…qttq (madgraph5#806)
NB this is "./tmad/allTees.sh" WITHOUT the -hip flag (no "-rorhst" added)
STARTED AT Wed 31 Jan 2024 02:54:59 PM EET
ENDED AT Wed 31 Jan 2024 06:02:10 PM EET
Status=0
16 /users/valassia/GPU2023/madgraph4gpuX/epochX/cudacpp/tmad/logs_eemumu_mad/log_eemumu_mad_d_inl0_hrd0.txt
16 /users/valassia/GPU2023/madgraph4gpuX/epochX/cudacpp/tmad/logs_eemumu_mad/log_eemumu_mad_f_inl0_hrd0.txt
16 /users/valassia/GPU2023/madgraph4gpuX/epochX/cudacpp/tmad/logs_eemumu_mad/log_eemumu_mad_m_inl0_hrd0.txt
16 /users/valassia/GPU2023/madgraph4gpuX/epochX/cudacpp/tmad/logs_ggttggg_mad/log_ggttggg_mad_d_inl0_hrd0.txt
16 /users/valassia/GPU2023/madgraph4gpuX/epochX/cudacpp/tmad/logs_ggttggg_mad/log_ggttggg_mad_f_inl0_hrd0.txt
16 /users/valassia/GPU2023/madgraph4gpuX/epochX/cudacpp/tmad/logs_ggttggg_mad/log_ggttggg_mad_m_inl0_hrd0.txt
16 /users/valassia/GPU2023/madgraph4gpuX/epochX/cudacpp/tmad/logs_ggttgg_mad/log_ggttgg_mad_d_inl0_hrd0.txt
16 /users/valassia/GPU2023/madgraph4gpuX/epochX/cudacpp/tmad/logs_ggttgg_mad/log_ggttgg_mad_f_inl0_hrd0.txt
16 /users/valassia/GPU2023/madgraph4gpuX/epochX/cudacpp/tmad/logs_ggttgg_mad/log_ggttgg_mad_m_inl0_hrd0.txt
16 /users/valassia/GPU2023/madgraph4gpuX/epochX/cudacpp/tmad/logs_ggttg_mad/log_ggttg_mad_d_inl0_hrd0.txt
16 /users/valassia/GPU2023/madgraph4gpuX/epochX/cudacpp/tmad/logs_ggttg_mad/log_ggttg_mad_f_inl0_hrd0.txt
16 /users/valassia/GPU2023/madgraph4gpuX/epochX/cudacpp/tmad/logs_ggttg_mad/log_ggttg_mad_m_inl0_hrd0.txt
16 /users/valassia/GPU2023/madgraph4gpuX/epochX/cudacpp/tmad/logs_ggtt_mad/log_ggtt_mad_d_inl0_hrd0.txt
16 /users/valassia/GPU2023/madgraph4gpuX/epochX/cudacpp/tmad/logs_ggtt_mad/log_ggtt_mad_f_inl0_hrd0.txt
16 /users/valassia/GPU2023/madgraph4gpuX/epochX/cudacpp/tmad/logs_ggtt_mad/log_ggtt_mad_m_inl0_hrd0.txt
12 /users/valassia/GPU2023/madgraph4gpuX/epochX/cudacpp/tmad/logs_gqttq_mad/log_gqttq_mad_d_inl0_hrd0.txt
12 /users/valassia/GPU2023/madgraph4gpuX/epochX/cudacpp/tmad/logs_gqttq_mad/log_gqttq_mad_f_inl0_hrd0.txt
12 /users/valassia/GPU2023/madgraph4gpuX/epochX/cudacpp/tmad/logs_gqttq_mad/log_gqttq_mad_m_inl0_hrd0.txt
…d90 before merging to master
Thanks Olivier, I will merge. I committed a few more manual test logs, all OK. And the CI is good. After this 801 is merged, I would suggest going in the following order:
Also, later on,
And then, later, all the other non-HIP stuff (warp size, more on makefiles, etc).
And thanks @Jooorgen again! :-)
…are to merge upstream/master with HIP madgraph5#801
git checkout 0dc3d50~ $(git ls-tree --name-only HEAD */CODEGEN*txt)
… from PR madgraph5#801) into gpucpp
…the mg5amcnlo update: no changes except in codegen logs (changes in individual processes have been merged already)
…raph5#801 and gpucpp PR madgraph5#811) into rocrand
Fix conflicts here (plus some in gg_tt.mad fixed by checking out the rocrand version):
epochX/cudacpp/CODEGEN/PLUGIN/CUDACPP_SA_OUTPUT/madgraph/iolibs/template_files/gpu/check_sa.cc
epochX/cudacpp/tput/allTees.sh
epochX/cudacpp/tput/throughputX.sh
…raph5#801 and gpucpp PR madgraph5#811, and possibly more) into mch
…nd maybe more)
** rerun 18 tmad tests on itscrd90, all ok
STARTED AT Sat Feb 3 07:02:02 PM CET 2024
ENDED AT Sat Feb 3 11:20:09 PM CET 2024
Status=0
24 /data/avalassi/GPU2023/madgraph4gpuX/epochX/cudacpp/tmad/logs_eemumu_mad/log_eemumu_mad_d_inl0_hrd0.txt
24 /data/avalassi/GPU2023/madgraph4gpuX/epochX/cudacpp/tmad/logs_eemumu_mad/log_eemumu_mad_f_inl0_hrd0.txt
24 /data/avalassi/GPU2023/madgraph4gpuX/epochX/cudacpp/tmad/logs_eemumu_mad/log_eemumu_mad_m_inl0_hrd0.txt
24 /data/avalassi/GPU2023/madgraph4gpuX/epochX/cudacpp/tmad/logs_ggttggg_mad/log_ggttggg_mad_d_inl0_hrd0.txt
24 /data/avalassi/GPU2023/madgraph4gpuX/epochX/cudacpp/tmad/logs_ggttggg_mad/log_ggttggg_mad_f_inl0_hrd0.txt
24 /data/avalassi/GPU2023/madgraph4gpuX/epochX/cudacpp/tmad/logs_ggttggg_mad/log_ggttggg_mad_m_inl0_hrd0.txt
24 /data/avalassi/GPU2023/madgraph4gpuX/epochX/cudacpp/tmad/logs_ggttgg_mad/log_ggttgg_mad_d_inl0_hrd0.txt
24 /data/avalassi/GPU2023/madgraph4gpuX/epochX/cudacpp/tmad/logs_ggttgg_mad/log_ggttgg_mad_f_inl0_hrd0.txt
24 /data/avalassi/GPU2023/madgraph4gpuX/epochX/cudacpp/tmad/logs_ggttgg_mad/log_ggttgg_mad_m_inl0_hrd0.txt
24 /data/avalassi/GPU2023/madgraph4gpuX/epochX/cudacpp/tmad/logs_ggttg_mad/log_ggttg_mad_d_inl0_hrd0.txt
24 /data/avalassi/GPU2023/madgraph4gpuX/epochX/cudacpp/tmad/logs_ggttg_mad/log_ggttg_mad_f_inl0_hrd0.txt
24 /data/avalassi/GPU2023/madgraph4gpuX/epochX/cudacpp/tmad/logs_ggttg_mad/log_ggttg_mad_m_inl0_hrd0.txt
24 /data/avalassi/GPU2023/madgraph4gpuX/epochX/cudacpp/tmad/logs_ggtt_mad/log_ggtt_mad_d_inl0_hrd0.txt
24 /data/avalassi/GPU2023/madgraph4gpuX/epochX/cudacpp/tmad/logs_ggtt_mad/log_ggtt_mad_f_inl0_hrd0.txt
24 /data/avalassi/GPU2023/madgraph4gpuX/epochX/cudacpp/tmad/logs_ggtt_mad/log_ggtt_mad_m_inl0_hrd0.txt
24 /data/avalassi/GPU2023/madgraph4gpuX/epochX/cudacpp/tmad/logs_gqttq_mad/log_gqttq_mad_d_inl0_hrd0.txt
24 /data/avalassi/GPU2023/madgraph4gpuX/epochX/cudacpp/tmad/logs_gqttq_mad/log_gqttq_mad_f_inl0_hrd0.txt
24 /data/avalassi/GPU2023/madgraph4gpuX/epochX/cudacpp/tmad/logs_gqttq_mad/log_gqttq_mad_m_inl0_hrd0.txt
…dgraph5#801 and gpucpp PR madgraph5#811) into makefiles
Fix conflicts in epochX/cudacpp/CODEGEN/PLUGIN/CUDACPP_SA_OUTPUT/madgraph/iolibs/template_files/gpu/cudacpp.mk
Hi @oliviermattelaer @roiser @nscottnichols, as discussed, this is the WIP PR replacing Jorgen's PR #774.
It includes all of Jorgen's changes for HIP in PR #774 (GpuAbstraction.h and hipcc build instructions) in CODEGEN, plus a merge of the current upstream/master and extra fixes that were necessary to fix codegen and/or CUDA/C++ builds. Those extra fixes were partly derived from earlier work I had done with Jorgen in July/August (kept for reference in PR #800, which I opened and immediately closed).
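For context, the sketch below shows the general shape of a CUDA/HIP abstraction header of this kind: a single set of gpu-prefixed aliases that resolve to either the CUDA or the HIP runtime API depending on the compiler in use. This is only an illustration under that assumption; the macro names and selection logic in the actual GpuAbstraction.h may differ.

```cpp
// Minimal sketch of a CUDA/HIP abstraction header (illustrative only; the
// actual GpuAbstraction.h in this PR may use different names and logic).
// Backend-agnostic process code uses the gpu* aliases, and the compiler
// decides whether they map to the CUDA or the HIP runtime API.
#if defined( __HIPCC__ )
#include <hip/hip_runtime.h>
#define gpuError_t hipError_t
#define gpuSuccess hipSuccess
#define gpuMalloc hipMalloc
#define gpuFree hipFree
#define gpuMemcpy hipMemcpy
#define gpuMemcpyHostToDevice hipMemcpyHostToDevice
#define gpuMemcpyDeviceToHost hipMemcpyDeviceToHost
#define gpuDeviceSynchronize hipDeviceSynchronize
#elif defined( __CUDACC__ )
#include <cuda_runtime.h>
#define gpuError_t cudaError_t
#define gpuSuccess cudaSuccess
#define gpuMalloc cudaMalloc
#define gpuFree cudaFree
#define gpuMemcpy cudaMemcpy
#define gpuMemcpyHostToDevice cudaMemcpyHostToDevice
#define gpuMemcpyDeviceToHost cudaMemcpyDeviceToHost
#define gpuDeviceSynchronize cudaDeviceSynchronize
#endif
// Example usage in backend-agnostic code (only compiled for a GPU backend):
//   double* devBuf = nullptr;
//   gpuError_t err = gpuMalloc( (void**)&devBuf, nBytes );
//   if( err == gpuSuccess ) gpuFree( devBuf );
```

In this sketch the makefile (cudacpp.mk) is assumed to do nothing more than select nvcc or hipcc, with the header absorbing the remaining runtime-API differences.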
This PR has a fully functioning CODEGEN and regenerated processes, with basic builds tested for ggtt.mad, but it is still WIP.
I will close PR #774 because it is replaced by this one.