Build ROCM 7.1.0 to check tests on GPU #10600
Conversation
|
A new Pull Request was created by @akritkbehera for branch IB/CMSSW_17_0_X/rocm. @akritkbehera, @cmsbuild, @iarspider, @raoatifshad, @smuzaffar can you please review it and eventually sign? Thanks. |
|
cms-bot internal usage |
|
enable gpu |
|
please test for CMSSW_17_0_ROCM_X |
|
please abort |
|
enable gpu |
|
please test for CMSSW_17_0_ROCM_X |
|
-1 Summary: https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-63daa1/53617/summary.html Failed External BuildI found compilation error when building: libcudacxx_DIR:PATH=/data/cmsbld/jenkins/workspace/ib-run-pr-tests/testBuildDir/el8_amd64_gcc13/external/cuda/12.9.1-2f902b8cd69fc02665180a65ec16b3a4/lib64/cmake/libcudacxx
nlohmann_json_DIR:PATH=/data/cmsbld/jenkins/workspace/ib-run-pr-tests/testBuildDir/el8_amd64_gcc13/external/json/3.12.0-97f7be797298126e9adee032bbaec39f/share/cmake/nlohmann_json
pybind11_DIR:PATH=/data/cmsbld/jenkins/workspace/ib-run-pr-tests/testBuildDir/el8_amd64_gcc13/external/py3-pybind11/3.0.1-40a12f1b4fe2393aef934ccb44fb2efc/share/cmake/pybind11
rocprim_DIR:PATH=/data/cmsbld/jenkins/workspace/ib-run-pr-tests/testBuildDir/el8_amd64_gcc13/external/rocprim/rocm-7.1.0-8951effe5969e9fbd67ae983b5434da0/lib/cmake/rocprim
rocthrust_DIR:PATH=/data/cmsbld/jenkins/workspace/ib-run-pr-tests/testBuildDir/el8_amd64_gcc13/external/rocthrust/rocm-7.1.0-daff82ac6ea95b1872a079c529915cf8/lib/cmake/rocthrust
error: Bad exit status from /data/cmsbld/jenkins/workspace/ib-run-pr-tests/testBuildDir/tmp/rpm-tmp.EjFLhh (%build)
RPM build warnings:
Macro expanded in comment on line 488: %{pkginstroot}/python
|
|
Pull request #10600 was updated. |
|
please test for CMSSW_17_0_ROCM_X |
|
Pull request #10600 was updated. |
|
please test for CMSSW_17_0_ROCM_X |
|
-1 Failed Tests: Build Failed BuildI found compilation error when building: >> Leaving Package Utilities/RelMon >> Package Utilities/RelMon built Copying tmp/el8_amd64_gcc13/src/DataFormats/SoATemplate/test/SoACustomizedMethodsHip/libSoACustomizedMethodsHip_rocm.a to productstore area: cp: cannot stat 'tmp/el8_amd64_gcc13/src/DataFormats/SoATemplate/test/SoACustomizedMethodsHip/libSoACustomizedMethodsHip_rocm.a': No such file or directory >> Deleted: tmp/el8_amd64_gcc13/src/DataFormats/SoATemplate/test/SoACustomizedMethodsHip/libSoACustomizedMethodsHip_rocm.a gmake: *** [config/SCRAM/GMake/Makefile.rules:1920: tmp/el8_amd64_gcc13/src/DataFormats/SoATemplate/test/SoACustomizedMethodsHip/libSoACustomizedMethodsHip_rocm.a] Error 1 Copying tmp/el8_amd64_gcc13/src/DataFormats/SoATemplate/test/testRocmSoALayoutAndView_t/libtestRocmSoALayoutAndView_t_rocm.a to productstore area: cp: cannot stat 'tmp/el8_amd64_gcc13/src/DataFormats/SoATemplate/test/testRocmSoALayoutAndView_t/libtestRocmSoALayoutAndView_t_rocm.a': No such file or directory >> Deleted: tmp/el8_amd64_gcc13/src/DataFormats/SoATemplate/test/testRocmSoALayoutAndView_t/libtestRocmSoALayoutAndView_t_rocm.a gmake: *** [config/SCRAM/GMake/Makefile.rules:1920: tmp/el8_amd64_gcc13/src/DataFormats/SoATemplate/test/testRocmSoALayoutAndView_t/libtestRocmSoALayoutAndView_t_rocm.a] Error 1 Copying tmp/el8_amd64_gcc13/src/DataFormats/TrivialSerialisation/test/TestDataFormatsTrivialSerialisationPortableROCmAsync/libTestDataFormatsTrivialSerialisationPortableROCmAsync_rocm.a to productstore area: |
|
So ibamd_comgr.so.3 is back built to being build as a shared library the patches weren't enough. The rest errors are hip not being able to find the device library objects. Hmm |
|
Pull request #10600 was updated. |
|
please test for CMSSW_17_0_ROCM_X |
|
Pull request #10600 was updated. |
|
enable gpu |
|
please test for CMSSW_17_0_ROCM_X |
|
-1 Failed Tests: nvidia_h100UnitTests The following merge commits were also included on top of IB + this PR after doing git cms-merge-topic: You can see more details here: Comparison SummarySummary:
|
|
gpu relvals didn't run? |
4515043
into
cms-sw:IB/CMSSW_17_0_X/rocm
No description provided.