Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Update to CUDA 11.0 RC (11.0.182) #5935

Conversation

fwyzard
Copy link
Contributor

@fwyzard fwyzard commented Jun 12, 2020

Update to CUDA 11.0 RC:

  • CUDA version 11.0.182
  • NVIDIA drivers version 450.36.06

Use the same package structure on Intel/AMD (x86_64), Power (ppc64le) and ARMv8/SBSA (aarch64).

Include support for c++17, gcc 9, clang 9.

See https://docs.nvidia.com/cuda/cuda-toolkit-release-notes/index.html#title-new-features .

@fwyzard
Copy link
Contributor Author

fwyzard commented Jun 12, 2020

please test

@cmsbuild
Copy link
Contributor

The tests are being triggered in jenkins.

@cmsbuild
Copy link
Contributor

A new Pull Request was created by @fwyzard (Andrea Bocci) for branch IB/CMSSW_11_2_X/master.

@cmsbuild, @smuzaffar, @mrodozov, @tulamor can you please review it and eventually sign? Thanks.
cms-bot commands are listed here

@fwyzard
Copy link
Contributor Author

fwyzard commented Jun 12, 2020

abort

@cmsbuild
Copy link
Contributor

Jenkins tests are aborted.

@fwyzard fwyzard force-pushed the IB/CMSSW_11_2_X/master-CUDA_11_0_RC branch from 3197fa8 to 2051095 Compare June 12, 2020 18:18
@fwyzard
Copy link
Contributor Author

fwyzard commented Jun 12, 2020

please test

@cmsbuild
Copy link
Contributor

cmsbuild commented Jun 12, 2020

The tests are being triggered in jenkins.
https://cmssdt.cern.ch/jenkins/job/ib-run-pr-tests/7010/console Started: 2020/06/12 20:29

@cmsbuild
Copy link
Contributor

Pull request #5935 was updated.

@cmsbuild
Copy link
Contributor

-1

Tested at: 2051095

  • Build:

I found compilation error when building:

+ mv /data/cmsbld/jenkins/workspace/ib-run-pr-tests/testBuildDir/BUILD/slc7_amd64_gcc820/external/cuda/11.0.1-822ae7/build/drivers/libcuda.so.450.36.06 /data/cmsbld/jenkins/workspace/ib-run-pr-tests/testBuildDir/tmp/BUILDROOT/552f79b48d3201b48f14f7132c8017d2/opt/cmssw/slc7_amd64_gcc820/external/cuda/11.0.1-822ae7/drivers/
+ ln -sf libcuda.so.450.36.06 /data/cmsbld/jenkins/workspace/ib-run-pr-tests/testBuildDir/tmp/BUILDROOT/552f79b48d3201b48f14f7132c8017d2/opt/cmssw/slc7_amd64_gcc820/external/cuda/11.0.1-822ae7/drivers/libcuda.so.1
+ ln -sf libcuda.so.1 /data/cmsbld/jenkins/workspace/ib-run-pr-tests/testBuildDir/tmp/BUILDROOT/552f79b48d3201b48f14f7132c8017d2/opt/cmssw/slc7_amd64_gcc820/external/cuda/11.0.1-822ae7/drivers/libcuda.so
+ mv /data/cmsbld/jenkins/workspace/ib-run-pr-tests/testBuildDir/BUILD/slc7_amd64_gcc820/external/cuda/11.0.1-822ae7/build/drivers/libnvidia-fatbinaryloader.so.450.36.06 /data/cmsbld/jenkins/workspace/ib-run-pr-tests/testBuildDir/tmp/BUILDROOT/552f79b48d3201b48f14f7132c8017d2/opt/cmssw/slc7_amd64_gcc820/external/cuda/11.0.1-822ae7/drivers/
mv: cannot stat '/data/cmsbld/jenkins/workspace/ib-run-pr-tests/testBuildDir/BUILD/slc7_amd64_gcc820/external/cuda/11.0.1-822ae7/build/drivers/libnvidia-fatbinaryloader.so.450.36.06': No such file or directory
error: Bad exit status from /data/cmsbld/jenkins/workspace/ib-run-pr-tests/testBuildDir/tmp/rpm-tmp.NoCcKx (%install)


RPM build errors:
Bad exit status from /data/cmsbld/jenkins/workspace/ib-run-pr-tests/testBuildDir/tmp/rpm-tmp.NoCcKx (%install)



You can see the results of the tests here:
https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-822ae7/7010/summary.html

@cmsbuild
Copy link
Contributor

Pull request #5935 was updated.

@fwyzard fwyzard force-pushed the IB/CMSSW_11_2_X/master-CUDA_11_0_RC branch from 823b56d to 6b6b373 Compare June 12, 2020 19:54
@cmsbuild
Copy link
Contributor

Pull request #5935 was updated.

@cmsbuild
Copy link
Contributor

Comparison job queued.

@smuzaffar
Copy link
Contributor

@fwyzard , this looks good to go. Do you have any other changesing mind? One simplification we can do is to set cppstd in cuda-flags.file and then use it where needed e.g in cuda-toolfile.spec, cupla.spec and cuda-flags.file

@fwyzard
Copy link
Contributor Author

fwyzard commented Jun 14, 2020

Do you mean having a way to synchronise the -std=c++## flags in

?
Yes, that sounds like a good idea.

Should I do it here, or open a separate PR for it ?

We should also update the cub toolkit, now that it comes as a part of the cuda package.

@smuzaffar
Copy link
Contributor

yes that is the idea, so that in future we only update host std flag in one place.
Yes please, update it here, I will rerun a final test for production architecture ( just to get the labels green).

Define in a central place the C++ standard to use for building host and device code with nvcc.
@cmsbuild
Copy link
Contributor

Pull request #5935 was updated.

@fwyzard
Copy link
Contributor Author

fwyzard commented Jun 14, 2020

Let's see if it works ...

@fwyzard
Copy link
Contributor Author

fwyzard commented Jun 14, 2020

@cmsbuild, please test

@cmsbuild
Copy link
Contributor

cmsbuild commented Jun 14, 2020

The tests are being triggered in jenkins.

@cmsbuild
Copy link
Contributor

+1
Tested at: d27811b
https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-822ae7/7055/summary.html
CMSSW: CMSSW_11_2_X_2020-06-14-2300
SCRAM_ARCH: slc7_amd64_gcc820

@cmsbuild
Copy link
Contributor

Comparison job queued.

@cmsbuild
Copy link
Contributor

Comparison is ready
https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-822ae7/7055/summary.html

Comparison Summary:

  • No significant changes to the logs found
  • Reco comparison results: 4 differences found in the comparisons
  • DQMHistoTests: Total files compared: 36
  • DQMHistoTests: Total histograms compared: 2778811
  • DQMHistoTests: Total failures: 4
  • DQMHistoTests: Total nulls: 0
  • DQMHistoTests: Total successes: 2778757
  • DQMHistoTests: Total skipped: 50
  • DQMHistoTests: Total Missing objects: 0
  • DQMHistoSizes: Histogram memory added: 0.0 KiB( 35 files compared)
  • Checked 152 log files, 16 edm output root files, 36 DQM output files

@smuzaffar
Copy link
Contributor

+externals

@cmsbuild
Copy link
Contributor

This pull request is fully signed and it will be integrated in one of the next IB/CMSSW_11_2_X/master IBs (tests are also fine). This pull request will now be reviewed by the release team before it's merged. @silviodonato, @dpiparo (and backports should be raised in the release meeting by the corresponding L2)

@smuzaffar smuzaffar merged commit bf5e05c into cms-sw:IB/CMSSW_11_2_X/master Jun 15, 2020
@fwyzard fwyzard deleted the IB/CMSSW_11_2_X/master-CUDA_11_0_RC branch July 28, 2020 16:04
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

3 participants