Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Update CUDA to version 10.0.326, on aarch64 (ARMv8 64) #5192

Conversation

fwyzard
Copy link
Contributor

@fwyzard fwyzard commented Sep 3, 2019

For aarch64 (ARMv8 64), update to JetPack 4.2.1:

  • Linux for Tegra L4T R32.2
  • CUDA version 10.0.326 for ARMv8

See the release notes at https://docs.nvidia.com/jetson/jetpack/release-notes/index.html .

@fwyzard
Copy link
Contributor Author

fwyzard commented Sep 3, 2019

@cmsbuild, please test

@cmsbuild
Copy link
Contributor

cmsbuild commented Sep 3, 2019

The tests are being triggered in jenkins.
https://cmssdt.cern.ch/jenkins/job/ib-run-pr-tests/2327/console Started: 2019/09/03 11:50

@cmsbuild
Copy link
Contributor

cmsbuild commented Sep 3, 2019

A new Pull Request was created by @fwyzard (Andrea Bocci) for branch IB/CMSSW_11_0_X/gcc700.

@cmsbuild, @smuzaffar, @mrodozov can you please review it and eventually sign? Thanks.
cms-bot commands are listed here

@cmsbuild
Copy link
Contributor

cmsbuild commented Sep 3, 2019

@cmsbuild
Copy link
Contributor

cmsbuild commented Sep 3, 2019

Comparison job queued.

cuda.spec Outdated
@@ -55,7 +55,7 @@ ar p %{SOURCE0} data.tar.xz | tar xv --xz -C %_builddir/tmp
# extract the contents from the individual .deb archives into
# %_builddir/tmp/usr/local/cuda-10.0/...
for FILE in %_builddir/tmp/var/cuda-repo-10-0-local-%{realversion}/*.deb; do
ar p $FILE data.tar.xz | tar xv --xz -C %_builddir/tmp
ar p %FILE data.tar.xz | tar xv --xz -C %_builddir/tmp
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@fwyzard , do you really mean %FILE here?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

no... thanks for catching this !

For aarch64 (ARMv8 64), update to JetPack 4.2.1:
  * Linux for Tegra L4T R32.2
  * CUDA version 10.0.326 for ARMv8

See the release notes at https://docs.nvidia.com/jetson/jetpack/release-notes/index.html .
@fwyzard fwyzard force-pushed the IB/CMSSW_11_0_X/gcc700_jetpack_421 branch from 8b7376c to 987d929 Compare September 3, 2019 15:39
@fwyzard
Copy link
Contributor Author

fwyzard commented Sep 3, 2019

@cmsbuild, please test

@cmsbuild
Copy link
Contributor

cmsbuild commented Sep 3, 2019

The tests are being triggered in jenkins.
https://cmssdt.cern.ch/jenkins/job/ib-run-pr-tests/2332/console Started: 2019/09/03 17:40

@cmsbuild
Copy link
Contributor

cmsbuild commented Sep 3, 2019

Pull request #5192 was updated.

@cmsbuild
Copy link
Contributor

cmsbuild commented Sep 3, 2019

Comparison is ready
https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-880bb4/2327/summary.html

Comparison Summary:

  • No significant changes to the logs found
  • Reco comparison results: 4 differences found in the comparisons
  • DQMHistoTests: Total files compared: 34
  • DQMHistoTests: Total histograms compared: 2955700
  • DQMHistoTests: Total failures: 2
  • DQMHistoTests: Total nulls: 0
  • DQMHistoTests: Total successes: 2955357
  • DQMHistoTests: Total skipped: 341
  • DQMHistoTests: Total Missing objects: 0
  • DQMHistoSizes: Histogram memory added: 0.0 KiB( 33 files compared)
  • Checked 145 log files, 15 edm output root files, 34 DQM output files

@cmsbuild
Copy link
Contributor

cmsbuild commented Sep 3, 2019

@cmsbuild
Copy link
Contributor

cmsbuild commented Sep 3, 2019

Comparison job queued.

@cmsbuild
Copy link
Contributor

cmsbuild commented Sep 3, 2019

Comparison is ready
https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-880bb4/2332/summary.html

Comparison Summary:

  • No significant changes to the logs found
  • Reco comparison results: 0 differences found in the comparisons
  • DQMHistoTests: Total files compared: 34
  • DQMHistoTests: Total histograms compared: 2955700
  • DQMHistoTests: Total failures: 4
  • DQMHistoTests: Total nulls: 0
  • DQMHistoTests: Total successes: 2955355
  • DQMHistoTests: Total skipped: 341
  • DQMHistoTests: Total Missing objects: 0
  • DQMHistoSizes: Histogram memory added: 0.0 KiB( 33 files compared)
  • Checked 145 log files, 15 edm output root files, 34 DQM output files

@smuzaffar
Copy link
Contributor

please test for slc7_aarch64_gcc700

@cmsbuild
Copy link
Contributor

cmsbuild commented Sep 3, 2019

The tests are being triggered in jenkins.
https://cmssdt.cern.ch/jenkins/job/ib-run-pr-tests/2339/console Started: 2019/09/03 22:24

@smuzaffar
Copy link
Contributor

+externals

@cmsbuild
Copy link
Contributor

cmsbuild commented Sep 3, 2019

This pull request is fully signed and it will be integrated in one of the next IB/CMSSW_11_0_X/gcc700 IBs after it passes the integration tests. This pull request will now be reviewed by the release team before it's merged. @davidlange6, @slava77, @smuzaffar, @fabiocos (and backports should be raised in the release meeting by the corresponding L2)

@cmsbuild
Copy link
Contributor

cmsbuild commented Sep 3, 2019

@cmsbuild
Copy link
Contributor

cmsbuild commented Sep 3, 2019

Comparison job queued.

@smuzaffar smuzaffar merged commit efac949 into cms-sw:IB/CMSSW_11_0_X/gcc700 Sep 4, 2019
@smuzaffar
Copy link
Contributor

@fwyzard , we have two new machines (from techlab) [a] but our gou unit tests fail on these [b]. Looks like we need -gencode=arch=compute_75,code=sm_75 . Any objection adding these and can we drop any of the existing one?

[a]

cat /proc/driver/nvidia/version 
NVRM version: NVIDIA UNIX x86_64 Kernel Module  418.87.00  Thu Aug  8 15:35:46 CDT 2019
GCC version:  gcc version 4.8.5 20150623 (Red Hat 4.8.5-36) (GCC) 

[b]

nDevices = 1
invalid device function
test_calo_rechit: src/DataFormats/CaloRecHit/test/test_calo_rechit.cu:20: test_calo_rechit()::<lambda(auto:1)> [with auto:1 = cudaError]: Assertion `false' failed.
	/bin/sh: line 1: 15182 Aborted                 test_calo_rechit

@fwyzard
Copy link
Contributor Author

fwyzard commented Sep 17, 2019 via email

@fwyzard fwyzard deleted the IB/CMSSW_11_0_X/gcc700_jetpack_421 branch April 16, 2020 13:14
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

3 participants