[Tensorflow] Update to version 2.5.0 #7141

Merged 3 commits on Jul 20, 2021

@smuzaffar
Contributor

smuzaffar commented Jul 15, 2021

@cmsbuild
Contributor

A new Pull Request was created by @smuzaffar (Malik Shahzad Muzaffar) for branch IB/CMSSW_12_0_X/master.

@cmsbuild, @smuzaffar, @mrodozov, @iarspider can you please review it and eventually sign? Thanks.
@silviodonato, @dpiparo, @qliphy, @perrotta you are the release manager for this.
cms-bot commands are listed here

@smuzaffar
Contributor Author

please test

FYI @riga

@cmsbuild
Contributor

-1

Summary: https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-39f2e8/16866/summary.html
COMMIT: c18fb28
CMSSW: CMSSW_12_0_X_2021-07-15-1100/slc7_amd64_gcc900
User test area: For local testing, you can use /cvmfs/cms-ci.cern.ch/week1/cms-sw/cmsdist/7141/16866/install.sh to create a dev area with all the needed externals and cmssw changes.

External Build

I found a compilation error when building:

+ DEPS='py2-tensorboard-data-server '
+ '[' 'py2-tensorboard-data-server ' '!=' '' ']'
+ echo 'ERROR: Missing dependencies for py3-tensorboard (python3) found: py2-tensorboard-data-server '
ERROR: Missing dependencies for py3-tensorboard (python3) found: py2-tensorboard-data-server 
+ exit 1
error: Bad exit status from /build/cmsbld/jenkins/workspace/ib-run-pr-tests/testBuildDir/tmp/rpm-tmp.y9NWRz (%build)


RPM build errors:
Macro %rpmbuild_libdir defined but not used within scope
Bad exit status from /build/cmsbld/jenkins/workspace/ib-run-pr-tests/testBuildDir/tmp/rpm-tmp.y9NWRz (%build)
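The guard visible in the shell trace above can be sketched roughly as follows. This is only an illustration of the three traced steps (non-empty `DEPS` test, error message, exit status 1); the function name `check_missing_deps` is hypothetical, and how the build script actually computes `DEPS` is not shown in the log, so it is taken as an input here.

```python
def check_missing_deps(package, deps):
    """Mimic the traced guard: fail when the DEPS string is non-empty.

    `package` and `deps` are plain strings; `deps` keeps its trailing
    space, exactly as it appears in the trace. The way the real build
    script fills DEPS is not visible in the log and is not modeled here.
    """
    if deps != "":
        message = "ERROR: Missing dependencies for {} (python3) found: {}".format(
            package, deps
        )
        return 1, message  # corresponds to `exit 1` in the trace
    return 0, ""  # nothing reported, the build step would continue


status, message = check_missing_deps(
    "py3-tensorboard", "py2-tensorboard-data-server "
)
print(status)
print(message)
```

With the values from this log the sketch returns status 1, which matches the `Bad exit status` reported for the `%build` step.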


@cmsbuild
Contributor

Pull request #7141 was updated.

@smuzaffar
Contributor Author

please test

@smuzaffar
Contributor Author

please test for CMSSW_12_0_X/slc7_aarch64_gcc9

@smuzaffar
Contributor Author

please test for CMSSW_12_0_X/slc7_ppc64le_gcc9

@cmsbuild
Contributor

-1

Summary: https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-39f2e8/16877/summary.html
COMMIT: 2a06179
CMSSW: CMSSW_12_0_X_2021-07-15-2300/slc7_amd64_gcc900
User test area: For local testing, you can use /cvmfs/cms-ci.cern.ch/week1/cms-sw/cmsdist/7141/16877/install.sh to create a dev area with all the needed externals and cmssw changes.

External Build

I found a compilation error when building:

+ DEPS='py2-keras-nightly '
+ '[' 'py2-keras-nightly ' '!=' '' ']'
+ echo 'ERROR: Missing dependencies for py3-tensorflow (python3) found: py2-keras-nightly '
ERROR: Missing dependencies for py3-tensorflow (python3) found: py2-keras-nightly 
+ exit 1
error: Bad exit status from /data/cmsbld/jenkins/workspace/ib-run-pr-tests/testBuildDir/tmp/rpm-tmp.q5cJMm (%build)


RPM build errors:
Macro %rpmbuild_libdir defined but not used within scope
Bad exit status from /data/cmsbld/jenkins/workspace/ib-run-pr-tests/testBuildDir/tmp/rpm-tmp.q5cJMm (%build)


@cmsbuild
Contributor

-1

Summary: https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-39f2e8/16878/summary.html
COMMIT: 2a06179
CMSSW: CMSSW_12_0_X_2021-07-15-2300/slc7_ppc64le_gcc9
User test area: For local testing, you can use /cvmfs/cms-ci.cern.ch/week1/cms-sw/cmsdist/7141/16878/install.sh to create a dev area with all the needed externals and cmssw changes.

External Build

I found a compilation error when building:

+ DEPS='py2-keras-nightly '
+ '[' 'py2-keras-nightly ' '!=' '' ']'
+ echo 'ERROR: Missing dependencies for py3-tensorflow (python3) found: py2-keras-nightly '
ERROR: Missing dependencies for py3-tensorflow (python3) found: py2-keras-nightly 
+ exit 1
error: Bad exit status from /scratch/cmsbuild/jenkins_b/workspace/ib-run-pr-tests/testBuildDir/tmp/rpm-tmp.mqaM7X (%build)


RPM build errors:
Macro %rpmbuild_libdir defined but not used within scope
Bad exit status from /scratch/cmsbuild/jenkins_b/workspace/ib-run-pr-tests/testBuildDir/tmp/rpm-tmp.mqaM7X (%build)


@cmsbuild
Contributor

Pull request #7141 was updated.

@smuzaffar
Contributor Author

please test

@smuzaffar
Contributor Author

please test for CMSSW_12_0_X/slc7_ppc64le_gcc9

@smuzaffar
Contributor Author

please test for CMSSW_12_0_X/slc7_aarch64_gcc9

@cmsbuild
Contributor

-1

Failed Tests: UnitTests
Summary: https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-39f2e8/16882/summary.html
COMMIT: 05b9896
CMSSW: CMSSW_12_0_X_2021-07-15-2300/slc7_ppc64le_gcc9
User test area: For local testing, you can use /cvmfs/cms-ci.cern.ch/week1/cms-sw/cmsdist/7141/16882/install.sh to create a dev area with all the needed externals and cmssw changes.

Unit Tests

I found errors in the following unit tests:

---> test test_PrepareInputDb had ERRORS
---> test test_MpsWorkFlow had ERRORS
---> test TestHeterogeneousCoreSonicTritonProducerGPU had ERRORS
---> test testEigenGPUNoFit_t had ERRORS
and more ...

@cmsbuild
Contributor

+1

Summary: https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-39f2e8/16881/summary.html
COMMIT: 05b9896
CMSSW: CMSSW_12_0_X_2021-07-15-2300/slc7_amd64_gcc900
User test area: For local testing, you can use /cvmfs/cms-ci.cern.ch/week1/cms-sw/cmsdist/7141/16881/install.sh to create a dev area with all the needed externals and cmssw changes.

Comparison Summary

Summary:

  • No significant changes to the logs found
  • Reco comparison results: 0 differences found in the comparisons
  • DQMHistoTests: Total files compared: 38
  • DQMHistoTests: Total histograms compared: 2786302
  • DQMHistoTests: Total failures: 1
  • DQMHistoTests: Total nulls: 0
  • DQMHistoTests: Total successes: 2786279
  • DQMHistoTests: Total skipped: 22
  • DQMHistoTests: Total Missing objects: 0
  • DQMHistoSizes: Histogram memory added: 0.0 KiB( 37 files compared)
  • Checked 160 log files, 37 edm output root files, 38 DQM output files
  • TriggerResults: no differences found

@smuzaffar
Contributor Author

enable profiling

@smuzaffar
Contributor Author

please test

@cmsbuild
Contributor

+1

Summary: https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-39f2e8/16896/summary.html
COMMIT: 05b9896
CMSSW: CMSSW_12_0_X_2021-07-15-2300/slc7_aarch64_gcc9
User test area: For local testing, you can use /cvmfs/cms-ci.cern.ch/week1/cms-sw/cmsdist/7141/16896/install.sh to create a dev area with all the needed externals and cmssw changes.

The following merge commits were also included on top of IB + this PR after doing git cms-merge-topic:

You can see more details here:
https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-39f2e8/16896/git-recent-commits.json
https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-39f2e8/16896/git-merge-result

@cmsbuild
Contributor

+1

Summary: https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-39f2e8/16905/summary.html
COMMIT: 05b9896
CMSSW: CMSSW_12_0_X_2021-07-15-2300/slc7_amd64_gcc900
Additional Tests: PROFILING
User test area: For local testing, you can use /cvmfs/cms-ci.cern.ch/week1/cms-sw/cmsdist/7141/16905/install.sh to create a dev area with all the needed externals and cmssw changes.

The following merge commits were also included on top of IB + this PR after doing git cms-merge-topic:

You can see more details here:
https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-39f2e8/16905/git-recent-commits.json
https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-39f2e8/16905/git-merge-result

Comparison Summary

Summary:

  • No significant changes to the logs found
  • Reco comparison results: 4 differences found in the comparisons
  • DQMHistoTests: Total files compared: 38
  • DQMHistoTests: Total histograms compared: 2786302
  • DQMHistoTests: Total failures: 6
  • DQMHistoTests: Total nulls: 1
  • DQMHistoTests: Total successes: 2786273
  • DQMHistoTests: Total skipped: 22
  • DQMHistoTests: Total Missing objects: 0
  • DQMHistoSizes: Histogram memory added: -0.004 KiB( 37 files compared)
  • DQMHistoSizes: changed ( 312.0 ): -0.004 KiB MessageLogger/Warnings
  • Checked 160 log files, 37 edm output root files, 38 DQM output files
  • TriggerResults: no differences found
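As a quick sanity check on the DQMHistoTests numbers, the counts from the two slc7_amd64_gcc900 comparison summaries (test runs 16881 and 16905) can be tallied as below. This is a small sketch that assumes failures, nulls, successes, and skipped together partition the total number of histograms compared; the bot output does not state that explicitly. The `runs` dictionary and the `tally` helper are illustrative names only.

```python
# Total histograms compared, as reported in both summaries.
TOTAL_HISTOGRAMS = 2786302

# Counts copied from the two amd64 comparison summaries in this thread.
runs = {
    "16881": {"failures": 1, "nulls": 0, "successes": 2786279, "skipped": 22},
    "16905": {"failures": 6, "nulls": 1, "successes": 2786273, "skipped": 22},
}


def tally(counts):
    """Sum the per-category counts of one comparison summary."""
    return sum(counts.values())


for run_id, counts in runs.items():
    # Under the stated assumption, each tally should equal the reported total.
    print(run_id, tally(counts), tally(counts) == TOTAL_HISTOGRAMS)
```

Under that assumption both runs account for every compared histogram, so the profiling run differs from the earlier one only in how six histograms moved from successes to failures or nulls.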

@smuzaffar
Contributor Author

assign reconstruction
@cms-sw/reconstruction-l2 the TensorFlow 2.5.0 update looks good. Can you please check the profiling results and sign this PR if everything looks good?

@cmsbuild
Contributor

New categories assigned: reconstruction

@slava77,@perrotta,@jpata you have been requested to review this Pull request/Issue and eventually sign? Thanks

@smuzaffar
Contributor Author

@slava77,@perrotta,@jpata any objections to going forward with the TF 2.5.0 integration?

@slava77
Contributor

slava77 commented Jul 19, 2021

> @slava77,@perrotta,@jpata any objections to going forward with the TF 2.5.0 integration?

https://cmssdt.cern.ch/circles/web/piechart.php?local=false&dataset=CMSSW_12_0_X_2021-07-15-2300%2Fslc7_amd64_gcc900%2F11834.21%2FPR-39f2e8%2F16905%2Fstep4_PAT_PU.resources&resource=time_thread&colours=default&groups=packages&threshold=0
vs. baseline:
https://cmssdt.cern.ch/circles/web/piechart.php?local=false&dataset=CMSSW_12_0_X_2021-07-15-2300%2Fslc7_amd64_gcc900%2F11834.21%2Fstep4_PAT_PU.resources&resource=time_thread&colours=default&groups=packages&threshold=0

The timing of the deepTau and deepMet modules looks consistent (both are a few percent slower in this PR's tests, but some other timings are up as well, so this is most likely measurement instability). Also, there are no differences.

So, it should be fine to merge for reco needs.

@smuzaffar
Contributor Author

+externals
