
Add Service type in an exception message from ServiceRegistry #39636

Merged
merged 2 commits into cms-sw:master from serviceRegistryExceptionMessage
Oct 10, 2022

Conversation

makortel
Contributor

@makortel makortel commented Oct 5, 2022

PR description:

This PR adds the Service type to the exception message thrown when a ServiceRegistry has not been set for the current thread. This information could help in debugging the semi-rare failures in HLT tests that were reported in #38801 (comment).
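
The idea can be illustrated with a minimal C++ sketch. This is not the actual FWCore/ServiceRegistry code: the names `Registry`, `currentRegistry`, `getService`, `missingRegistryMessage`, and `ExampleService` are hypothetical, and the message wording is an assumption. The sketch only shows how a templated accessor can name the requested type when no registry is set for the calling thread.

```cpp
#include <stdexcept>
#include <string>
#include <typeinfo>

// Hypothetical stand-in for a per-thread service registry (not the CMSSW class).
struct Registry {};

// Hypothetical example service type, used only to instantiate the template.
struct ExampleService {};

// Hypothetical pointer to the registry active on the current thread.
thread_local Registry* currentRegistry = nullptr;

// Builds the error text and appends the requested service type.
// Note: typeid(T).name() is implementation-defined and may return a mangled name.
template <typename T>
std::string missingRegistryMessage() {
  return std::string("No ServiceRegistry has been set for this thread. Requested service type: ") +
         typeid(T).name();
}

// Returns the service of type T, or throws with the type name in the message
// when no registry has been set for the calling thread.
template <typename T>
T& getService() {
  if (currentRegistry == nullptr) {
    throw std::runtime_error(missingRegistryMessage<T>());
  }
  static T instance;  // placeholder for a real lookup in the registry
  return instance;
}
```

With this shape, calling `getService<ExampleService>()` on a thread where `currentRegistry` is still null throws an exception whose text identifies `ExampleService`, which narrows down which caller ran outside a registry context.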

PR validation:

Code compiles

@cmsbuild
Contributor

cmsbuild commented Oct 5, 2022

+code-checks

Logs: https://cmssdt.cern.ch/SDT/code-checks/cms-sw-PR-39636/32434

  • This PR adds an extra 12KB to repository

@cmsbuild
Contributor

cmsbuild commented Oct 5, 2022

A new Pull Request was created by @makortel (Matti Kortelainen) for master.

It involves the following packages:

  • FWCore/ServiceRegistry (core)

@cmsbuild, @smuzaffar, @Dr15Jones, @makortel can you please review it and eventually sign? Thanks.
@missirol, @wddgit, @fwyzard this is something you requested to watch as well.
@perrotta, @dpiparo, @rappoccio you are the release manager for this.

cms-bot commands are listed here

@makortel
Contributor Author

makortel commented Oct 5, 2022

@cmsbuild, please test

@makortel
Contributor Author

makortel commented Oct 5, 2022

@cmsbuild, please abort

@cmsbuild
Contributor

cmsbuild commented Oct 5, 2022

+code-checks

Logs: https://cmssdt.cern.ch/SDT/code-checks/cms-sw-PR-39636/32439

  • This PR adds an extra 16KB to repository

@cmsbuild
Contributor

cmsbuild commented Oct 5, 2022

Pull request #39636 was updated. @cmsbuild, @smuzaffar, @Dr15Jones, @makortel can you please check and sign again.

@makortel
Contributor Author

makortel commented Oct 5, 2022

@cmsbuild, please test

@cmsbuild
Contributor

cmsbuild commented Oct 6, 2022

-1

Failed Tests: UnitTests RelVals-INPUT AddOn
Summary: https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-0130f6/28033/summary.html
COMMIT: 730ad74
CMSSW: CMSSW_12_6_X_2022-10-05-1100/el8_amd64_gcc10
User test area: For local testing, you can use /cvmfs/cms-ci.cern.ch/week1/cms-sw/cmssw/39636/28033/install.sh to create a dev area with all the needed externals and cmssw changes.

Unit Tests

I found errors in the following unit tests:

---> test condTestRegression had ERRORS
---> test SiStripDAQ_O2O_test had ERRORS

RelVals-INPUT

  • 140.56 140.56_RunHI2018+RunHI2018+RECOHID18+HARVESTDHI18/step2_RunHI2018+RunHI2018+RECOHID18+HARVESTDHI18.log

AddOn Tests

  • hlt_mc_HIon cmsRun /cvmfs/cms-ib.cern.ch/nweek-02753/el8_amd64_gcc10/cms/cmssw-patch/CMSSW_12_6_X_2022-10-05-1100/src/HLTrigger/Configuration/test/OnLine_HLT_HIon.py realData=True globalTag=@ inputFiles=@ : FAILED - time: date Thu Oct 6 12:37:00 2022-date Thu Oct 6 12:31:00 2022 s - exit: 23040

Comparison Summary

There are some workflows for which there are errors in the baseline:
10224.15 step 5
11634.15 step 3
140.002 step 3
140.034 step 3
140.116 step 3
25202.15 step 5
The results for the comparisons for these workflows could be incomplete
This most likely means that the IB is having errors in the relvals. The error does NOT come from this pull request

@slava77 comparisons for the following workflows were not done due to missing matrix map:

  • /data/cmsbld/jenkins/workspace/compare-root-files-short-matrix/data/PR-0130f6/41834.0_TTbar_14TeV+2026D94+TTbar_14TeV_TuneCP5_GenSimHLBeamSpot14+DigiTrigger+RecoGlobal+HARVESTGlobal

Summary:

  • No significant changes to the logs found
  • Reco comparison results: 5 differences found in the comparisons
  • DQMHistoTests: Total files compared: 49
  • DQMHistoTests: Total histograms compared: 3391103
  • DQMHistoTests: Total failures: 6
  • DQMHistoTests: Total nulls: 0
  • DQMHistoTests: Total successes: 3391075
  • DQMHistoTests: Total skipped: 22
  • DQMHistoTests: Total Missing objects: 0
  • DQMHistoSizes: Histogram memory added: 0.0 KiB( 48 files compared)
  • Checked 204 log files, 49 edm output root files, 49 DQM output files
  • TriggerResults: no differences found

@cmsbuild
Contributor

cmsbuild commented Oct 6, 2022

This pull request is fully signed and it will be integrated in one of the next master IBs (but tests are reportedly failing). This pull request will now be reviewed by the release team before it's merged. @perrotta, @dpiparo, @rappoccio (and backports should be raised in the release meeting by the corresponding L2)

@rappoccio
Contributor

Hoping the failed tests are spurious, will try again.

@rappoccio
Contributor

please test

@cmsbuild
Contributor

cmsbuild commented Oct 8, 2022

+1

Summary: https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-0130f6/28113/summary.html
COMMIT: 730ad74
CMSSW: CMSSW_12_6_X_2022-10-07-1100/el8_amd64_gcc10
User test area: For local testing, you can use /cvmfs/cms-ci.cern.ch/week1/cms-sw/cmssw/39636/28113/install.sh to create a dev area with all the needed externals and cmssw changes.

Comparison Summary

@slava77 comparisons for the following workflows were not done due to missing matrix map:

  • /data/cmsbld/jenkins/workspace/compare-root-files-short-matrix/data/PR-0130f6/41834.0_TTbar_14TeV+2026D94+TTbar_14TeV_TuneCP5_GenSimHLBeamSpot14+DigiTrigger+RecoGlobal+HARVESTGlobal

Summary:

  • No significant changes to the logs found
  • Reco comparison results: 10 differences found in the comparisons
  • DQMHistoTests: Total files compared: 49
  • DQMHistoTests: Total histograms compared: 3391103
  • DQMHistoTests: Total failures: 90
  • DQMHistoTests: Total nulls: 0
  • DQMHistoTests: Total successes: 3390991
  • DQMHistoTests: Total skipped: 22
  • DQMHistoTests: Total Missing objects: 0
  • DQMHistoSizes: Histogram memory added: 0.0 KiB( 48 files compared)
  • Checked 204 log files, 49 edm output root files, 49 DQM output files
  • TriggerResults: no differences found

@smuzaffar
Contributor

smuzaffar commented Oct 9, 2022

For AddOn tests the link to hlt_mc_HIon is wrong (@smuzaffar), the failed test was actually hlt_data_HIon. This was actually lucky because now we see the effect of this PR

Thanks for catching this. The reason behind the wrong link is that both hlt_data_HIon and hlt_mc_HIon use the same log file names (the first 150 characters of the command):

hlt_data_HIon/cmsRun__cvmfs_cms-ib.cern.ch_nweek-02753_el8_amd64_gcc10_cms_cmssw-patch_CMSSW_12_6_X_2022-10-05-1100_src_HLTrigger_Configuration_test_OnLine_HLT_HIon.log
hlt_mc_HIon/cmsRun__cvmfs_cms-ib.cern.ch_nweek-02753_el8_amd64_gcc10_cms_cmssw-patch_CMSSW_12_6_X_2022-10-05-1100_src_HLTrigger_Configuration_test_OnLine_HLT_HIon.log

which confuses the bot.
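
The name collision described above can be reproduced with a minimal C++ sketch. The helper `logFileName` is hypothetical and is not the cms-bot implementation: the 150-character limit comes from the comment above, while the rule of replacing spaces and slashes with underscores is an assumption inferred from the file names shown.

```cpp
#include <cstddef>
#include <string>

// Hypothetical helper: derive a log file name from a test command by keeping
// only the first maxLen characters and replacing spaces and slashes with
// underscores. The sanitization rule is an assumption, not the bot's code.
std::string logFileName(const std::string& command, std::size_t maxLen = 150) {
  std::string name = command.substr(0, maxLen);  // truncate to the prefix
  for (char& c : name) {
    if (c == ' ' || c == '/') {
      c = '_';
    }
  }
  return name + ".log";
}
```

Two commands that share their first 150 characters and differ only in later arguments map to the same name under this scheme, so a link built from the file name alone cannot tell the two tests apart unless the test directory is also taken into account.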

@missirol
Contributor

missirol commented Oct 9, 2022

For AddOn tests the link to hlt_mc_HIon is wrong

For the record, this has happened before (#36408 (comment)). #36423 was an attempt to fix it, but it was not integrated (while cms-sw/cms-bot#1675 was..).

@perrotta
Contributor

perrotta commented Oct 9, 2022

There are DQM differences for JetMET in several workflows: it is not clear whether or why this PR could have caused them...

@fwyzard
Contributor

fwyzard commented Oct 9, 2022

I doubt this PR can cause any changes in physics or reconstruction quantities...

@perrotta
Contributor

perrotta commented Oct 9, 2022

please test

@cmsbuild
Contributor

-1

Failed Tests: UnitTests
Summary: https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-0130f6/28142/summary.html
COMMIT: 730ad74
CMSSW: CMSSW_12_6_X_2022-10-09-0000/el8_amd64_gcc10
User test area: For local testing, you can use /cvmfs/cms-ci.cern.ch/week0/cms-sw/cmssw/39636/28142/install.sh to create a dev area with all the needed externals and cmssw changes.

Unit Tests

I found errors in the following unit tests:

---> test TestGeneratorInterfaceEvtGenInterface_external_bplus had ERRORS

Comparison Summary

@slava77 comparisons for the following workflows were not done due to missing matrix map:

  • /data/cmsbld/jenkins/workspace/compare-root-files-short-matrix/data/PR-0130f6/41834.0_TTbar_14TeV+2026D94+TTbar_14TeV_TuneCP5_GenSimHLBeamSpot14+DigiTrigger+RecoGlobal+HARVESTGlobal

Summary:

  • No significant changes to the logs found
  • Reco comparison results: 4 differences found in the comparisons
  • DQMHistoTests: Total files compared: 49
  • DQMHistoTests: Total histograms compared: 3392309
  • DQMHistoTests: Total failures: 3
  • DQMHistoTests: Total nulls: 0
  • DQMHistoTests: Total successes: 3392284
  • DQMHistoTests: Total skipped: 22
  • DQMHistoTests: Total Missing objects: 0
  • DQMHistoSizes: Histogram memory added: 0.0 KiB( 48 files compared)
  • Checked 204 log files, 49 edm output root files, 49 DQM output files
  • TriggerResults: no differences found

@perrotta
Contributor

+1

  • The differences in the DQM comparisons have disappeared
  • The unit test error didn't show up in previous tests, and it is clearly unrelated

@perrotta
Contributor

merge

@cmsbuild cmsbuild merged commit 5f6ead8 into cms-sw:master Oct 10, 2022
@makortel makortel deleted the serviceRegistryExceptionMessage branch October 10, 2022 14:55