Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

MTD geometry: fix BTL numbering scheme for DD4hep+Geant4 extra level in path #39843

Merged
merged 2 commits into from Oct 26, 2022

Conversation

fabiocos
Copy link
Contributor

PR description:

This PR provide a fix for the crash observed in wf 20834.911 after the integration of #39670 . The use of DD4hep within Geant4 adds effectively one more level in the volume stack, which breaks the logic of the BTL numbering scheme as updated. For this reason an extra protection is added, to address this specific case.

At the same time also the static analyser issue #39670 (review) is addressed.

PR validation:

Test workflows 20834.0, 20834.911 and 23634.911 are correctly executed. Unit tests pass.

@cmsbuild
Copy link
Contributor

+code-checks

Logs: https://cmssdt.cern.ch/SDT/code-checks/cms-sw-PR-39843/32724

  • This PR adds an extra 16KB to repository

@cmsbuild
Copy link
Contributor

A new Pull Request was created by @fabiocos (Fabio Cossutti) for master.

It involves the following packages:

  • Geometry/MTDCommonData (geometry, upgrade)
  • RecoMTD/DetLayers (upgrade, reconstruction)

@civanch, @Dr15Jones, @bsunanda, @makortel, @ianna, @mdhildreth, @cmsbuild, @AdrianoDee, @srimanob, @clacaputo, @mandrenguyen can you please review it and eventually sign? Thanks.
@trtomei, @bsunanda, @beaucero, @missirol this is something you requested to watch as well.
@perrotta, @dpiparo, @rappoccio you are the release manager for this.

cms-bot commands are listed here

@fabiocos
Copy link
Contributor Author

test parameters:

  • workflows = 20834.911,23634.0,23634.911

@fabiocos
Copy link
Contributor Author

please test

@cmsbuild
Copy link
Contributor

-1

Failed Tests: RelVals RelVals-INPUT
Summary: https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-ef1843/28490/summary.html
COMMIT: b2e7113
CMSSW: CMSSW_12_6_X_2022-10-25-1100/el8_amd64_gcc10
User test area: For local testing, you can use /cvmfs/cms-ci.cern.ch/week0/cms-sw/cmssw/39843/28490/install.sh to create a dev area with all the needed externals and cmssw changes.

RelVals

----- Begin Fatal Exception 25-Oct-2022 16:34:11 CEST-----------------------
An exception of category 'FileOpenError' occurred while
   [0] Constructing the EventProcessor
   [1] Constructing module: class=MixingModule label='mix'
   [2] Calling RootInputFileSequence::initTheFile()
   [3] Calling StorageFactory::open()
   [4] Calling XrdFile::open()
Exception Message:
Failed to open the file 'root://xrootd-cms.infn.it//store/relval/CMSSW_12_3_0_pre5/RelValMinBias_14TeV/GEN-SIM/123X_mcRun4_realistic_v4_2026D88noPU-v1/10000/b93adc85-715f-477a-afc9-65f3241933ee.root'
   Additional Info:
      [a] Calling RootInputFileSequence::initTheFile(): fail to open the file with name root://eoscms.cern.ch//eos/cms/store/relval/CMSSW_12_3_0_pre5/RelValMinBias_14TeV/GEN-SIM/123X_mcRun4_realistic_v4_2026D88noPU-v1/10000/b93adc85-715f-477a-afc9-65f3241933ee.root
      [b] Calling RootInputFileSequence::initTheFile(): fail to open the file with name root://eoscms.cern.ch//eos/cms/store/user/cmsbuild/store/relval/CMSSW_12_3_0_pre5/RelValMinBias_14TeV/GEN-SIM/123X_mcRun4_realistic_v4_2026D88noPU-v1/10000/b93adc85-715f-477a-afc9-65f3241933ee.root
      [c] Input file root://xrootd-cms.infn.it//store/relval/CMSSW_12_3_0_pre5/RelValMinBias_14TeV/GEN-SIM/123X_mcRun4_realistic_v4_2026D88noPU-v1/10000/b93adc85-715f-477a-afc9-65f3241933ee.root could not be opened.
      [d] XrdCl::File::Open(name='root://xrootd-cms.infn.it//store/relval/CMSSW_12_3_0_pre5/RelValMinBias_14TeV/GEN-SIM/123X_mcRun4_realistic_v4_2026D88noPU-v1/10000/b93adc85-715f-477a-afc9-65f3241933ee.root', flags=0x10, permissions=0660) => error '[ERROR] Server responded with an error: [3014] Unable to open file  /eos/cms/store/relval/CMSSW_12_3_0_pre5/RelValMinBias_14TeV/GEN-SIM/123X_mcRun4_realistic_v4_2026D88noPU-v1/10000/b93adc85-715f-477a-afc9-65f3241933ee.root; Network is unreachable
' (errno=3014, code=400). No additional data servers were found.
      [e] Last URL tried: root://eoscms.cern.ch:1094//store/relval/CMSSW_12_3_0_pre5/RelValMinBias_14TeV/GEN-SIM/123X_mcRun4_realistic_v4_2026D88noPU-v1/10000/b93adc85-715f-477a-afc9-65f3241933ee.root?tried=+1213llrxrd-redir.in2p3.fr,&xrdcl.requuid=eb4a9bd2-d368-47fb-9db6-cf5ac063580d
      [f] Problematic data server: eoscms.cern.ch:1094
      [g] Disabled source: eoscms.cern.ch:1094
----- End Fatal Exception -------------------------------------------------

RelVals-INPUT

  • 10024.310024.3_TTbar_13+2017_trackingOnlyRun2+TTbar_13TeV_TuneCUETP8M1_GenSimINPUT+Digi+RecoFakeHLT+HARVESTFakeHLT/step2_TTbar_13+2017_trackingOnlyRun2+TTbar_13TeV_TuneCUETP8M1_GenSimINPUT+Digi+RecoFakeHLT+HARVESTFakeHLT.log
  • 10024.010024.0_TTbar_13+2017+TTbar_13TeV_TuneCUETP8M1_GenSimINPUT+Digi+RecoFakeHLT+HARVESTFakeHLT+ALCA+Nano/step2_TTbar_13+2017+TTbar_13TeV_TuneCUETP8M1_GenSimINPUT+Digi+RecoFakeHLT+HARVESTFakeHLT+ALCA+Nano.log
  • 10224.1510224.15_TTbar_13+2017PU_JMENano+TTbar_13TeV_TuneCUETP8M1_GenSimINPUT+DigiPU+RecoFakeHLTPU+HARVESTFakeHLTPU+Nano/step2_TTbar_13+2017PU_JMENano+TTbar_13TeV_TuneCUETP8M1_GenSimINPUT+DigiPU+RecoFakeHLTPU+HARVESTFakeHLTPU+Nano.log
Expand to see more relval errors ...

@perrotta
Copy link
Contributor

urgent

@fabiocos
Copy link
Contributor Author

@perrotta it seems to me that this test failure is unrelated to this PR

@srimanob
Copy link
Contributor

Hi @fabiocos

Three additional workflows are not tested. I don't see them in https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-ef1843/28490/runTheMatrix-results/

@srimanob
Copy link
Contributor

@cmsbuild please test

Retrigger the test

@cmsbuild
Copy link
Contributor

-1

Failed Tests: RelVals
Summary: https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-ef1843/28494/summary.html
COMMIT: b2e7113
CMSSW: CMSSW_12_6_X_2022-10-25-1100/el8_amd64_gcc10
User test area: For local testing, you can use /cvmfs/cms-ci.cern.ch/week0/cms-sw/cmssw/39843/28494/install.sh to create a dev area with all the needed externals and cmssw changes.

RelVals

@fabiocos
Copy link
Contributor Author

I used it for my own private tests, but of course using --what upgrade

@perrotta
Copy link
Contributor

test parameters:

  • workflows = 20834.911

@perrotta
Copy link
Contributor

please test
(thank you @smuzaffar )

@smuzaffar
Copy link
Contributor

@perrotta , cms-sw/cms-bot#1871 should now properly report such errors too

@fabiocos
Copy link
Contributor Author

@smuzaffar thanks, this is generally useful for the future

@cmsbuild
Copy link
Contributor

+1

Summary: https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-ef1843/28514/summary.html
COMMIT: b2e7113
CMSSW: CMSSW_12_6_X_2022-10-25-2300/el8_amd64_gcc10
User test area: For local testing, you can use /cvmfs/cms-ci.cern.ch/week0/cms-sw/cmssw/39843/28514/install.sh to create a dev area with all the needed externals and cmssw changes.

Comparison Summary

There are some workflows for which there are errors in the baseline:
20834.911 step 3
4.6 step 3
The results for the comparisons for these workflows could be incomplete
This means most likely that the IB is having errors in the relvals.The error does NOT come from this pull request

Summary:

  • No significant changes to the logs found
  • Reco comparison results: 11 differences found in the comparisons
  • Reco comparison had 8 failed jobs
  • DQMHistoTests: Total files compared: 48
  • DQMHistoTests: Total histograms compared: 3384029
  • DQMHistoTests: Total failures: 96
  • DQMHistoTests: Total nulls: 0
  • DQMHistoTests: Total successes: 3383911
  • DQMHistoTests: Total skipped: 22
  • DQMHistoTests: Total Missing objects: 0
  • DQMHistoSizes: Histogram memory added: 0.0 KiB( 47 files compared)
  • Checked 204 log files, 48 edm output root files, 48 DQM output files
  • TriggerResults: no differences found

@fabiocos
Copy link
Contributor Author

comparison failures look unrelated to this PR

@perrotta
Copy link
Contributor

@cms-sw/geometry-l2 @cms-sw/reconstruction-l2 @cms-sw/upgrade-l2 could you please review this PR?
We would like to have it merged in pre4, to get rid of the errors in some Phase2 workflows.

@clacaputo
Copy link
Contributor

+reconstruction

@srimanob
Copy link
Contributor

+Upgrade

@bsunanda
Copy link
Contributor

+geometry

@cmsbuild
Copy link
Contributor

This pull request is fully signed and it will be integrated in one of the next master IBs (tests are also fine). This pull request will now be reviewed by the release team before it's merged. @perrotta, @dpiparo, @rappoccio (and backports should be raised in the release meeting by the corresponding L2)

@rappoccio
Copy link
Contributor

+1

@cmsbuild cmsbuild merged commit df073f5 into cms-sw:master Oct 26, 2022
@fabiocos fabiocos deleted the fc-fixBTLnumbering branch October 26, 2022 16:18
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

8 participants