Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

If SecondaryEventProvider has no modules do not create a task #33125

Merged
merged 1 commit into from Mar 11, 2021

Conversation

Dr15Jones
Copy link
Contributor

PR description:

edm::syncWait is always required to create at least one TBB task and then wait on it (which can cause other modules to be run on top of the module using SecondaryEventProvider). To avoid that in the case where there are no modules in SecondaryEventProvider we first check that condition and if so immediately return from the member functions.

This fixes the problem where the Timing service and the MessageLogger were both failing.

PR validation:

The code compiles and running step 1 of workflow 250406.17 using 4 threads, which was failing each time it was run, succeded 3 times in a row when run using the fix.

edm::syncWait is always required to create at least one TBB
task and then wait on it (which can cause other modules to be run
on top of the module using SecondaryEventProvider). To avoid that
in the case where there are no modules in SecondaryEventProvider
we first check that condition and if so immediately return from
the member functions.
@cmsbuild
Copy link
Contributor

cmsbuild commented Mar 9, 2021

+code-checks

Logs: https://cmssdt.cern.ch/SDT/code-checks/cms-sw-PR-33125/21477

  • This PR adds an extra 12KB to repository

@cmsbuild
Copy link
Contributor

cmsbuild commented Mar 9, 2021

A new Pull Request was created by @Dr15Jones (Chris Jones) for master.

It involves the following packages:

Mixing/Base

@cmsbuild, @civanch, @mdhildreth can you please review it and eventually sign? Thanks.
@fabiocos this is something you requested to watch as well.
@silviodonato, @dpiparo, @qliphy you are the release manager for this.

cms-bot commands are listed here

@Dr15Jones
Copy link
Contributor Author

Please test

@cmsbuild
Copy link
Contributor

+1

Summary: https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-28f7b2/13387/summary.html
COMMIT: 9461d3a
CMSSW: CMSSW_11_3_X_2021-03-09-1100/slc7_amd64_gcc900
User test area: For local testing, you can use /cvmfs/cms-ci.cern.ch/week1/cms-sw/cmssw/33125/13387/install.sh to create a dev area with all the needed externals and cmssw changes.

Comparison Summary

Summary:

  • No significant changes to the logs found
  • Reco comparison results: 4 differences found in the comparisons
  • DQMHistoTests: Total files compared: 38
  • DQMHistoTests: Total histograms compared: 2849195
  • DQMHistoTests: Total failures: 8
  • DQMHistoTests: Total nulls: 1
  • DQMHistoTests: Total successes: 2849164
  • DQMHistoTests: Total skipped: 22
  • DQMHistoTests: Total Missing objects: 0
  • DQMHistoSizes: Histogram memory added: -0.004 KiB( 37 files compared)
  • DQMHistoSizes: changed ( 312.0 ): -0.004 KiB MessageLogger/Warnings
  • Checked 160 log files, 37 edm output root files, 38 DQM output files

@Dr15Jones
Copy link
Contributor Author

@cms-sw/simulation-l2 this is critical to fixing problems in the IB. Please review.

@smuzaffar
Copy link
Contributor

smuzaffar commented Mar 10, 2021

@Dr15Jones ,should we try enabling threading tests for this PR?

@Dr15Jones
Copy link
Contributor Author

Sure

@smuzaffar
Copy link
Contributor

enable threading

@smuzaffar
Copy link
Contributor

please test workflow 250406.17

@cmsbuild
Copy link
Contributor

+1

Summary: https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-28f7b2/13411/summary.html
COMMIT: 9461d3a
CMSSW: CMSSW_11_3_X_2021-03-09-2300/slc7_amd64_gcc900
Additional Tests: THREADING
User test area: For local testing, you can use /cvmfs/cms-ci.cern.ch/week1/cms-sw/cmssw/33125/13411/install.sh to create a dev area with all the needed externals and cmssw changes.

Comparison Summary

@slava77 comparisons for the following workflows were not done due to missing matrix map:

  • /data/cmsbld/jenkins/workspace/compare-root-files-short-matrix/data/PR-28f7b2/250406.17_ZMM_13+FS_ZMM_13_PRMXUP17_PU50+HARVESTUP17FS+MINIAODMCUP17FS

Summary:

  • No significant changes to the logs found
  • Reco comparison results: 3 differences found in the comparisons
  • DQMHistoTests: Total files compared: 38
  • DQMHistoTests: Total histograms compared: 2849195
  • DQMHistoTests: Total failures: 6
  • DQMHistoTests: Total nulls: 1
  • DQMHistoTests: Total successes: 2849166
  • DQMHistoTests: Total skipped: 22
  • DQMHistoTests: Total Missing objects: 0
  • DQMHistoSizes: Histogram memory added: 0.004 KiB( 37 files compared)
  • DQMHistoSizes: changed ( 312.0 ): 0.004 KiB MessageLogger/Warnings
  • Checked 160 log files, 37 edm output root files, 38 DQM output files

@Dr15Jones
Copy link
Contributor Author

@smuzaffar Looking at the testing output I see that '250406.17' was only run for the single threaded case, not the multi-threaded case.

@Dr15Jones
Copy link
Contributor Author

@smuzaffar
Copy link
Contributor

@civanch, @mdhildreth can you please review this? This is critical to fix failures in IBs

@civanch
Copy link
Contributor

civanch commented Mar 10, 2021

+1

@smuzaffar
Copy link
Contributor

smuzaffar commented Mar 10, 2021

@smuzaffar Looking at the testing output I see that '250406.17' was only run for the single threaded case, not the multi-threaded case.

you are right @Dr15Jones , threading job was only running runTheMatrix.py -s , I have fixed it and restarted the threading job only. Bot should update the results when https://cmssdt.cern.ch/jenkins/job/ib-run-pr-relvals/2081/ is done

@cmsbuild
Copy link
Contributor

This pull request is fully signed and it will be integrated in one of the next master IBs after it passes the integration tests. This pull request will now be reviewed by the release team before it's merged. @silviodonato, @dpiparo, @qliphy (and backports should be raised in the release meeting by the corresponding L2)

@Dr15Jones
Copy link
Contributor Author

@smuzaffar rerunning the test with that workflow is nice but not actually necessary. One of the workflows run in threading mode also exhibits the problem in the IB but worked fine here.

@smuzaffar
Copy link
Contributor

@silviodonato , @qliphy , this is fully signed. Can we get this in IB ?

@qliphy
Copy link
Contributor

qliphy commented Mar 11, 2021

+1

@cmsbuild cmsbuild merged commit 177db5c into cms-sw:master Mar 11, 2021
@Dr15Jones Dr15Jones deleted the fixSecondaryEventProvider branch March 15, 2021 17:45
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

5 participants