-
Notifications
You must be signed in to change notification settings - Fork 4.3k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Improve Framework behavior after exceptions in begin/end transitions (Job, Stream, ProcessBlock) #45434
Improve Framework behavior after exceptions in begin/end transitions (Job, Stream, ProcessBlock) #45434
Conversation
cms-bot internal usage |
+code-checks Logs: https://cmssdt.cern.ch/SDT/code-checks/cms-sw-PR-45434/40882
|
A new Pull Request was created by @wddgit for master. It involves the following packages:
@Dr15Jones, @civanch, @cmsbuild, @makortel, @mdhildreth, @smuzaffar can you please review it and eventually sign? Thanks. cms-bot commands are listed here |
enable threading |
please test |
-1 Failed Tests: RelVals RelVals-THREADING
RelVals
RelVals-THREADING
|
please test The failures seem unrelated to the changes in the PR and possibly not reproducible. Try again and see if we can get them to pass. |
I added some TWIKI documentation related to this PR here: https://twiki.cern.ch/twiki/bin/view/CMSPublic/SWGuideExceptionsInBeginEndTransitions And a link to the new TWIKI page here: https://twiki.cern.ch/twiki/bin/view/CMSPublic/SWGuideFrameWork#Error_handling |
-1 Failed Tests: UnitTests RelVals-THREADING Unit TestsI found 1 errors in the following unit tests: ---> test test-particleLevel_fromMiniAod had ERRORS RelVals-THREADING
Comparison SummaryThere are some workflows for which there are errors in the baseline: Summary:
|
please test let us test once more |
please test I believe all the review comments received so far are dealt with now. Let me know if there are more. I just recently noticed there is a link in the PR to compare after a force push is made, so I went ahead and just squashed everything. |
-1 Failed Tests: RelVals-THREADING
RelVals-THREADING
Comparison SummarySummary:
|
Same as before. All tests pass except for 2 DAS Errors in the additional tests added when threading tests are enabled. There is nothing in this PR that could cause a DAS Error and these failures are also occurring in the IBs. The failures are unrelated to the PR. |
Comparison differences show #39803 |
+core |
@cmsbuild, ignore tests-rejected ib-failure |
+1 |
This pull request is fully signed and it will be integrated in one of the next master IBs (test failures were overridden). This pull request will now be reviewed by the release team before it's merged. @sextonkennedy, @rappoccio, @antoniovilela, @mandrenguyen (and backports should be raised in the release meeting by the corresponding L2) |
+1 |
PR description:
Improve the behavior of the Framework after exceptions in beginJob, endJob, beginStream, endStream, beginProcessBlock and endProcessBlock transitions. This is the fourth and final PR in a series of PRs modifying the behavior of the Framework after exceptions so that it more consistently handles exceptions in all the begin/end transitions. The first PR handled stream lumi exceptions (PR #44624). The second PR handled global lumi exceptions (PR #44840). The third PR handled run transitions (PR #45017). The comments at the head of the first PR state the design for the new behavior we are implementing.
The intent is that nothing in the output will change if there are not any exceptions. In some cases, ordering of operations may change where that ordering is not supposed to matter. There are some minor differences related to signals.
This work was motivated by discussions related to Issues #43831 and #42501.
PR validation:
An existing unit test covering exceptions in different transitions is extended to cover the most salient cases. Additional manual testing of many various cases was also done. Existing unit tests pass.