New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
81X Workaround MT problem in HIPixelMedianVtxProducer #14724
81X Workaround MT problem in HIPixelMedianVtxProducer #14724
Conversation
A new Pull Request was created by @Martin-Grunewald (Martin Grunewald) for CMSSW_8_1_X. It involves the following packages: RecoHI/HiTracking @cmsbuild, @cvuosalo, @slava77, @davidlange6 can you please review it and eventually sign? Thanks. cms-bot commands are list here #13028 |
please test |
The tests are being triggered in jenkins. |
@Dr15Jones A typical traceback is this one, a common feature to the crashes observed seems to be the destructor sequence in the last thread - Thread 1.
|
We are seeing a similar crash in the IB. We think it is a bug in ROOT. Possibly running valgrind using the helgrind tool might pinpoint the problem. [I'm on vacation for the next week.] |
valgrind with HI code may have issues if it links to CGAL. There is a bug in valgrind. |
Sorry, no - but somewhat less timing efficiency is better than crashing! |
+1 Converting HIPixelMedianVtxProducer from edm::stream to edm::one to avoid a very elusive multi-threading bug. There should be no change in monitored quantities. #14725 is the 80X version of this PR. The code change is satisfactory, and Jenkins tests against baseline CMSSW_8_1_X_2016-05-31-2300 show no significant differences, as expected. |
This pull request is fully signed and it will be integrated in one of the next CMSSW_8_1_X IBs (tests are also fine). This pull request requires discussion in the ORP meeting before it's merged. @slava77, @davidlange6, @Degano, @smuzaffar |
+1 |
The crash is from a data race in the TMinuit constructor, fix is here: https://sft.its.cern.ch/jira/browse/ROOT-8217 valgrind can be run on HI code by defining CGAL_DISABLE_ROUNDING_MATH_CHECK in RecoHI/HiJetAlgos/plugins/VoronoiAlgorithm.h (valgrind doesn't emulate the different rounding modes). However, helgrind doesn't find the problem, as it is ordered by a mutex (just ordered incorrectly), so it had to be found via more labor-intensive means. |
@dan131riley |
81X Workaround MT problem in HIPixelMedianVtxProducer