New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Protect access to updatedState #14060
Protect access to updatedState #14060
Conversation
While testing if a newly added TrajectoryMeasurement to a Trajectory is compatible or not with the cut on the cluster charge, a protection has been added to test the validity of the updatedState before using it. This fix a segmentation fault that was due to an invalid updatedState caused by a failed KfUpdator. Full log avaialable here: https://hypernews.cern.ch/HyperNews/CMS/get/recoDevelopment/1436.html
A new Pull Request was created by @rovere (Marco Rovere) for CMSSW_8_0_X. It involves the following packages: TrackingTools/PatternTools @cmsbuild, @cvuosalo, @slava77, @davidlange6 can you please review it and eventually sign? Thanks. cms-bot commands are list here #13028 |
@@ -118,6 +118,8 @@ bool TempTrajectory::badForCCC(const TrajectoryMeasurement &tm) { | |||
return false; | |||
if (thit->isPixel()) | |||
return false; | |||
if (!tm.updatedState().isValid()) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
uhmmm, in principle we should NOT store invalid states in the trajectory. we should abort Pattern-reco for that traj...
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Ciao Vincenzo,
I agree, but at this level I cannot abort PatternReco. I can return true instead of false, but this will not guarantee to stop the PR.
please test |
The tests are being triggered in jenkins. |
@rovere |
Ciao @slava77, |
Ciao Marco, something like this:
I omitted input/output file names |
@slava77 it seems that era |
On 4/14/16 8:22 AM, Marco Rovere wrote:
ah, sorry,
|
Matti, thanks for correcting my typo in the cmsDriver. |
+1
|
This pull request is fully signed and it will be integrated in one of the next CMSSW_8_0_X IBs (tests are also fine). This pull request requires discussion in the ORP meeting before it's merged. @slava77, @davidlange6, @Degano, @smuzaffar |
Ciao @slava77,
and processing only 1 event. The crash happens both running with 4 threads or running with 1 thread. |
While testing if a newly added TrajectoryMeasurement to a Trajectory is compatible or not with the cut on the cluster charge, a protection has been added to test the validity of the updatedState before using it. This fix a segmentation fault that was due to an invalid updatedState caused by a failed KfUpdator. Full log avaialable here: https://hypernews.cern.ch/HyperNews/CMS/get/recoDevelopment/1436.html See also Pull Request cms-sw#14060 for the corresponding PR for the 80X branch.
I have a branch ready for 81X, but would rather prefer to have #13988 integrate first, otherwise my PR will bring in the very same changes as that PR. Marco. |
On 4/15/16 1:17 AM, Marco Rovere wrote:
Marco, Can you try on cmsdev02 (that's where I tested)? I have just tried in CMSSW_8_0_4 with the PSet.py with extra lines process.source.skipEvents = cms.untracked.uint32(535) to match what you describe. It ran to completion. Begin processing the 1st record. Run 269223, Event 9231372, LumiSection
|
Ciao @slava77, |
I tried on the following without any crashes
is this Ofast doing or is it something else, I wonder. |
finally got a crash on
on FNAL LPC |
@davidlange6 |
While testing if a newly added TrajectoryMeasurement to a Trajectory is
compatible or not with the cut on the cluster charge, a protection has
been added to test the validity of the updatedState before using it.
This fix a segmentation fault that was due to an invalid updatedState
caused by a failed KfUpdator. Full log avaialable here:
https://hypernews.cern.ch/HyperNews/CMS/get/recoDevelopment/1436.html