New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Problem in TkAlCaRecoMonitor:ALCARECOTkAlMuonIsolatedTkAlDQM #23439
Comments
A new Issue was created by @fabiocos Fabio Cossutti. @davidlange6, @Dr15Jones, @smuzaffar, @fabiocos can you please review it and eventually sign/assign? Thanks. cms-bot commands are listed here |
assign dqm,alca,pdmv |
New categories assigned: dqm,pdmv,alca @jfernan2,@lpernie,@prebello,@vazzolini,@franzoni,@fabozzi,@vanbesien,@GurpreetSinghChahal,@arunhep,@kmaeshima,@dmitrijus,@cerminar you have been requested to review this Pull request/Issue and eventually sign? Thanks |
the issue looks present in T0 files, for instance looking randomly at run 317382, and it appears related to the cout message ERROR: No beam sport found! from https://cmssdt.cern.ch/lxr/source/DQMOffline/Muon/src/MuonRecoAnalyzer.cc#0461 that has been upgraded to LogInfo in 10_2_X |
Apart from the typo in the message, but is this message really relevant? If not, the authors of this module should find a way to assure people looking the logs that reconstruction is not broken... or simply suppress this message |
The problem "ERROR: No beam sport found!" appears as a buggy check in 10_1_X that has been fixed in 10_2_X, a backport is for sure useful |
concerning the original problem, the DQM module triggering the problem is in the sequence process.pathALCARECOTkAlMuonIsolated = cms.Path(process.ALCARECOTkAlMuonIsolatedHLT+process.ALCARECOTkAlMuonIsolatedDCSFilter+process.ALCARECOTkAlMuonIso defined in https://cmssdt.cern.ch/lxr/source/DQMOffline/Alignment/python/ALCARECOTkAlDQM_cff.py For the events with crash that module of the path is simply not executed, as can be seen in the Tracer output (not running in multithreaded mode for simplicity): ++++++++ finished: prefetching before processing event for module: stream = 0 label = 'AlcaHBHEMuonFilter' id = 80 So if this is expected, the LogError should be downgraded to a LogInfo, or the Path should be refactored in such a way to ensure that the DQM module runs only when the input is really needed. |
the 10_1_X code
becomes in 10_2_X
and the problem disappears. |
@arunhep @lpernie @jfernan2 @dmitrijus could you please comment? |
This is fine, I guess (I am not the author of the module). From what it seems, backporting #23015 will do the trick. |
I agree with @dmitrijus , seems that a backport would solve the problem. |
@dmitrijus @lpernie that backprot will solve one problem, the beam spot not found message. What about the other one, that is flooding with error messages the LogError output? Who is going to look into it? Is anybody looking at the histograms produced by this DQM (I guess no, because the module is not executed I would say)? |
Hi, I am the author of MuonRecoAnalyzer.cc. Yes, there was a problem with that cout, that was fixed as Fabio commented. I think the backport should work. For the other module: https://cmssdt.cern.ch/lxr/source/DQMOffline/Alignment/src/TkAlCaRecoMonitor.cc#0190 that was triggering the initial problem I don't know anything. I'm not author/developer of that module. cheers, Pablo |
@parbol the backport is there #23519 @dmitrijus @lpernie the other issue is flooding of LogError the output (and the skim), who is taking care of that? |
Hi @fabiocos , for the other module I think it is DQM core or offline DQM-Tk developers responsibility to propose a change. |
I have pinged DQM-Tk since the decission depends on them, not on DQM. We (DQM) could silence blindly the message, but we are not sure about future consequences. |
Dear all the same issue appears in 10_1_9 with the sequence RECO -s RAW2DIGI,L1Reco,RECO,ALCA:TkAlMuonIsolated --runUnscheduled --nThreads 8 --data --era Run2_2018 --scenario pp --conditions 101X_dataRun2_Prompt_PixelCond_forTracker_v1 --eventcontent ALCARECO --datatier ALCARECO --customise Configuration/DataProcessing/RecoTLR.customisePostEra_Run2_2018 --filein /store/data/Run2018C/SingleMuon/RAW/v1/000/319/348/00000/D4C88FCF-0E83-E811-A6B0-FA163EDE417A.root -n 100 --python_filename=recoskim_Run2018C_SingleMuon.py --no_exec Do you have news about this issue? it should be fixed in 101X as pointed above. |
@prebello as 10_1_X is now used only by HLT, and Tier0 and online DQM have both moved to 10_2_X, I would say that this problem becomes less relevant |
though the prompt is run that way (as is anything with miniaod)- so perhaps not the best choice for the sake of consistency.
… On Sep 11, 2018, at 2:43 PM, Patricia Rebello Teles ***@***.***> wrote:
indeed @fabiocos and @mmusich in latest request of reprocessing by AlCa, @tocheng requested to remove —runUnscheduled .
—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub, or mute the thread.
|
+1 |
+1 |
An error message has been reported by @prebello while testing the cmsDriver sequence for the reprocessing of the 2018A data affected by the T0 deletion problem, see https://hypernews.cern.ch/HyperNews/CMS/get/prep-ops/5378/1/1.html
The error message is
%MSG-e Alignment: TkAlCaRecoMonitor:ALCARECOTkAlMuonIsolatedTkAlDQM 02-Jun-2018 18:39:25 CEST Run: 315252 Event: 377936
invalid trackcollection encountered!
%MSG
triggered apparently in https://cmssdt.cern.ch/lxr/source/DQMOffline/Alignment/src/TkAlCaRecoMonitor.cc#0190
I confirm the problem in 10_1_5, 10_1_6 and in the present 10_2_X head. This issue is not seen in the recent 2018 relval tests as the corresponding module seems not run, the ALCA sequence used in 136.855 being different than the tested one for the reprocessing.
The text was updated successfully, but these errors were encountered: