New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Profiling at T0 AlCa and DQM workflows #38198
Comments
A new Issue was created by @tvami Tamas Vami. @Dr15Jones, @perrotta, @dpiparo, @makortel, @smuzaffar, @qliphy can you please review it and eventually sign/assign? Thanks. cms-bot commands are listed here |
assign alca,reconstruction |
New categories assigned: dqm,alca @jfernan2,@ahmad3213,@yuanchao,@micsucmed,@rvenditti,@emanueleusai,@francescobrivio,@malbouis,@tvami,@pmandrik you have been requested to review this Pull request/Issue and eventually sign? Thanks |
New categories assigned: reconstruction @jpata,@slava77,@clacaputo you have been requested to review this Pull request/Issue and eventually sign? Thanks |
How many events are processed for the plots in http://cms-reco-profiling.web.cern.ch/cms-reco-profiling/results/summary_plot_html/CMSSW_12_4_step3_136.889.html ? (I'm confused of the x axis) |
It's 5k events: https://github.com/cms-sw/cms-bot/blob/master/reco_profiling/profileRunner.py#L95. The plot has event IDs on the x axis, we need to change that (cc @xoqhdgh1002) to be just numbers. |
Thanks, good. That should be sufficient for this particular leak (or even smaller) to be visible (this would have been ~1.2 GB after 5k events). |
@tvami, should we take any action here? Is there a new workflow we should switch to that would be more representative? |
Hi @jpata we could use a run from this Monday. However, it will likely have limited stats, although if the tests go up to 5000 events that could probably be reached |
We test about 5k now, so it wouldn't be a big change. Is there a workflow so we can give a try, and you can see if the results are useful for ALCA? |
@tocheng is going to create a new wf that you can use. He promised to look at this tomorrow. |
@tocheng do you have any updates? |
@tvami |
Hi @tocheng
I think you missed some of them, please add those! Thanks! |
And maybe we could add another one, which is purely a technical wf that adds all the ALCARECOs to the MinBias PD... this of course would physically be incorrect, but would test everything under one wf... |
@tocheng please submit the PR, at this point we have good Run-3, 13.6 TeV input data |
@tocheng ? |
Being addressed in #38681 |
+alca
|
@jpata can you please take over from that? Thanks! |
Thanks! Which of the two new workflows should we use this instead of 136.889? From the reco point of view they are all equivalent, so the question is, which has the most representative ALCA configuration. Note that on the reco side, we basically submit and analyze this 8-threaded profiling job "by hand" for each prerelease - so we don't have the personpower to study a large number of workflows per release at this time. |
I think you can go ahead with 1001.3, thanks! |
hi @jpata do you have any update on this? thanks! |
hi @cms-sw/reconstruction-l2 did this happen in the end? |
@clacaputo @mandrenguyen hi guys, do you you know if the |
Follow up to the issue 36282
and cmsTalk
https://cms-talk.web.cern.ch/t/high-memory-usage-in-promptreco-jobs-for-run-352516/11040
So the issue is that the wf chosen in github issue 36282 is based on the MET dataset, thus
AlCaHcalHBHEMuonProducer
is not run on it. (It's attached to the MinBias and SingleMuon).This is a general issue for testing, certain ALCARECOs belong to certain PDs (as defined in the AlCaRECO matrix). i.e. we either do the testing on several wf, or just decide to pick one that has most of the ALCARECOs connected to it. That would be
SingleMuon
.If that's the prefered solution, we can set up a new wf after the Run3 single muon PD is done (next week?)
The text was updated successfully, but these errors were encountered: