backport HLT DQM for menu v3.0 to CMSSW 9.2.x + HLTDQM : fix memory issue (part1) + add offlineHLTSourceOnAODextra sequence #20620

Merged: 22 commits into cms-sw:CMSSW_9_2_X on Oct 27, 2017

mtosi (Contributor) commented Sep 21, 2017

many HLT DQM PRs have not entered 92x yet,
because the corresponding PRs in 93x showed a large memory issue,
as reported in [1] and discussed during (and after) the DQM meeting

they are --mainly--

the memory issue seems to be at least partially addressed by PR #20439;
this PR includes its backport to 92x, which should make the above PRs fine in 92x as well
this is the 1st (low-level) round, in which

  1. I disable the tracking cumulative hit efficiency plots vs PU, lumi and BX almost everywhere
  2. I reduce the number of bins in the JetMET directories by
    • disabling some not really useful plots
      (like the 2D ones in the HEM and HEP slices, which are simply a zoom of the full eta-phi map)
    • reducing the number of bins in the eta plot from 50 to 20 (i.e. widening the bins from 0.2 to 0.5);
      the best would be to move to variable binning
    • dropping the plots vs LS done per partition
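
The eta rebinning quoted in the list can be checked with a short calculation. This is only a sketch: it assumes a uniformly binned eta axis spanning -5 to 5, which is the range implied by 50 bins of width 0.2 (the actual axis limits are not stated in this PR). The names `ETA_MIN`, `ETA_MAX` and `bin_width` are illustrative.

```python
# Assumed axis range: 50 bins at 0.2 resolution imply a span of 10 units in eta.
ETA_MIN, ETA_MAX = -5.0, 5.0  # assumption, not stated in the PR


def bin_width(n_bins, lo=ETA_MIN, hi=ETA_MAX):
    """Width of each bin for a uniformly binned axis."""
    return (hi - lo) / n_bins


old_width = bin_width(50)       # binning before this PR
new_width = bin_width(20)       # binning after this PR
saved_fraction = 1 - 20 / 50    # fraction of eta bins removed

print(f"old width: {old_width:.1f}, new width: {new_width:.1f}")
print(f"eta bins removed: {saved_fraction:.0%}")
```

Since the bin contents dominate the size of a large histogram, removing 60% of the eta bins should reduce the memory of each affected plot by roughly the same fraction, at the cost of a coarser eta resolution.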

on this occasion, I also clean up and re-organize the directory structure
==> the plot-by-plot comparison will fail

[1] https://indico.cern.ch/event/658725/contributions/2685778/attachments/1505407/2345690/dqm_memory.pdf

finally (?), as discussed at the recent XC,
it would probably be safer to have a dedicated cms.Sequence for testing new DQM code for HLT,
which is meant to be used in the HLT validation workflow, but not necessarily at Tier0

new DQM code should first be integrated into the offlineHLTSourceOnAODextra sequence
and then it could be ported to the standard offlineHLTSourceOnAOD one (which is run at Tier0)
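
The staging idea can be illustrated with a toy model. This is not CMSSW configuration code: plain Python lists stand in for the two cms.Sequence objects, the module labels and the `modules_for` and `promote` helpers are hypothetical, and it is an assumption here that the HLT validation workflow runs both sequences while Tier0 runs only the standard one.

```python
# Toy model only: plain Python lists stand in for cms.Sequence objects.
# The module labels below are hypothetical placeholders.
offlineHLTSourceOnAOD = ["stableMonitorA", "stableMonitorB"]  # run at Tier0
offlineHLTSourceOnAODextra = ["newMonitorC"]                  # validation only


def modules_for(workflow):
    """Return the monitoring modules a given workflow would run."""
    if workflow == "tier0":
        return list(offlineHLTSourceOnAOD)
    if workflow == "hlt_validation":
        # Assumption: validation runs the standard and the extra sequence.
        return offlineHLTSourceOnAOD + offlineHLTSourceOnAODextra
    raise ValueError(f"unknown workflow: {workflow}")


def promote(label):
    """Move a module from the extra sequence to the standard one."""
    offlineHLTSourceOnAODextra.remove(label)
    offlineHLTSourceOnAOD.append(label)


before = modules_for("tier0")   # new code is not yet visible at Tier0
promote("newMonitorC")          # done once the code has been validated
after = modules_for("tier0")    # now it runs at Tier0 as well
```

The point of the split is that a memory or CPU regression in new monitoring code shows up in validation first, before it can affect Tier0 processing.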

mtosi (Contributor Author) commented Sep 21, 2017

@cmsbuild , please test

cmsbuild (Contributor) commented Sep 21, 2017

The tests are being triggered in jenkins.
https://cmssdt.cern.ch/jenkins/job/ib-any-integration/23154/console Started: 2017/09/21 23:34

cmsbuild (Contributor) commented

A new Pull Request was created by @mtosi (mia tosi) for CMSSW_9_2_X.

It involves the following packages:

DQMOffline/Configuration
DQMOffline/Trigger
DQMServices/ClientConfig
HLTriggerOffline/Btag
HLTriggerOffline/Higgs
HLTriggerOffline/SUSYBSM
HLTriggerOffline/Tau
HLTriggerOffline/Top

@kmaeshima, @cmsbuild, @vanbesien, @vazzolini, @dmitrijus can you please review it and eventually sign? Thanks.
@barvic, @HuguesBrun, @swertz, @imarches, @makortel, @trocino, @acaudron, @battibass, @jhgoh, @calderona, @mverzett, @JyothsnaKomaragiri, @threus, @ferencek, @pvmulder, @rociovilar this is something you requested to watch as well.
@davidlange6, @slava77 you are the release manager for this.

cmsbuild (Contributor) commented

Comparison job queued.

cmsbuild (Contributor) commented

Comparison is ready
https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-20620/23154/summary.html

Comparison Summary:

  • You potentially added 4774 lines to the logs
  • ROOTFileChecks: Some differences in event products or their sizes found
  • Reco comparison results: 0 differences found in the comparisons
  • DQMHistoTests: Total files compared: 23
  • DQMHistoTests: Total histograms compared: 1764262
  • DQMHistoTests: Total failures: 29671
  • DQMHistoTests: Total nulls: 1151
  • DQMHistoTests: Total successes: 1733266
  • DQMHistoTests: Total skipped: 174
  • DQMHistoTests: Total Missing objects: 0
  • Checked 94 log files, 14 edm output root files, 23 DQM output files

cmsbuild (Contributor) commented

Pull request #20620 was updated. @kmaeshima, @cmsbuild, @vanbesien, @vazzolini, @dmitrijus can you please check and sign again.

mtosi (Contributor Author) commented Sep 22, 2017 via email

cmsbuild (Contributor) commented Sep 22, 2017

The tests are being triggered in jenkins.
https://cmssdt.cern.ch/jenkins/job/ib-any-integration/23169/console Started: 2017/09/22 14:13

cmsbuild (Contributor) commented

Comparison job queued.

mtosi (Contributor Author) commented Oct 20, 2017

for completeness, the memory usage with all HLT plots enabled is roughly 60 MB,
which is roughly 1/3 of the total

is it too much?
I do not think so: we can for sure try to squeeze it further,
but that is the order of magnitude for monitoring 500 paths

==> the mentioned work on the DQM framework is really important, I guess
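
The order of magnitude quoted here can be unpacked with a little arithmetic. This is a rough sketch using only the approximate figures from this comment (60 MB, one third of the total, 500 paths); the per-path number is a plain average and says nothing about how memory is actually distributed across paths.

```python
hlt_memory_mb = 60.0       # quoted: memory with all HLT plots enabled
hlt_fraction = 1.0 / 3.0   # quoted: roughly one third of the total
n_paths = 500              # quoted: number of monitored paths

# Total implied by "60 MB is roughly 1/3 of the total".
total_mb = hlt_memory_mb / hlt_fraction

# Average memory per monitored path, using 1 MB = 1000 kB.
per_path_kb = hlt_memory_mb * 1000.0 / n_paths

print(f"implied total: about {total_mb:.0f} MB")
print(f"average per path: about {per_path_kb:.0f} kB")
```

At about 120 kB per path on average, any further squeezing has to come from fewer or coarser histograms per path, which is consistent with the rebinning done in this PR.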

cmsbuild (Contributor) commented

Pull request #20620 was updated. @prebello, @vazzolini, @dmitrijus, @kmaeshima, @kpedro88, @fabozzi, @cmsbuild, @franzoni, @GurpreetSinghChahal, @vanbesien, @davidlange6 can you please check and sign again.

dmitrijus (Contributor) commented

+1

cmsbuild (Contributor) commented Oct 24, 2017

The tests are being triggered in jenkins.
https://cmssdt.cern.ch/jenkins/job/ib-any-integration/23940/console Started: 2017/10/24 08:36

cmsbuild (Contributor) commented

Comparison job queued.

cmsbuild (Contributor) commented

Comparison is ready
https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-20620/23940/summary.html

Comparison Summary:

  • You potentially added 10783 lines to the logs
  • Reco comparison results: 0 differences found in the comparisons
  • DQMHistoTests: Total files compared: 23
  • DQMHistoTests: Total histograms compared: 1815556
  • DQMHistoTests: Total failures: 46250
  • DQMHistoTests: Total nulls: 1452
  • DQMHistoTests: Total successes: 1767682
  • DQMHistoTests: Total skipped: 172
  • DQMHistoTests: Total Missing objects: 0
  • Checked 94 log files, 9 edm output root files, 23 DQM output files
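
For readers comparing the two DQMHistoTests summaries in this thread (jobs 23154 and 23940), the sketch below checks that the per-category counts add up to the reported totals and computes the failure fraction for each run. The counts are copied from the bot comments; the helper names `failure_rate` and `is_consistent` are illustrative.

```python
# Counts copied from the two "Comparison Summary" comments in this thread.
runs = {
    "23154": {"total": 1764262, "failures": 29671, "nulls": 1151,
              "successes": 1733266, "skipped": 174},
    "23940": {"total": 1815556, "failures": 46250, "nulls": 1452,
              "successes": 1767682, "skipped": 172},
}


def failure_rate(counts):
    """Fraction of compared histograms that failed the comparison."""
    return counts["failures"] / counts["total"]


def is_consistent(counts):
    """True if the four categories add up to the reported total."""
    parts = ("failures", "nulls", "successes", "skipped")
    return sum(counts[p] for p in parts) == counts["total"]


for job, counts in runs.items():
    print(job, is_consistent(counts), f"{failure_rate(counts):.2%}")
```

Failures in a histogram comparison are expected for this PR, since the description states that the directory structure was re-organized and the plot-by-plot comparison will fail.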

fabozzi (Contributor) commented Oct 24, 2017

+1

davidlange6 (Contributor) commented

some fixes to be discussed for 94x once I write my suggestions up - but ok for 92x.

davidlange6 (Contributor) commented

merge

cmsbuild merged commit 788328d into cms-sw:CMSSW_9_2_X on Oct 27, 2017