Add GlobalCache to DeepFlavourJetTagsProducer to improve startup perf… #22886

wddgit · 2018-04-06T20:23:39Z

…ormance

igprof profiles showed this module was taking significant
time at startup, mostly as it reads the configuration file
used to initialize its neural network (lots of memory
churn also, around 150 MB per stream). Without the
GlobalCache, the startup time is proportional to the number
of streams. This change moves the LightweightNeuralNetwork
into the global cache so there is only one and the initialization
time is no longer proportional to the number of streams.
LightweightNeuralNetwork is supposed to be thread safe
to use in this manner.

Some other minor cleanup also.

The intent is that after merging this PR the module will give
identical results as before. I tested simply by putting in print
statements for the output of the neural network and also all
the parameter values I moved around. Then I compared the
output before and after in a runTheMatrix test that runs it (25.0).
All was identical. I also ran under igprof and verified the initialization
time is now constant and not proportional to the number of streams.

…ormance igprof profiles showed this module was taking significant time at startup, most as it reads the configuration file used to initialize its neural network (lots of memory churn also). Without the GlobalCache the startup time is proportional to the number of streams. This change moves the LightweightNeuralNetwork into the global cache so there is only one and the initialization time is no longer proportional to the number of streams. LightweightNeuralNetwork is supposed to be thread safe to use in this manner.

cmsbuild · 2018-04-06T20:23:58Z

The code-checks are being triggered in jenkins.

cmsbuild · 2018-04-06T20:25:41Z

+code-checks

Logs: https://cmssdt.cern.ch/SDT/code-checks/cms-sw-PR-22886/4267

cmsbuild · 2018-04-06T20:25:57Z

A new Pull Request was created by @wddgit (W. David Dagenhart) for master.

It involves the following packages:

RecoBTag/Combined

@perrotta, @cmsbuild, @slava77 can you please review it and eventually sign? Thanks.
@imarches, @acaudron, @JyothsnaKomaragiri, @mverzett, @ferencek, @pvmulder this is something you requested to watch as well.
@davidlange6, @slava77, @fabiocos you are the release manager for this.

cms-bot commands are listed here

wddgit · 2018-04-06T20:26:22Z

please test

FYI @Dr15Jones

cmsbuild · 2018-04-06T20:26:41Z

The tests are being triggered in jenkins.
https://cmssdt.cern.ch/jenkins/job/ib-any-integration/27361/console Started: 2018/04/06 22:29

cmsbuild · 2018-04-06T21:30:41Z

+1
Tested at: 292f7ac
https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-22886/27361/summary.html

The following merge commits were also included on top of IB + this PR after doing git cms-merge-topic:
34a355c
You can see more details here:
https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-22886/27361/git-log-recent-commits
https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-22886/27361/git-merge-result

cmsbuild · 2018-04-06T21:30:44Z

Comparison job queued.

cmsbuild · 2018-04-06T22:11:40Z

Comparison is ready
https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-22886/27361/summary.html

Comparison Summary:

No significant changes to the logs found
Reco comparison results: 4 differences found in the comparisons
DQMHistoTests: Total files compared: 29
DQMHistoTests: Total histograms compared: 2504254
DQMHistoTests: Total failures: 1
DQMHistoTests: Total nulls: 0
DQMHistoTests: Total successes: 2504077
DQMHistoTests: Total skipped: 176
DQMHistoTests: Total Missing objects: 0
DQMHistoSizes: Histogram memory added: 1.04000000002 KiB( 22 files compared)
Checked 119 log files, 9 edm output root files, 29 DQM output files

slava77 · 2018-04-07T01:53:36Z

+1

for #22886 292f7ac

mostly a technical update. This is nStreams better than what we had, but it is still a factor of 2.5 worse than where it could be with the NN provided via EventSetup.
- there are 5 instances running in full reco+miniAOD jobs (4 of these have the same NN now)
jenkins tests pass and comparisons with the baseline show no differences

Based on somewhat old memory performance tests, I confirm the memory churn size and note that the total RSS savings are not very large, expected to go down from 1.9 MB *nStreams to just 1.9 MB.
https://slava77sk.web.cern.ch/slava77sk/reco/cgi-bin/igprof-navigator/CMSSW_10_0_X_2017-12-17-2300-orig.step3.136.831.100.IgProf.1.MEM_LIVE/2572
https://slava77sk.web.cern.ch/slava77sk/reco/cgi-bin/igprof-navigator/CMSSW_10_0_X_2017-12-17-2300-orig.step3.136.831.100.IgProf.99.MEM_TOTAL/1528

cmsbuild · 2018-04-07T01:53:55Z

This pull request is fully signed and it will be integrated in one of the next master IBs (tests are also fine). This pull request will now be reviewed by the release team before it's merged. @davidlange6, @slava77, @smuzaffar, @fabiocos (and backports should be raised in the release meeting by the corresponding L2)

fabiocos · 2018-04-10T08:11:00Z

+1

cmsbuild added this to the CMSSW_10_2_X milestone Apr 6, 2018

cmsbuild added code-checks-pending comparison-pending orp-pending pending-signatures reconstruction-pending tests-pending labels Apr 6, 2018

cmsbuild added code-checks-approved and removed code-checks-pending labels Apr 6, 2018

cmsbuild added tests-started and removed tests-pending labels Apr 6, 2018

cmsbuild added tests-approved and removed tests-started labels Apr 6, 2018

cmsbuild added comparison-available and removed comparison-pending labels Apr 6, 2018

slava77 mentioned this pull request Apr 7, 2018

migrate DeepFlavourJetTagsProducer to use LWTNN from the event setup #22890

Open

cmsbuild added fully-signed reconstruction-approved and removed pending-signatures reconstruction-pending labels Apr 7, 2018

cmsbuild added orp-approved and removed orp-pending labels Apr 10, 2018

cmsbuild merged commit f38bc64 into cms-sw:master Apr 10, 2018

wddgit deleted the addGlobalCacheDeepFlavorJetTagsProducer branch July 31, 2018 20:55

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add GlobalCache to DeepFlavourJetTagsProducer to improve startup perf… #22886

Add GlobalCache to DeepFlavourJetTagsProducer to improve startup perf… #22886

wddgit commented Apr 6, 2018

cmsbuild commented Apr 6, 2018

cmsbuild commented Apr 6, 2018

cmsbuild commented Apr 6, 2018

wddgit commented Apr 6, 2018

cmsbuild commented Apr 6, 2018 •

edited

cmsbuild commented Apr 6, 2018

cmsbuild commented Apr 6, 2018

cmsbuild commented Apr 6, 2018

slava77 commented Apr 7, 2018

cmsbuild commented Apr 7, 2018

fabiocos commented Apr 10, 2018

Add GlobalCache to DeepFlavourJetTagsProducer to improve startup perf… #22886

Add GlobalCache to DeepFlavourJetTagsProducer to improve startup perf… #22886

Conversation

wddgit commented Apr 6, 2018

cmsbuild commented Apr 6, 2018

cmsbuild commented Apr 6, 2018

cmsbuild commented Apr 6, 2018

wddgit commented Apr 6, 2018

cmsbuild commented Apr 6, 2018 • edited

cmsbuild commented Apr 6, 2018

cmsbuild commented Apr 6, 2018

cmsbuild commented Apr 6, 2018

slava77 commented Apr 7, 2018

cmsbuild commented Apr 7, 2018

fabiocos commented Apr 10, 2018

cmsbuild commented Apr 6, 2018 •

edited