improving performance of PuppiProducer #31164

missirol · 2020-08-18T12:49:09Z

PR description:

This PR is an attempt to improve the performance (in terms of execution time) of the PuppiProducer plugin.

No changes in the outputs are expected.

Most of these technical changes are not in the producer itself, but in the PuppiContainer class; all changes are related (directly or indirectly) to the function var_within_R (the "alpha" variable on which the PUPPI weight is based), which appears to be the most time-consuming part of the algorithm.

The table below summarizes a few timing estimates for the puppi module in the RECO step for a Run-3 and Phase-2 workflow.

wf	time pre-PR [ms]	time post-PR [ms]
Run-3 RECO (QCD MC)	74.2	33.2
Phase-2 RECO (QCD MC)	620.1	255.9

The numbers are taken from the FastTimerService, running on 100 events; they suggest that the overall reduction in timing is approx. 55-60%.

Here is a short summary of the changes in the PR (below, goodVar and var_within_R are used interchangeably, since the former function is simply a call to the latter):

PuppiCandidate is converted to a simple struct, removing its inheritance from fastjet::PseudoJet (looking at the PUPPI implementation as a whole, the usage of fastjet does not seem necessary); this change accounts for most of the speedup (approx. 35-40% speedup);
skip the calculation of goodVar when not needed, e.g. for candidates that are assigned a default weight (0, or 1); this seems to bring roughly a 10-15% improvement in performance.
another bit of speedup is obtained by taking one condition (puppi_id != 3) out of var_within_R, and using it just once in PuppiContainer::initialize to fill the collections later used as inputs to goodVar; this leads to adding one more data member (a vector of PuppiCandidates) to the PuppiContainer class.

In this PR, there are two spots where things could certainly be improved (but improving them will lead to small changes in the outputs):

in PuppiContainer::initialize, an instance of fastjet::PseudoJet (here) is used to obtain the 4-momenta of the candidates used by PUPPI; this was the only way I could find to maintain the same numbers as pre-PR (this may be due to the fact that the inputs used for reset_PtYPhiM are floats, and then fastjet recomputes and stores the p4 using doubles); one way to simplify this (assuming the differences in the outputs are ultimately not significant) would be to remove PuppiCandidate altogether and simply use the RecoObj class already in use in the PuppiProducer (maybe, changing the floats in RecoObj to doubles).
in goodVar, the DeltaR2 between two given candidates is calculated (at most) twice (here), first with rapidity, then with pseudo-rapidity; this is not modified by this PR (otherwise, the outputs would inevitably change), but maybe it’s something that could be simplified.

FYI: @ahinzmann @lathomas @kirschen

PR validation:

Checked that the PUPPI weights of all PF candidates are unchanged for 200 events (100 for a Run-3 workflow, and 100 for a Phase-2 workflow), for both puppi (used for jets) and puppiNoLep (used for MET).
Standard workflows, i.e. runTheMatrix.py -l limited -i all --ibeos, ran successfully.

if this PR is a backport please specify the original PR and why you need to backport that PR:

N/A

slava77 · 2020-08-18T13:02:58Z

@mrodozov @smuzaffar
do we have some problems with webhooks?
this one was posted for 14 minutes already (at the time of my message)

mrodozov · 2020-08-18T13:33:26Z

@slava77 the remote calls to jenkins hanged with proxy error 502 and I'm trying to restart the service but I don't have sudo rights to do it. I'm trying to do it from the java CLI using the keys for cmsbuild

davidlange6 · 2020-08-18T13:48:25Z

It seems there are other problems at cern today (at least affecting cms computing infrastructure) On Aug 18, 2020, at 3:33 PM, Mircho Rodozov <notifications@github.com<mailto:notifications@github.com>> wrote: @slava77<https://github.com/slava77> the remote calls to jenkins hanged with proxy error 502 and I'm trying to restart the service but I don't have sudo rights to do it — You are receiving this because you are subscribed to this thread. Reply to this email directly, view it on GitHub<#31164 (comment)>, or unsubscribe<https://github.com/notifications/unsubscribe-auth/ABGPFQ2VYBUIGTQZJX5IVY3SBJ7LRANCNFSM4QDNGPKA>.

mrodozov · 2020-08-18T14:45:33Z

@davidlange6 do you by any chance happen to have root permissions for the jenkins03 machine ?

davidlange6 · 2020-08-18T15:06:45Z

No, I don’t - perhaps ask IT to reboot it? On Aug 18, 2020, at 4:45 PM, Mircho Rodozov <notifications@github.com<mailto:notifications@github.com>> wrote: @davidlange6<https://github.com/davidlange6> do you by any chance happen to have root permissions for the jenkins03 machine ? — You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub<#31164 (comment)>, or unsubscribe<https://github.com/notifications/unsubscribe-auth/ABGPFQ5R5XJMARLSV773IOLSBKHZ3ANCNFSM4QDNGPKA>.

cmsbuild · 2020-08-18T15:49:49Z

The code-checks are being triggered in jenkins.

cmsbuild · 2020-08-18T15:59:13Z

+code-checks

Logs: https://cmssdt.cern.ch/SDT/code-checks/cms-sw-PR-31164/17796

This PR adds an extra 24KB to repository

cmsbuild · 2020-08-18T15:59:36Z

A new Pull Request was created by @missirol (Marino Missiroli) for master.

It involves the following packages:

CommonTools/PileupAlgos

@perrotta, @jpata, @cmsbuild, @santocch, @slava77 can you please review it and eventually sign? Thanks.
@rappoccio, @ahinzmann, @riga, @jdolen, @gkasieczka, @hatakeyamak, @clelange this is something you requested to watch as well.
@silviodonato, @dpiparo, @qliphy you are the release manager for this.

cms-bot commands are listed here

slava77 · 2020-08-18T16:59:48Z

@cmsbuild please test

cmsbuild · 2020-08-18T17:00:10Z

The tests are being triggered in jenkins.

CMSSW_11_2_X_2020-08-17-2300/slc7_amd64_gcc820: https://cmssdt.cern.ch/jenkins/job/ib-run-pr-tests/8799/console Started: 2020/08/18 19:01

cmsbuild · 2020-08-18T18:18:26Z

+1
Tested at: 0c3b885
https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-63bdad/8799/summary.html
CMSSW: CMSSW_11_2_X_2020-08-17-2300
SCRAM_ARCH: slc7_amd64_gcc820

cmsbuild · 2020-08-18T18:18:29Z

Comparison job queued.

cmsbuild · 2020-08-18T19:45:22Z

Comparison is ready
https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-63bdad/8799/summary.html

Comparison Summary:

No significant changes to the logs found
Reco comparison results: 0 differences found in the comparisons
DQMHistoTests: Total files compared: 35
DQMHistoTests: Total histograms compared: 2608246
DQMHistoTests: Total failures: 1
DQMHistoTests: Total nulls: 0
DQMHistoTests: Total successes: 2608197
DQMHistoTests: Total skipped: 48
DQMHistoTests: Total Missing objects: 0
DQMHistoSizes: Histogram memory added: 0.0 KiB( 34 files compared)
Checked 149 log files, 22 edm output root files, 35 DQM output files

cmsbuild · 2020-08-21T16:25:19Z

Comparison job queued.

cmsbuild · 2020-08-21T17:52:10Z

Comparison is ready
https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-63bdad/8853/summary.html

Comparison Summary:

No significant changes to the logs found
Reco comparison results: 145 differences found in the comparisons
DQMHistoTests: Total files compared: 35
DQMHistoTests: Total histograms compared: 2608222
DQMHistoTests: Total failures: 294
DQMHistoTests: Total nulls: 0
DQMHistoTests: Total successes: 2607906
DQMHistoTests: Total skipped: 22
DQMHistoTests: Total Missing objects: 0
DQMHistoSizes: Histogram memory added: 0.0 KiB( 34 files compared)
Checked 149 log files, 22 edm output root files, 35 DQM output files

santocch · 2020-08-24T18:27:17Z

+1

slava77 · 2020-08-24T21:27:08Z

+1

for #31164 e0f02d3

code changes are in line with the PR description and the follow up review
- the last commits removed dependence on fastjet, but introduced a numerical difference https://github.com/cms-sw/cmssw/pull/31164/files/0c3b885fbbd8e493f396d89bcdc0687cf52f0818..e0f02d36a5b2d0fd5c5fbe53358cbb902f1700cf in the way how rapidity/pseudorapidity are set, something to be noted/bypassed in a possible backport
jenkins tests pass and comparisons with the baseline show small differences in puppi-related variables, consistent with the change in the numerical precision of the kinematics

curiously enough, there was no actual overlap in the context lines with #31174

cmsbuild · 2020-08-24T21:27:33Z

This pull request is fully signed and it will be integrated in one of the next master IBs (tests are also fine). This pull request will now be reviewed by the release team before it's merged. @silviodonato, @dpiparo, @qliphy (and backports should be raised in the release meeting by the corresponding L2)

missirol · 2020-08-24T22:06:34Z

the last commits removed dependence on fastjet, but introduced a numerical difference

@slava77

In case it helps: numerical changes in the outputs are introduced only by the very last commit of the PR, i.e. e0f02d3
(we didn't verify this explicitly in the PR review, but I had checked it locally).

silviodonato · 2020-08-25T08:22:30Z

+1

missirol · 2020-08-25T13:27:11Z

@slava77 , one late question from my side: would it be okay to backport this (except for the last commit) to 11_1_X, so that it can be used for the Phase-2 HLT TDR studies?

slava77 · 2020-08-25T13:57:03Z

@slava77 , one late question from my side: would it be okay to backport this (except for the last commit) to 11_1_X, so that it can be used for the Phase-2 HLT TDR studies?

sure, but please limit it to the commit range that does not change the results.

kpedro88 · 2020-08-26T19:34:57Z

@missirol thanks for this PR! It's a worthy followup to my initial effort in #23270.

Could you make the same limited backport to 10_6_X? It would be nice to propagate this speedup to users for ultra-legacy analysis.

missirol · 2020-08-27T16:12:59Z

Could you make the same limited backport to 10_6_X? It would be nice to propagate this speedup to users for ultra-legacy analysis.

I'm looking into a backport of this PR to 10_6_X (taking into account the no-change policy). It would not be entirely trivial, because the Puppi implementation in earlier releases (incl. 10_6) is significantly different; nevertheless, I'll give it a try.

slava77 · 2020-08-27T16:21:57Z

I'm looking into a backport of this PR to 10_6_X (taking into account the no-change policy). It would not be entirely trivial, because the Puppi implementation in earlier releases (incl. 10_6) is significantly different; nevertheless, I'll give it a try.

Thank you.

missirol · 2020-08-29T06:25:47Z

Backport to 10_6_X tried in #31290.

[Puppi] performance improvements

0c3b885

cmsbuild added this to the CMSSW_11_2_X milestone Aug 18, 2020

cmsbuild added analysis-pending code-checks-pending comparison-pending orp-pending pending-signatures reconstruction-pending tests-pending labels Aug 18, 2020

cmsbuild added code-checks-approved and removed code-checks-pending labels Aug 18, 2020

cmsbuild added tests-started and removed tests-pending labels Aug 18, 2020

cmsbuild added tests-approved and removed tests-started labels Aug 18, 2020

cmsbuild added comparison-available and removed comparison-pending labels Aug 18, 2020

cmsbuild added tests-approved and removed tests-started labels Aug 21, 2020

cmsbuild added comparison-available and removed comparison-pending labels Aug 21, 2020

cmsbuild added analysis-approved and removed analysis-pending labels Aug 24, 2020

cmsbuild added fully-signed reconstruction-approved and removed pending-signatures reconstruction-pending labels Aug 24, 2020

cmsbuild added orp-approved and removed orp-pending labels Aug 25, 2020

cmsbuild merged commit 25a40ac into cms-sw:master Aug 25, 2020

missirol mentioned this pull request Aug 25, 2020

[11_1_X] improving performance of PuppiProducer #31241

Merged

cmsbuild mentioned this pull request Aug 25, 2020

[Do not Merge] testing new aarch64 container #31231

Closed

missirol mentioned this pull request Aug 29, 2020

[10_6_X] improving performance of PuppiProducer #31290

Merged

missirol deleted the devel_112X_puppi_speedup2 branch September 16, 2020 17:25

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

improving performance of PuppiProducer #31164

improving performance of PuppiProducer #31164

missirol commented Aug 18, 2020

slava77 commented Aug 18, 2020

mrodozov commented Aug 18, 2020 •

edited

davidlange6 commented Aug 18, 2020 via email

mrodozov commented Aug 18, 2020

davidlange6 commented Aug 18, 2020 via email

cmsbuild commented Aug 18, 2020

cmsbuild commented Aug 18, 2020

cmsbuild commented Aug 18, 2020

slava77 commented Aug 18, 2020

cmsbuild commented Aug 18, 2020 •

edited

cmsbuild commented Aug 18, 2020

cmsbuild commented Aug 18, 2020

cmsbuild commented Aug 18, 2020

cmsbuild commented Aug 21, 2020

cmsbuild commented Aug 21, 2020

santocch commented Aug 24, 2020

slava77 commented Aug 24, 2020

cmsbuild commented Aug 24, 2020

missirol commented Aug 24, 2020

silviodonato commented Aug 25, 2020

missirol commented Aug 25, 2020

slava77 commented Aug 25, 2020

kpedro88 commented Aug 26, 2020

missirol commented Aug 27, 2020

slava77 commented Aug 27, 2020

missirol commented Aug 29, 2020

improving performance of PuppiProducer #31164

improving performance of PuppiProducer #31164

Conversation

missirol commented Aug 18, 2020

PR description:

PR validation:

if this PR is a backport please specify the original PR and why you need to backport that PR:

slava77 commented Aug 18, 2020

mrodozov commented Aug 18, 2020 • edited

davidlange6 commented Aug 18, 2020 via email

mrodozov commented Aug 18, 2020

davidlange6 commented Aug 18, 2020 via email

cmsbuild commented Aug 18, 2020

cmsbuild commented Aug 18, 2020

cmsbuild commented Aug 18, 2020

slava77 commented Aug 18, 2020

cmsbuild commented Aug 18, 2020 • edited

cmsbuild commented Aug 18, 2020

cmsbuild commented Aug 18, 2020

cmsbuild commented Aug 18, 2020

cmsbuild commented Aug 21, 2020

cmsbuild commented Aug 21, 2020

santocch commented Aug 24, 2020

slava77 commented Aug 24, 2020

cmsbuild commented Aug 24, 2020

missirol commented Aug 24, 2020

silviodonato commented Aug 25, 2020

missirol commented Aug 25, 2020

slava77 commented Aug 25, 2020

kpedro88 commented Aug 26, 2020

missirol commented Aug 27, 2020

slava77 commented Aug 27, 2020

missirol commented Aug 29, 2020

mrodozov commented Aug 18, 2020 •

edited

cmsbuild commented Aug 18, 2020 •

edited