Updated MLPF producer with ONNX #36841

jpata · 2022-01-31T09:47:17Z

PR description:

Following the presentation at the PPD general meeting, at ACAT2021, and as outlined in the PPD workshop, we are updating the MLPF integration in CMSSW to facilitate further development and scrutiny.

Note that MLPF is off by default and thus no changes are expected in any of the standard workflows.

This PR mainly updates the ML model and switches the inference to ONNX from tensorflow. MLPF-specific event content is removed, as now MLPF produces PFCandidates instead of (rather than in parallel to) PFAlgo, when enabled.

Here are the igprof results:

PR validation:

For physics validation, please see the slides linked above. The integration can be tested in the workflows 11843.13 and 11834.13.

jpata · 2022-01-31T09:48:43Z

test parameters:

pull_requests = update the MLPF model cms-data/RecoParticleFlow-PFProducer#3
workflows = 11834.13, 11843.13
relvals_opt = --what upgrade,standard,highstats,pileup,generator,extendedgen,production,ged,machine,premix

cmsbuild · 2022-01-31T09:55:41Z

-code-checks

Logs: https://cmssdt.cern.ch/SDT/code-checks/cms-sw-PR-36841/28032

This PR adds an extra 60KB to repository
There are other open Pull requests which might conflict with changes you have proposed:
- File Configuration/PyReleaseValidation/python/upgradeWorkflowComponents.py modified in PR(s): Phase 2 HLT menu targetting 7.5×10³⁴ cm⁻²s⁻¹ #35342, Phase2 tracker: add T30 (geometry with more realistic TFPX) #36660, Add hgcalv16 modifier and corresponding era #36753, ECAL Phase 2 Development WF 28234.61 fix #36748, Add run3 tracking low pu era #33532

Code check has found code style and quality issues which could be resolved by applying following patch(s)

code-format:
https://cmssdt.cern.ch/SDT/code-checks/cms-sw-PR-36841/28032/code-format.patch
e.g. curl -k https://cmssdt.cern.ch/SDT/code-checks/cms-sw-PR-36841/28032/code-format.patch | patch -p1
You can also run scram build code-format to apply code format directly

cmsbuild · 2022-01-31T10:05:40Z

+code-checks

Logs: https://cmssdt.cern.ch/SDT/code-checks/cms-sw-PR-36841/28033

This PR adds an extra 132KB to repository
There are other open Pull requests which might conflict with changes you have proposed:
- File Configuration/PyReleaseValidation/python/upgradeWorkflowComponents.py modified in PR(s): Phase 2 HLT menu targetting 7.5×10³⁴ cm⁻²s⁻¹ #35342, Phase2 tracker: add T30 (geometry with more realistic TFPX) #36660, Add hgcalv16 modifier and corresponding era #36753, ECAL Phase 2 Development WF 28234.61 fix #36748, Add run3 tracking low pu era #33532

jpata · 2022-02-04T15:32:48Z

adding @cms-sw/pf-l2 @laurenhay here, just to make sure everyone is informed.

cmsbuild · 2022-02-05T09:12:41Z

+1

Summary: https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-bafd5c/22230/summary.html
COMMIT: 539f744
CMSSW: CMSSW_12_3_X_2022-02-04-1100/slc7_amd64_gcc10
User test area: For local testing, you can use /cvmfs/cms-ci.cern.ch/week0/cms-sw/cmssw/36841/22230/install.sh to create a dev area with all the needed externals and cmssw changes.

The following merge commits were also included on top of IB + this PR after doing git cms-merge-topic:

@Dr15Jones Intialize member in GEMClusterProcessor constructor #36881

You can see more details here:
https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-bafd5c/22230/git-recent-commits.json
https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-bafd5c/22230/git-merge-result

Comparison Summary

@slava77 comparisons for the following workflows were not done due to missing matrix map:

/data/cmsbld/jenkins/workspace/compare-root-files-short-matrix/data/PR-bafd5c/11834.13_TTbar_14TeV+2021PU_mlpf+TTbar_14TeV_TuneCP5_GenSim+DigiPU+RecoNanoPU+HARVESTNanoPU
/data/cmsbld/jenkins/workspace/compare-root-files-short-matrix/data/PR-bafd5c/11843.13_QCD_FlatPt_15_3000HS_14+2021PU_mlpf+QCDForPF_14TeV_TuneCP5_GenSim+DigiPU+RecoNanoPU+HARVESTNanoPU

Summary:

No significant changes to the logs found
Reco comparison results: 0 differences found in the comparisons
DQMHistoTests: Total files compared: 46
DQMHistoTests: Total histograms compared: 3766018
DQMHistoTests: Total failures: 2
DQMHistoTests: Total nulls: 0
DQMHistoTests: Total successes: 3765994
DQMHistoTests: Total skipped: 22
DQMHistoTests: Total Missing objects: 0
DQMHistoSizes: Histogram memory added: 0.0 KiB( 45 files compared)
Checked 193 log files, 42 edm output root files, 46 DQM output files
TriggerResults: no differences found

jfernan2 · 2022-02-07T12:13:01Z

+1

clacaputo · 2022-02-07T14:45:25Z

+reconstruction

no changes are expected since MLPF is off by default
WFs 11834.13 and 11843.13 run without problems

srimanob · 2022-02-07T15:56:20Z

+Upgrade

From the code related to Upgrade, updating the MLPF workflow to allow QCD sample is fine.

kskovpen · 2022-02-07T15:58:12Z

+pdmv

cmsbuild · 2022-02-07T15:58:39Z

This pull request is fully signed and it will be integrated in one of the next master IBs (tests are also fine). This pull request will now be reviewed by the release team before it's merged. @perrotta, @dpiparo, @qliphy (and backports should be raised in the release meeting by the corresponding L2)

perrotta · 2022-02-10T11:11:34Z

RecoParticleFlow/PFProducer/plugins/MLPFProducer.cc

-  tensorflow::Tensor input(tensorflow::DT_FLOAT, shape);
-  input.flat<float>().setZero();
+#ifdef MLPF_DEBUG
+  std::cout << "tensor_size=" << tensor_size << std::endl;


Is this needed, given the previous assert's?
If really needed to debug, then maybe better to have this cout before L55

true, it would perhaps be more informative (and less lines) to print out before the assert while debugging. can we address this in a follow-up version, or would you prefer a resigning of this PR?

Ok, do not request 4+1 signatures only to move one comment line: let postpone it to a follow-up PR

perrotta · 2022-02-10T11:21:45Z

RecoParticleFlow/PFProducer/src/MLPFModel.cc

-    cand.setPdgId(pred_pid);
-    cand.setCharge(charge);
+    reco::PFCandidate::ParticleType particleType(reco::PFCandidate::X);
+    if (pred_pid == 211)


It looks like this pred_pid is unsigned (here and everywhere else in this code), i.e. it does not consider the charge: can you confirm, just for check?

Yes, by construction, the underlying model reconstructs the absolute value of the PID separately from the charge.

So far, we didn't present detailed results about the charge prediction (so //cand.setCharge(charge); downstream), but it should not be a major problem, as should be driven by the track information.

perrotta · 2022-02-11T13:47:58Z

+1

It goes together with update the MLPF model cms-data/RecoParticleFlow-PFProducer#3

jpata added 10 commits September 23, 2021 15:39

mlpf v2

db7b369

update with latest model cms_20210929_223058_191573

d5dfd44

fixes for MLPF

c997d20

update with acat2021 model

4c77212

add qcd mlpf workflow

2fc0c6f

add refs for GSF and BREM

d79f7eb

update model path

7aaa63b

reorder

3c5f255

some cleanup

7e76e95

revert whitespace

b67e911

cmsbuild added this to the CMSSW_12_3_X milestone Jan 31, 2022

cmsbuild added code-checks-pending dqm-pending orp-pending pdmv-pending pending-signatures reconstruction-pending tests-pending upgrade-pending labels Jan 31, 2022

jpata mentioned this pull request Jan 31, 2022

update the MLPF model cms-data/RecoParticleFlow-PFProducer#3

Merged

cmsbuild added the requires-external label Jan 31, 2022

cmsbuild added code-checks-rejected and removed code-checks-pending labels Jan 31, 2022

code-format

539f744

cmsbuild added code-checks-pending and removed code-checks-rejected labels Jan 31, 2022

cmsbuild removed the code-checks-pending label Jan 31, 2022

cmsbuild added tests-approved and removed tests-started labels Feb 5, 2022

cmsbuild added dqm-approved and removed dqm-pending labels Feb 7, 2022

cmsbuild added reconstruction-approved and removed reconstruction-pending labels Feb 7, 2022

cmsbuild added upgrade-approved and removed upgrade-pending labels Feb 7, 2022

cmsbuild added fully-signed pdmv-approved and removed pending-signatures pdmv-pending labels Feb 7, 2022

perrotta reviewed Feb 10, 2022

View reviewed changes

cmsbuild mentioned this pull request Feb 11, 2022

fix era to be compliant with underlying D91 geometry #36931

Merged

cmsbuild added orp-approved and removed orp-pending labels Feb 11, 2022

cmsbuild merged commit 18d26b4 into cms-sw:master Feb 11, 2022

smuzaffar mentioned this pull request Feb 11, 2022

Update tag for RecoParticleFlow-PFProducer to V16-02-00 cms-sw/cmsdist#7619

Merged

This was referenced Feb 11, 2022

Add SV variables to BTV OfflineDQM #36884

Merged

Fixed replace with FinalPath #36937

Merged

Add 2021 MinimumBias express and prompt reco wf to limited #36942

Merged

Update Run-3 data and MC GTs with several updates #36940

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Updated MLPF producer with ONNX #36841

Updated MLPF producer with ONNX #36841

jpata commented Jan 31, 2022 •

edited

jpata commented Jan 31, 2022 •

edited

cmsbuild commented Jan 31, 2022

cmsbuild commented Jan 31, 2022

jpata commented Feb 4, 2022

cmsbuild commented Feb 5, 2022

jfernan2 commented Feb 7, 2022

clacaputo commented Feb 7, 2022

srimanob commented Feb 7, 2022

kskovpen commented Feb 7, 2022

cmsbuild commented Feb 7, 2022

perrotta Feb 10, 2022

jpata Feb 10, 2022

perrotta Feb 10, 2022

perrotta Feb 10, 2022 •

edited

jpata Feb 10, 2022

perrotta commented Feb 11, 2022

Updated MLPF producer with ONNX #36841

Updated MLPF producer with ONNX #36841

Conversation

jpata commented Jan 31, 2022 • edited

PR description:

PR validation:

jpata commented Jan 31, 2022 • edited

cmsbuild commented Jan 31, 2022

cmsbuild commented Jan 31, 2022

jpata commented Feb 4, 2022

cmsbuild commented Feb 5, 2022

Comparison Summary

jfernan2 commented Feb 7, 2022

clacaputo commented Feb 7, 2022

srimanob commented Feb 7, 2022

kskovpen commented Feb 7, 2022

cmsbuild commented Feb 7, 2022

perrotta Feb 10, 2022

Choose a reason for hiding this comment

jpata Feb 10, 2022

Choose a reason for hiding this comment

perrotta Feb 10, 2022

Choose a reason for hiding this comment

perrotta Feb 10, 2022 • edited

Choose a reason for hiding this comment

jpata Feb 10, 2022

Choose a reason for hiding this comment

perrotta commented Feb 11, 2022

jpata commented Jan 31, 2022 •

edited

jpata commented Jan 31, 2022 •

edited

perrotta Feb 10, 2022 •

edited