Update to DNN-based strategy for outside-in seed generation in Muon HLT #37437

kondratyevd · 2022-04-01T20:18:44Z

PR description:

This is a follow-up on #35237.
The DNN-based approach to outside-in seed generation for muon HLT has been revised.

The DNN model has been retrained to address the following issues:
- Old models didn't perform well for low-pT muons.
- Overall HLT timing for old models was higher than that for the baseline approach w/o DNN.
- The approach of training two different models (barrel & endcap) was found to be inconvenient for optimization, and didn't provide a significant gain in terms of overall efficiency and timing.
Updates to DNN training:
- Low-pT muons are included in training.
- Hyperparameters optimized automatically using Keras Tuner (previously architecture was chosen manually). The optimized architecture features two hidden layers, with 1024 and 2048 nodes, respectively.
Change to TSGForOIDNN.cc plugin: remove split into barrel and endcap; use one model for all muons.

The updated model is added to cms-data: cms-data/RecoMuon-TrackerSeedGenerator#4. The older classifier models will not work with the updated plugin.

PR validation:

Slides describing the update: link.
The performance was checked on J/psi (low-pT) and Drell-Yan (high-pT) datasets. Overall HLT efficiency is similar for all models, while the timing for the newly trained models is improved.

cmsbuild · 2022-04-01T20:26:57Z

+code-checks

Logs: https://cmssdt.cern.ch/SDT/code-checks/cms-sw-PR-37437/29132

This PR adds an extra 20KB to repository

cmsbuild · 2022-04-01T20:27:21Z

A new Pull Request was created by @kondratyevd (Dmitry Kondratyev) for master.

It involves the following packages:

RecoMuon/TrackerSeedGenerator (reconstruction)

@jpata, @cmsbuild, @clacaputo, @slava77 can you please review it and eventually sign? Thanks.
@HuguesBrun, @abbiendi, @Fedespring, @bellan, @sscruz, @jhgoh, @CeliaFernandez, @trocino, @cericeci, @rociovilar this is something you requested to watch as well.
@perrotta, @dpiparo, @qliphy you are the release manager for this.

cms-bot commands are listed here

JanFSchulte · 2022-04-02T21:01:18Z

@missirol This together with the cms-data PR is for the HLT and would ideally also be backported to 12_3_0 if time permits.

missirol · 2022-04-02T21:29:32Z

Thanks for the info, @JanFSchulte . It looks like this PR is only updating the producer, so it does not really need to be tested together with cms-data/RecoMuon-TrackerSeedGenerator#4 , correct?

Concerning the backport to 12_3_X, I'm not sure how the backporting works for the cms-data update. If needed, you could ask in that PR.

missirol · 2022-04-02T21:31:25Z

urgent

MUO-HLT developers intend to have this update backported in time for 12_3_0.

("urgent" here means "to be backported in time for 12_3_0").

JanFSchulte · 2022-04-02T21:48:28Z

Yes, the cms-data PR does not affect the tests. In fact, the producer is not run in any tests at all, so they are meaningless for this PR.

jpata · 2022-04-04T09:36:35Z

assign hlt

cmsbuild · 2022-04-04T09:36:56Z

New categories assigned: hlt

@missirol,@Martin-Grunewald you have been requested to review this Pull request/Issue and eventually sign? Thanks

Martin-Grunewald · 2022-04-04T10:07:57Z

please test

jpata · 2022-04-04T10:48:57Z

The model file is fairly large (the largest I'm aware of in CMSSW):

BIN +10.1 MB OIseeding/DNNclassifier_Run3_inclusive.pb

Have you rechecked memory and CPU performance of the model? I suppose it's for HLT to evaluate if it's appropriate (since AFAIK this doesn't run offline), but just to be aware.

missirol · 2022-04-04T11:57:12Z

Have you rechecked memory and CPU performance of the model? I suppose it's for HLT to evaluate if it's appropriate (since AFAIK this doesn't run offline), but just to be aware.

Thanks for pointing this out (for the record, this does not run at HLT either yet, not even the previous version of this DNN did).

@JanFSchulte @khaosmos93 @kondratyevd , for CPU timing I see the slides have the numbers for HLT, but do you also have numbers for the amount of memory used by the model?

( cc: @silviodonato )

cmsbuild · 2022-04-04T13:53:59Z

+1

Summary: https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-553822/23636/summary.html
COMMIT: 52ca744
CMSSW: CMSSW_12_4_X_2022-04-03-2300/slc7_amd64_gcc10
User test area: For local testing, you can use /cvmfs/cms-ci.cern.ch/week1/cms-sw/cmssw/37437/23636/install.sh to create a dev area with all the needed externals and cmssw changes.

Comparison Summary

Summary:

No significant changes to the logs found
Reco comparison results: 12 differences found in the comparisons
DQMHistoTests: Total files compared: 48
DQMHistoTests: Total histograms compared: 3593039
DQMHistoTests: Total failures: 25
DQMHistoTests: Total nulls: 1
DQMHistoTests: Total successes: 3592991
DQMHistoTests: Total skipped: 22
DQMHistoTests: Total Missing objects: 0
DQMHistoSizes: Histogram memory added: 0.004 KiB( 47 files compared)
DQMHistoSizes: changed ( 312.0 ): 0.004 KiB MessageLogger/Warnings
Checked 200 log files, 45 edm output root files, 48 DQM output files
TriggerResults: no differences found

missirol · 2022-04-04T18:11:27Z

+hlt

not validated by PR tests, relies on validation done by the MUO POG in the context of HLT development

jpata · 2022-04-05T07:40:52Z

+reconstruction

does not / is not intended to run offline, nothing for us to validate here (code changes look reasonable)

cmsbuild · 2022-04-05T07:41:11Z

This pull request is fully signed and it will be integrated in one of the next master IBs (tests are also fine). This pull request will now be reviewed by the release team before it's merged. @perrotta, @dpiparo, @qliphy (and backports should be raised in the release meeting by the corresponding L2)

missirol · 2022-04-05T08:09:08Z

for CPU timing I see the slides have the numbers for HLT, but do you also have numbers for the amount of memory used by the model?

@JanFSchulte @khaosmos93 @kondratyevd , please address this point, and open a backport of this PR to 12_3_X.

kondratyevd · 2022-04-05T13:05:55Z

@jpata @missirol

I have generated the igprof memory reports, but I didn't manage to turn them into interpretable web-navigable reports. The sql3 files are here (lxplus):
/afs/cern.ch/user/d/dkondrat/public/OIseeding_reports/ig_reports_apr4/
There are two files for comparison - with and w/o using DNN-based strategy.

jpata · 2022-04-05T13:54:47Z

I uploaded your memory profiles here:

Since this is probably a HLT workflow, I'm not really able to tell much about the expected/observed use from these.

kondratyevd · 2022-04-05T14:07:10Z

@jpata ah, I used the default igprof instructions and forgot to take into account Slava's recommendations to the previous PR: #35237 (comment).
I am going to reproduce these reports shortly.

kondratyevd · 2022-04-05T14:29:21Z

@jpata the new reports are available at /afs/cern.ch/user/d/dkondrat/public/OIseeding_reports/ig_reports_apr5/.
Could you please upload them again? Hopefully they are more informative now.

jpata · 2022-04-05T14:41:08Z

kondratyevd · 2022-04-05T15:46:20Z

Looking at the logs, it seems that unfortunately in the most recent tests the input file failed to open.
I have fixed the issue and updated the files in /afs/cern.ch/user/d/dkondrat/public/OIseeding_reports/ig_reports_apr5/.
@jpata could you please copy them once again? I apologize for the inconvenience.

jpata · 2022-04-06T07:17:27Z

Sure, I reuploaded them. The links are the same as above. Looks like the DNN is among the top modules by memory in this workflow, but I don't have a reference point for HLT, so it doesn't affect the reco signature.

perrotta · 2022-04-06T08:16:23Z

+1

It updates (optimizes) an existing model for outside-in seed generation in Muon HLT
This model is not used in the current developments for the actual HLT menu in Run3, only for tests: before using it the large memory consumption issue should be taken into account (partially addressed here bye use one model for all muons and remove split into barrel and endcap

update DNN-based OI seed generation

52ca744

kondratyevd mentioned this pull request Apr 1, 2022

Updated DNN classifier for outside-in seed generation in Muon HLT cms-data/RecoMuon-TrackerSeedGenerator#4

Merged

cmsbuild added this to the CMSSW_12_4_X milestone Apr 1, 2022

cmsbuild added code-checks-pending orp-pending pending-signatures reconstruction-pending tests-pending labels Apr 1, 2022

cmsbuild added code-checks-approved and removed code-checks-pending labels Apr 1, 2022

cmsbuild added the urgent label Apr 2, 2022

cmsbuild added the hlt-pending label Apr 4, 2022

cmsbuild added tests-started and removed tests-pending labels Apr 4, 2022

cmsbuild added tests-approved and removed tests-started labels Apr 4, 2022

cmsbuild removed the hlt-pending label Apr 4, 2022

cmsbuild added the hlt-approved label Apr 4, 2022

cmsbuild added fully-signed and removed reconstruction-pending pending-signatures labels Apr 5, 2022

cmsbuild added the reconstruction-approved label Apr 5, 2022

kondratyevd mentioned this pull request Apr 5, 2022

[Backport 12_3_X] Update to DNN-based strategy for outside-in seed generation in Muon HLT #37467

Merged

cmsbuild added orp-approved and removed orp-pending labels Apr 6, 2022

cmsbuild merged commit eb7657b into cms-sw:master Apr 6, 2022

smuzaffar mentioned this pull request Apr 6, 2022

Update tag for RecoMuon-TrackerSeedGenerator to V00-04-00 cms-sw/cmsdist#7754

Merged

iarspider mentioned this pull request Apr 6, 2022

Update tag for RecoMuon-TrackerSeedGenerator to V00-04-00 cms-sw/cmsdist#7755

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Update to DNN-based strategy for outside-in seed generation in Muon HLT #37437

Update to DNN-based strategy for outside-in seed generation in Muon HLT #37437

kondratyevd commented Apr 1, 2022 •

edited

cmsbuild commented Apr 1, 2022

cmsbuild commented Apr 1, 2022

JanFSchulte commented Apr 2, 2022

missirol commented Apr 2, 2022

missirol commented Apr 2, 2022

JanFSchulte commented Apr 2, 2022

jpata commented Apr 4, 2022

cmsbuild commented Apr 4, 2022

Martin-Grunewald commented Apr 4, 2022

jpata commented Apr 4, 2022

missirol commented Apr 4, 2022

cmsbuild commented Apr 4, 2022

missirol commented Apr 4, 2022

jpata commented Apr 5, 2022

cmsbuild commented Apr 5, 2022

missirol commented Apr 5, 2022

kondratyevd commented Apr 5, 2022

jpata commented Apr 5, 2022 •

edited

kondratyevd commented Apr 5, 2022

kondratyevd commented Apr 5, 2022

jpata commented Apr 5, 2022

kondratyevd commented Apr 5, 2022

jpata commented Apr 6, 2022

perrotta commented Apr 6, 2022

Update to DNN-based strategy for outside-in seed generation in Muon HLT #37437

Update to DNN-based strategy for outside-in seed generation in Muon HLT #37437

Conversation

kondratyevd commented Apr 1, 2022 • edited

PR description:

PR validation:

cmsbuild commented Apr 1, 2022

cmsbuild commented Apr 1, 2022

JanFSchulte commented Apr 2, 2022

missirol commented Apr 2, 2022

missirol commented Apr 2, 2022

JanFSchulte commented Apr 2, 2022

jpata commented Apr 4, 2022

cmsbuild commented Apr 4, 2022

Martin-Grunewald commented Apr 4, 2022

jpata commented Apr 4, 2022

missirol commented Apr 4, 2022

cmsbuild commented Apr 4, 2022

Comparison Summary

missirol commented Apr 4, 2022

jpata commented Apr 5, 2022

cmsbuild commented Apr 5, 2022

missirol commented Apr 5, 2022

kondratyevd commented Apr 5, 2022

jpata commented Apr 5, 2022 • edited

kondratyevd commented Apr 5, 2022

kondratyevd commented Apr 5, 2022

jpata commented Apr 5, 2022

kondratyevd commented Apr 5, 2022

jpata commented Apr 6, 2022

perrotta commented Apr 6, 2022

kondratyevd commented Apr 1, 2022 •

edited

jpata commented Apr 5, 2022 •

edited