Add function to refine FastSim DeepJet discriminators #40553

wolfmor · 2023-01-18T08:10:11Z

PR description:

Requires: cms-data/PhysicsTools-NanoAOD#14

This PR adds a function that uses a regression neural network to refine the DeepJet discriminators of CHS jets in NanoAOD for FastSim to better match FullSim. The function can be called by including the option --customise PhysicsTools/NanoAOD/jetsAK4_CHS_cff.nanoAOD_refineFastSim_bTagDeepFlav in the cmsDriver command and requires the ONNX model added in the above mentioned PR to cms-data. The original values are copied to new variables named with the suffix "unrefined".

Due to a bug in ONNX runtime 1.10.0 (see here) graph optimization has to be disabled to evaluate the model. The corresponding option is implemented in BaseMVAValueMapProducer for the ONNX backend.

The technique has been presented at the FastSim Days 2022 Workshop. There are plans to make this the default for FastSim in the future and possibly to extend to further collections/variables.

A complete set of commands to produce NanoAOD files with refined DeepJet discriminators is:

cmsDriver.py TTbar_13TeV_TuneCUETP8M1_cfi --relval 100000,1000 -s GEN,SIM,RECOBEFMIX,DIGI:pdigi_valid,L1,DIGI2RAW,L1Reco,RECO,VALIDATION:@standardValidation,DQM:@standardDQMFS -n 10 --conditions auto:run2_mc --beamspot Realistic25ns13TeV2016Collision --datatier GEN-SIM-DIGI-RECO,DQMIO --eventcontent FEVTDEBUGHLT,DQM --fast --era Run2_2016

cmsDriver.py step3 -s PAT --era Run2_2016 -n -1 --conditions auto:run2_mc --mc --datatier MINIAODSIM --eventcontent MINIAODSIM --filein file:TTbar_13TeV_TuneCUETP8M1_cfi_GEN_SIM_RECOBEFMIX_DIGI_L1_DIGI2RAW_L1Reco_RECO_VALIDATION_DQM.root --fast

cmsDriver.py --python_filename NanoAODrefined_cfg.py --eventcontent NANOAODSIM --fast --customise Configuration/DataProcessing/Utils.addMonitoring,PhysicsTools/NanoAOD/jetsAK4_CHS_cff.nanoAOD_refineFastSim_bTagDeepFlav --datatier NANOAODSIM --fileout file:step3_NANO.root --conditions auto:run2_mc --step NANO --filein "file:step3_PAT.root" --era run2_nanoAOD_106Xv2 --mc -n -1

PR validation:

The neural network has been trained on GEN-synchronized FastSim/FullSim jet pairs from SUSY simplified model T1tttt events and has been validated also in TTbar events. In both cases, considerably improved agreement with the FullSim output and an improvement in correlations among output observables and external parameters is seen.

If this PR is a backport please specify the original PR and why you need to backport that PR. If this PR will be backported please specify to which release cycle the backport is meant for:

Needs to be backported to 12_6.

@sbein @kpedro88

cmsbuild · 2023-01-18T08:18:24Z

+code-checks

Logs: https://cmssdt.cern.ch/SDT/code-checks/cms-sw-PR-40553/33785

This PR adds an extra 20KB to repository
There are other open Pull requests which might conflict with changes you have proposed:
- File PhysicsTools/NanoAOD/python/jetsAK4_CHS_cff.py modified in PR(s): Fix for wf 11634.15 #40539

cmsbuild · 2023-01-18T08:18:48Z

A new Pull Request was created by @wolfmor for master.

It involves the following packages:

PhysicsTools/NanoAOD (xpog)
PhysicsTools/PatAlgos (xpog, reconstruction)

@cmsbuild, @mandrenguyen, @clacaputo, @swertz, @vlimant can you please review it and eventually sign? Thanks.
@AlexDeMoor, @rappoccio, @gouskos, @jdolen, @JyothsnaKomaragiri, @ahinzmann, @AnnikaStein, @schoef, @emilbols, @jdamgov, @mbluj, @nhanvtran, @gkasieczka, @hatakeyamak, @gpetruc, @azotz, @mariadalfonso, @demuller, @andrzejnovak, @seemasharmafnal, @mmarionncern this is something you requested to watch as well.
@perrotta, @dpiparo, @rappoccio you are the release manager for this.

cms-bot commands are listed here

mandrenguyen · 2023-01-18T11:26:11Z

type btv

kpedro88 · 2023-01-18T13:50:20Z

please test

cmsbuild · 2023-01-18T16:20:45Z

+1

Summary: https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-19f146/30061/summary.html
COMMIT: 4ca1787
CMSSW: CMSSW_13_0_X_2023-01-18-1100/el8_amd64_gcc11
User test area: For local testing, you can use /cvmfs/cms-ci.cern.ch/week0/cms-sw/cmssw/40553/30061/install.sh to create a dev area with all the needed externals and cmssw changes.

Comparison Summary

Summary:

No significant changes to the logs found
Reco comparison results: 8 differences found in the comparisons
DQMHistoTests: Total files compared: 49
DQMHistoTests: Total histograms compared: 3555479
DQMHistoTests: Total failures: 3
DQMHistoTests: Total nulls: 0
DQMHistoTests: Total successes: 3555454
DQMHistoTests: Total skipped: 22
DQMHistoTests: Total Missing objects: 0
DQMHistoSizes: Histogram memory added: 0.0 KiB( 48 files compared)
Checked 211 log files, 162 edm output root files, 49 DQM output files
TriggerResults: no differences found

mandrenguyen · 2023-01-22T08:59:20Z

@wolfmor Would it be worth adding the driver commands you list in the PR description as a RelVal workflow? In particular, as you indicate that further developments are coming.

kpedro88 · 2023-01-22T15:44:35Z

@mandrenguyen we're working on exactly that

…rom-CMSSW_13_0_X_2023-01-17-1100 add test workflow

kpedro88 · 2023-01-31T16:25:34Z

@cms-sw/pdmv-l2 @cms-sw/upgrade-l2 please check and sign? workflow changes are hopefully straightforward, let me know if you have any concerns.

srimanob · 2023-01-31T17:27:54Z

+Upgrade

sunilUIET · 2023-02-01T16:09:51Z

+pdmv

cmsbuild · 2023-02-01T16:10:15Z

This pull request is fully signed and it will be integrated in one of the next master IBs (tests are also fine). This pull request will now be reviewed by the release team before it's merged. @perrotta, @dpiparo, @rappoccio (and backports should be raised in the release meeting by the corresponding L2)

perrotta · 2023-02-01T21:35:35Z

+1

srimanob · 2023-04-13T11:34:41Z

Hi @wolfmor @kpedro88 @sbein
SUS is planning for Run3-2022 FastSim production in 12_4. Should this PR and ONNX model for refinement be backport to 12_4? Or if there will be technical detail with backport? Thanks.

sbein · 2023-04-13T11:47:11Z

Hi @srimanob yes, ideally this should be backported so that the refinement can be used in Run 3. My plan was once the backport to UL is merged #40828 (comment), I will do the backport to 12_4. This will probably take another week.

srimanob · 2023-04-13T11:54:45Z

Thanks @sbein
So I will note this in the production plan to @cms-sw/pdmv-l2

srimanob · 2023-04-14T06:32:33Z

Hi @sbein @kpedro88
Will we face technical issue if backport to 12_6 first? This is not about Run-2, but Run-3 2022 NanoV11. Thx.

swertz · 2023-04-14T06:48:54Z

Since the refinement runs at Nano level, for Run3 samples why can't you simply use the imminent NanoV12 campaigns in 13_0 that will take 12_4 MINI samples as input?

FYI @simonepigazzini

srimanob · 2023-04-14T07:02:20Z

Hi @swertz
What I am asking is based on the production done for Summer22 campaign. Currently, they run NanoV10 and 11. If SUS will mix Fast/Full samples, it is better to run the same version. So either run V12 in production, or backport to 12_6 for V11. I can put this comment on the discussion also. Thx.

https://cms-pdmv.cern.ch/mcm/campaigns?prepid=Run3*22*Nano*&page=-1&shown=16447

swertz · 2023-04-14T07:13:25Z

Hi @srimanob , so much is clear, I was also referring to Summer22 MC. Don't forget that current Summer22 and Nano v10/11 is not "complete" in the sense that many jet-related ingredients (PUPPI tune, taggers) were not updated yet. NanoV12 will be run on Summer22 MC and will contain all the recommended ingredients for analysis of Run3 data.

I can see why you'd want some FastSim MC in 12_6/NanoV11 to be able to quickly implement Fast/Full comparisons with the existing samples, but just keep in mind that for physics results, in most cases you'll need to use NanoV12 anyway.

Another point: this PR only implemented refinement for taggers in CHS jets (which makes sense for the Run2 UL backport), but Run3 samples contain PUPPI jets...

Add function to refine FastSim DeepJet discriminators

4ca1787

wolfmor mentioned this pull request Jan 18, 2023

Add function to refine FastSim DeepJet discriminators #40550

Closed

cmsbuild added this to the CMSSW_13_0_X milestone Jan 18, 2023

cmsbuild added code-checks-pending orp-pending pending-signatures reconstruction-pending tests-pending xpog-pending labels Jan 18, 2023

wolfmor mentioned this pull request Jan 18, 2023

Add ONNX model for FastSim DeepJet refinement cms-data/PhysicsTools-NanoAOD#14

Merged

cmsbuild added code-checks-approved and removed code-checks-pending labels Jan 18, 2023

cmsbuild added the btv label Jan 18, 2023

cmsbuild added tests-started and removed tests-pending labels Jan 18, 2023

cmsbuild added tests-approved and removed tests-started labels Jan 18, 2023

cmsbuild mentioned this pull request Jan 20, 2023

[HLT] [CLANG] Fix unused-but-set-variable warnings #40578

Merged

kpedro88 added 2 commits January 20, 2023 16:35

add nanoAOD to run2 FS matrix workflows

2085653

add test 2016 workflow for fastsim refinement

565306c

Merge pull request #1 from kpedro88/fastsim-refine-btagdeepflav-CHS_f…

324d88e

…rom-CMSSW_13_0_X_2023-01-17-1100 add test workflow

cmsbuild removed tests-approved code-checks-approved labels Jan 23, 2023

cmsbuild added xpog-approved and removed xpog-pending labels Jan 30, 2023

cmsbuild added upgrade-approved and removed upgrade-pending labels Jan 31, 2023

This was referenced Jan 31, 2023

Use --beamspot Nominal2022PbPbCollision for all Run 3 heavy-ion workflows #40653

Merged

Add a cosmics PCL workflow to runTheMatrix #40661

Merged

clacaputo mentioned this pull request Feb 1, 2023

Custom Run 3 PFScouting NanoAOD #40438

Merged

cmsbuild added fully-signed pdmv-approved and removed pending-signatures pdmv-pending labels Feb 1, 2023

cmsbuild mentioned this pull request Feb 1, 2023

The Lorentz Angle Prompt Calibration Loop for Pixel Forward Phase 1 detector #40664

Merged

cmsbuild added orp-approved and removed orp-pending labels Feb 1, 2023

cmsbuild merged commit 4fc972d into cms-sw:master Feb 1, 2023

This was referenced Feb 2, 2023

[Tensorflow] Build with GPU enabled cms-sw/cmsdist#7648

Merged

Update Tensorflow to version 2.11.0 cms-sw/cmsdist#8258

Closed

sbein mentioned this pull request Feb 20, 2023

Backport FastSim refiner network for NANO (DeepJet for AK4 CHS Jets) #40828

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add function to refine FastSim DeepJet discriminators #40553

Add function to refine FastSim DeepJet discriminators #40553

wolfmor commented Jan 18, 2023 •

edited

cmsbuild commented Jan 18, 2023

cmsbuild commented Jan 18, 2023

mandrenguyen commented Jan 18, 2023

kpedro88 commented Jan 18, 2023

cmsbuild commented Jan 18, 2023

mandrenguyen commented Jan 22, 2023

kpedro88 commented Jan 22, 2023

kpedro88 commented Jan 31, 2023

srimanob commented Jan 31, 2023

sunilUIET commented Feb 1, 2023

cmsbuild commented Feb 1, 2023

perrotta commented Feb 1, 2023

srimanob commented Apr 13, 2023

sbein commented Apr 13, 2023 •

edited

srimanob commented Apr 13, 2023

srimanob commented Apr 14, 2023

swertz commented Apr 14, 2023 •

edited

srimanob commented Apr 14, 2023 •

edited

swertz commented Apr 14, 2023

Add function to refine FastSim DeepJet discriminators #40553

Add function to refine FastSim DeepJet discriminators #40553

Conversation

wolfmor commented Jan 18, 2023 • edited

PR description:

PR validation:

If this PR is a backport please specify the original PR and why you need to backport that PR. If this PR will be backported please specify to which release cycle the backport is meant for:

cmsbuild commented Jan 18, 2023

cmsbuild commented Jan 18, 2023

mandrenguyen commented Jan 18, 2023

kpedro88 commented Jan 18, 2023

cmsbuild commented Jan 18, 2023

Comparison Summary

mandrenguyen commented Jan 22, 2023

kpedro88 commented Jan 22, 2023

kpedro88 commented Jan 31, 2023

srimanob commented Jan 31, 2023

sunilUIET commented Feb 1, 2023

cmsbuild commented Feb 1, 2023

perrotta commented Feb 1, 2023

srimanob commented Apr 13, 2023

sbein commented Apr 13, 2023 • edited

srimanob commented Apr 13, 2023

srimanob commented Apr 14, 2023

swertz commented Apr 14, 2023 • edited

srimanob commented Apr 14, 2023 • edited

swertz commented Apr 14, 2023

wolfmor commented Jan 18, 2023 •

edited

sbein commented Apr 13, 2023 •

edited

swertz commented Apr 14, 2023 •

edited

srimanob commented Apr 14, 2023 •

edited