Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add ParticleNetAK4 to JME custom NanoAOD #31714

Merged
merged 1 commit into from Nov 10, 2020

Conversation

hqucms
Copy link
Contributor

@hqucms hqucms commented Oct 8, 2020

PR description:

This PR adds the ParticleNetAK4 tagger scores to the JME custom NanoAOD. The tagger is added to both AK4 CHS and Puppi jets.

PR validation:

Tested NANO step on a UL17 ttbar sample w/ --customise PhysicsTools/NanoAOD/custom_jme_cff.PrepJMECustomNanoAOD_MC, and the tagger scores are correctly added.

The event size increases from 11.78kb to 12.07kb (+2.5%) on ttbar MC.

FYI @camclean @alefisico

@cmsbuild
Copy link
Contributor

cmsbuild commented Oct 8, 2020

The code-checks are being triggered in jenkins.

@cmsbuild
Copy link
Contributor

cmsbuild commented Oct 8, 2020

+code-checks

Logs: https://cmssdt.cern.ch/SDT/code-checks/cms-sw-PR-31714/18899

  • This PR adds an extra 20KB to repository

@cmsbuild
Copy link
Contributor

cmsbuild commented Oct 8, 2020

A new Pull Request was created by @hqucms (Huilin Qu) for master.

It involves the following packages:

PhysicsTools/NanoAOD

@gouskos, @cmsbuild, @fgolf, @mariadalfonso, @santocch, @peruzzim can you please review it and eventually sign? Thanks.
@gpetruc this is something you requested to watch as well.
@silviodonato, @dpiparo, @qliphy you are the release manager for this.

cms-bot commands are listed here

@mariadalfonso
Copy link
Contributor

please test workflow 25202.15,10224.15,11024.15

@cmsbuild
Copy link
Contributor

cmsbuild commented Oct 8, 2020

The tests are being triggered in jenkins.
Test Parameters:

@@ -25,7 +25,8 @@
'pfDeepFlavourJetTags:probb','pfDeepFlavourJetTags:probbb','pfDeepFlavourJetTags:problepb',
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

pfCombinedMVAV2BJetTags and pfCombinedInclusiveSecondaryVertexV2BJetTags are already in the central nano, why are they added also in the JMEnano with a different name ?

btagCMVA = Var("bDiscriminator('pfCombinedMVAV2BJetTags')",float,doc="CMVA V2 btag discriminator",precision=10),
btagCSVV2 = Var("bDiscriminator('pfCombinedInclusiveSecondaryVertexV2BJetTags')",float,doc=" pfCombinedInclusiveSecondaryVertexV2 b-tag discriminator (aka CSVV2)",precision=10)

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@camclean @alefisico any comment ?

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

These variables are included in the central nano for the Jet collection (AK4 CHS) but in our JMEnano we included other AK4 collections (PUPPI/PF) where we want to have the same content for comparison. So we need to include these variables in the new jet collections.

@cmsbuild
Copy link
Contributor

cmsbuild commented Oct 8, 2020

+1
Tested at: 590576d
https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-015821/9818/summary.html
CMSSW: CMSSW_11_2_X_2020-10-08-1100
SCRAM_ARCH: slc7_amd64_gcc820

@cmsbuild
Copy link
Contributor

cmsbuild commented Oct 8, 2020

Comparison job queued.

@cmsbuild
Copy link
Contributor

cmsbuild commented Oct 8, 2020

Comparison is ready
https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-015821/9818/summary.html

@slava77 comparisons for the following workflows were not done due to missing matrix map:

  • /data/cmsbld/jenkins/workspace/compare-root-files-short-matrix/data/PR-015821/10224.15_TTbar_13+2017PU_JMENano+TTbar_13TeV_TuneCUETP8M1_GenSim+DigiPU+RecoFakeHLTPU+HARVESTFakeHLTPU+Nano
  • /data/cmsbld/jenkins/workspace/compare-root-files-short-matrix/data/PR-015821/11024.15_TTbar_13+2018PU_JMENano+TTbar_13TeV_TuneCUETP8M1_GenSim+DigiPU+RecoFakeHLTPU+HARVESTFakeHLTPU+Nano
  • /data/cmsbld/jenkins/workspace/compare-root-files-short-matrix/data/PR-015821/25202.15_TTbar_13+TTbar_13+DIGIUP15_PU25+RECOUP15_PU25+HARVESTUP15_PU25+NANOUP15MC_PU25_JME

Comparison Summary:

  • No significant changes to the logs found
  • Reco comparison results: 4 differences found in the comparisons
  • DQMHistoTests: Total files compared: 35
  • DQMHistoTests: Total histograms compared: 2542225
  • DQMHistoTests: Total failures: 7
  • DQMHistoTests: Total nulls: 0
  • DQMHistoTests: Total successes: 2542196
  • DQMHistoTests: Total skipped: 22
  • DQMHistoTests: Total Missing objects: 0
  • DQMHistoSizes: Histogram memory added: 0.0 KiB( 34 files compared)
  • Checked 149 log files, 22 edm output root files, 35 DQM output files

@mariadalfonso
Copy link
Contributor

+xpog

central nano are not touched, only JMEnano

@silviodonato
Copy link
Contributor

Hi @gouskos sorry for the late reply, but I was also very surprise by the nJet multiplicity. We performed some checks and the difference is real. We see similar nJet multiplicity for genJets and JetPuppi (as you can see in the plot below, thanks @kirschen) AK4 Puppi jets performs really good at low pt, removing most of the pileup jets that CHS finds.
I think this is something important to keep in mind. We will need CHS and PUPPI AK4 jets in the transition period, but after that the size of our JMEnano samples will be significantly smaller.

@kirschen are you asking @gouskos @mariadalfonso to keep the PR as it is?

@kirschen
Copy link
Contributor

kirschen commented Nov 9, 2020

Hi @gouskos sorry for the late reply, but I was also very surprise by the nJet multiplicity. We performed some checks and the difference is real. We see similar nJet multiplicity for genJets and JetPuppi (as you can see in the plot below, thanks @kirschen) AK4 Puppi jets performs really good at low pt, removing most of the pileup jets that CHS finds.
I think this is something important to keep in mind. We will need CHS and PUPPI AK4 jets in the transition period, but after that the size of our JMEnano samples will be significantly smaller.

@kirschen are you asking @gouskos @mariadalfonso to keep the PR as it is?

The PUPPI jet multiplicity and size consolidation as suggested in cms-nanoAOD#549 are unrelated to this PR, so I would say this can go ahead. We will address what jets exactly to keep at low pt and what level of information to keep down to which pt in a separate PR.

@santocch
Copy link

+1

@mariadalfonso
Copy link
Contributor

please test workflow 25202.15,10224.15,11024.15

@cmsbuild
Copy link
Contributor

cmsbuild commented Nov 10, 2020

The tests are being triggered in jenkins.
Test Parameters:

@cmsbuild
Copy link
Contributor

+1
Tested at: f9fbb26
https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-015821/10616/summary.html
CMSSW: CMSSW_11_2_X_2020-11-10-1100
SCRAM_ARCH: slc7_amd64_gcc820

@cmsbuild
Copy link
Contributor

Comparison job queued.

@cmsbuild
Copy link
Contributor

Comparison is ready
https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-015821/10616/summary.html

@slava77 comparisons for the following workflows were not done due to missing matrix map:

  • /data/cmsbld/jenkins/workspace/compare-root-files-short-matrix/data/PR-015821/10224.15_TTbar_13+2017PU_JMENano+TTbar_13TeV_TuneCUETP8M1_GenSim+DigiPU+RecoFakeHLTPU+HARVESTFakeHLTPU+Nano
  • /data/cmsbld/jenkins/workspace/compare-root-files-short-matrix/data/PR-015821/11024.15_TTbar_13+2018PU_JMENano+TTbar_13TeV_TuneCUETP8M1_GenSim+DigiPU+RecoFakeHLTPU+HARVESTFakeHLTPU+Nano
  • /data/cmsbld/jenkins/workspace/compare-root-files-short-matrix/data/PR-015821/25202.15_TTbar_13+TTbar_13+DIGIUP15_PU25+RECOUP15_PU25+HARVESTUP15_PU25+NANOUP15MC_PU25_JME

Comparison Summary:

  • No significant changes to the logs found
  • Reco comparison results: 0 differences found in the comparisons
  • DQMHistoTests: Total files compared: 35
  • DQMHistoTests: Total histograms compared: 2529296
  • DQMHistoTests: Total failures: 1
  • DQMHistoTests: Total nulls: 0
  • DQMHistoTests: Total successes: 2529273
  • DQMHistoTests: Total skipped: 22
  • DQMHistoTests: Total Missing objects: 0
  • DQMHistoSizes: Histogram memory added: 0.0 KiB( 34 files compared)
  • Checked 148 log files, 22 edm output root files, 35 DQM output files

@mariadalfonso
Copy link
Contributor

+xpog

changes only to the jmenano;
the size keep increasing and refinement will be done in a separate PR;

@cmsbuild
Copy link
Contributor

This pull request is fully signed and it will be integrated in one of the next master IBs (tests are also fine). This pull request will now be reviewed by the release team before it's merged. @silviodonato, @dpiparo, @qliphy (and backports should be raised in the release meeting by the corresponding L2)

@silviodonato
Copy link
Contributor

+1

@cmsbuild cmsbuild merged commit 8c057b2 into cms-sw:master Nov 10, 2020
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

9 participants