Fix jets with crazy momenta in miniAOD #27374

perrotta · 2019-06-27T10:28:00Z

BoostedDoubleSVProd is crashing in nanoAOD production from miniAOD; see https://its.cern.ch/jira/browse/CMSCOMPPR-6445

The crash was traced to subjet components with NaNs in their momenta. Debugging one such case showed that the subjets with NaNs originate from jets with crazy momenta (|p| = 82 TeV in the example pinpointed in #27238).

The fix proposed in #27238 protects BoostedDoubleSVProd against this issue, but it doesn't solve the original bug, which will have to be identified and fixed where the jets are produced.

To reproduce, run a nanoAOD job, or any other job that accesses subjets from miniAOD, on the following event:

process.source.eventsToProcess = cms.untracked.VEventRange('303832:918:1081148408-303832:918:1081148408')
process.source.fileNames = filesRelValTTbarPileUpMINIAODSIM
process.source.fileNames = cms.untracked.vstring( '/store/data/Run2017E/DoubleEG/MINIAOD/31Mar2018-v1/00000/5A8BC921-F437-E811-81ED-1866DAEB3370.root')

Another job that hits it can be obtained by customizing and running RecoBTag/TensorFlow/test/test_deep_doublex_cfg.py, as explained in #27238 (comment)

This is put to the attention of the jet reco contacts: please @rappoccio @knash @gkasieczka have a look and report here about your findings


perrotta · 2019-06-27T10:28:13Z

assign reconstruction

cmsbuild · 2019-06-27T10:28:19Z

New categories assigned: reconstruction

@slava77,@perrotta you have been requested to review this Pull request/Issue and eventually sign? Thanks

cmsbuild · 2019-06-27T10:28:22Z

A new Issue was created by @perrotta .

@davidlange6, @Dr15Jones, @smuzaffar, @fabiocos, @kpedro88 can you please review it and eventually sign/assign? Thanks.

cms-bot commands are listed here

slava77 · 2019-06-28T10:57:49Z

in which jet collection are these jets and does the problem come from miniAOD or from AOD?

andrzejnovak · 2019-06-28T12:32:11Z

> in which jet collection are these jets and does the problem come from miniAOD or from AOD?

https://github.com/cms-sw/cmssw/blob/master/RecoBTag/TensorFlow/test/test_deep_doublex_cfg.py#L48

perrotta · 2019-06-28T12:55:40Z

Curiously enough, in the same miniAOD input file there are also a few CaloJets with about 80/90 TeV energy:

slava77 · 2019-07-04T13:38:41Z

> in which jet collection are these jets and does the problem come from miniAOD or from AOD?
>
> https://github.com/cms-sw/cmssw/blob/master/RecoBTag/TensorFlow/test/test_deep_doublex_cfg.py#L48

So, it's slimmedJetsAK8 (IIRC, this is based on PUPPI).
The question is whether this comes from AOD or first appears in PUPPI in miniAOD.

rappoccio · 2019-07-29T14:02:11Z

I'm sorry for the delay here, I've been traveling. This is coming from AOD (and is present in CaloJets). If I look at root://cmsxrootd.fnal.gov//store/data/Run2017E/DoubleEG/AOD/17Nov2017-v1/40000/74C22A1F-95D3-E711-AB20-02163E019CB1.root I see 30 TeV CaloJets:

root [2] Events->Scan("recoCaloJets_ak4CaloJets__RECO.obj.m_state.p4Polar_.fCoordinates.fPt", "recoCaloJets_ak4CaloJets__RECO.obj.m_state.p4Polar_.fCoordinates.fPt > 1000");
***********************************
*    Row   * Instance * recoCaloJ *
***********************************
*       49 *        0 * 1117.0339 *
*       49 *        1 * 1061.7541 *
*      184 *        0 * 1076.6167 *
*      184 *        1 * 1022.2794 *
*      525 *        0 * 1086.7362 *
*      525 *        1 * 1024.4000 *
*     2948 *        0 * 1057.2645 *
*     3277 *        0 * 1518.5771 *
*     3277 *        1 * 1231.1153 *
*     4734 *        0 * 1017.6969 *
*     5070 *        0 * 1028.7641 *
*     5360 *        0 * 30736.918 *
*     6188 *        0 * 1008.8494 *
*     7427 *        0 * 1072.2818 *
*     7692 *        0 * 1325.9370 *
*     7692 *        1 * 1220.7102 *
*     7871 *        0 * 1308.4901 *
*     7871 *        1 * 1132.8198 *
*     9282 *        0 * 1005.0418 *
*    12626 *        0 * 1077.2320 *
*    12626 *        1 * 1035.2044 *
***********************************

This gets propagated to 80 TeV jets in PF:

root [3] Events->Scan("recoPFJets_pfJetsEI__RECO.obj.m_state.p4Polar_.fCoordinates.fPt", "recoPFJets_pfJetsEI__RECO.obj.m_state.p4Polar_.fCoordinates.fPt > 1000");
***********************************
*    Row   * Instance * recoPFJet *
***********************************
*       49 *        0 * 1108.9173 *
*       49 *        1 * 1106.2629 *
*      184 *        0 * 1075.6630 *
*      184 *        1 * 1039.2093 *
*      525 *        0 * 1107.8487 *
*      525 *        1 * 1092.2025 *
*     2948 *        0 * 1050.2926 *
*     3277 *        0 * 1506.9429 *
*     3277 *        1 * 1334.4985 *
*     4734 *        0 * 1146.8552 *
*     5066 *        0 * 1060.8508 *
*     5070 *        0 * 1041.5122 *
*     5360 *        0 * 81992.640 *
*     6188 *        0 * 1046.7060 *
*     6188 *        1 * 1006.3564 *
*     7427 *        0 * 1121.1865 *
*     7692 *        0 * 1312.9611 *
*     7692 *        1 * 1235.4965 *
*     7871 *        0 * 1311.7025 *
*     7871 *        1 * 1177.8438 *
*     9282 *        0 * 1022.7382 *
*    10833 *        0 * 1062.5575 *
*    12626 *        0 * 1068.4943 *
*    12626 *        1 * 1045.9259 *
***********************************

rappoccio · 2019-07-29T14:06:48Z

@perrotta @slava77 we're probably going to need HCAL experts to take a look at this.

slava77 · 2019-07-29T14:21:34Z

@rappoccio
do you happen to know how this 80TeV becomes a NaN downstream?

rappoccio · 2019-07-29T16:21:03Z

In MiniAOD the PFJets (both CHS and Puppi) correctly handle this as "80 TeV", and so do the soft dropped subjets.

It seems like the packing of the PFCandidates is the culprit, since the PF candidates at AOD level are fine but then in MINIAOD I get:

root [9] Events->Scan("patPackedCandidates_packedPFCandidates__PAT.obj.pt()", "patPackedCandidates_packedPFCandidates__PAT.obj.pt() > 100");
***********************************
*    Row   * Instance * patPacked *
***********************************
*        0 *     1614 *       inf *
***********************************
==> 1 selected entry

The "raw" packing is okay:

 Events->Scan("patPackedCandidates_packedPFCandidates__PAT.obj.packedPt_");
*        0 *     1614 *     31744 *

But the "pt()" function returns infinity.
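A quick cross-check of the two Scan results above, as a minimal Python sketch. It assumes packedPt_ holds a standard IEEE 754 binary16 bit pattern (the helper name `half_bits_to_float` is made up for illustration). Under that assumption 31744 is 0x7C00, i.e. all exponent bits set and a zero mantissa, which decodes to +inf: the stored 16-bit word looks like an ordinary number, but it is already the infinity encoding.

```python
import struct

def half_bits_to_float(bits: int) -> float:
    """Interpret a 16-bit integer as an IEEE 754 binary16 value."""
    return struct.unpack("<e", struct.pack("<H", bits))[0]

packed_pt = 31744  # packedPt_ value from the Scan output above

# Split the 16-bit word into its sign / exponent / mantissa fields.
sign = packed_pt >> 15
exponent = (packed_pt >> 10) & 0x1F
mantissa = packed_pt & 0x3FF

print(hex(packed_pt))                 # 0x7c00
print(sign, exponent, mantissa)       # 0 31 0
print(half_bits_to_float(packed_pt))  # inf
```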

rappoccio · 2019-07-29T16:26:51Z

@gpetruc Any ideas on how to fix this? I think this is the culprit line:

https://github.com/cms-sw/cmssw/blob/master/DataFormats/PatCandidates/src/PackedCandidate.cc#L14

packedPt_ = MiniFloatConverter::float32to16(p4_.load()->Pt());

slava77 · 2019-07-29T16:51:58Z

we could probably truncate this to some large number.
MiniFloatConverter::max() returns 65504.f. A truncation seems practical.
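For reference, the 65504 figure can be reproduced from the binary16 layout alone. This is only a sketch in Python, assuming the minifloat follows standard half precision (5 exponent bits, 10 explicit mantissa bits); the constant names are illustrative, not CMSSW names.

```python
import struct

# binary16 layout: largest unbiased exponent is 15, 10 explicit mantissa bits.
MAX_EXPONENT = 15
MANTISSA_BITS = 10

# Largest finite value: mantissa all ones, i.e. (2 - 2^-10) * 2^15.
half_max = (2.0 - 2.0 ** -MANTISSA_BITS) * 2.0 ** MAX_EXPONENT
bits = struct.unpack("<H", struct.pack("<e", half_max))[0]

print(half_max)   # 65504.0
print(hex(bits))  # 0x7bff
```

Anything that rounds above this value has no finite 16-bit representation, which is why an 80 TeV pt (in GeV) cannot survive the packing unchanged.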

perrotta · 2019-09-13T12:49:53Z

@rappoccio : is there any progress with solving this issue?

rappoccio · 2019-09-13T13:03:40Z

I wasn't sure who was responsible, sorry. I can do it if there are no others ;)

rappoccio · 2019-09-13T13:20:05Z

Actually the truncation is already written, right?

https://cmssdt.cern.ch/lxr/source/DataFormats/Math/interface/libminifloat.h#0022

So we can change float32to16 to float32to16crop?

perrotta · 2019-09-13T13:44:26Z

Thank you Sal!

> Actually the truncation is already written, right?
>
> https://cmssdt.cern.ch/lxr/source/DataFormats/Math/interface/libminifloat.h#0022
>
> So we can change float32to16 to float32to16crop?

One should truncate only if limits are hit.

What about something like:

  // explicit <float> so that std::min compiles even if Pt() returns a double
  float unpackedPt = std::min<float>(p4_.load()->Pt(), MiniFloatConverter::max());
  packedPt_ = MiniFloatConverter::float32to16(unpackedPt);
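The effect of clamping before packing can be tried out in plain Python. This is a rough emulation, not the CMSSW code: `to_half_bits` is a hypothetical stand-in for float32to16 that handles non-negative inputs only, and it maps out-of-range values to the +inf bit pattern by hand because Python's `struct` raises `OverflowError` instead of overflowing to infinity.

```python
import struct

HALF_MAX = 65504.0      # value quoted above for MiniFloatConverter::max()
HALF_INF_BITS = 0x7C00  # binary16 bit pattern of +inf

def to_half_bits(x: float) -> int:
    """Hypothetical stand-in for float32to16 (non-negative x only)."""
    try:
        return struct.unpack("<H", struct.pack("<e", x))[0]
    except OverflowError:
        # struct refuses out-of-range values; emulate overflow to +inf.
        return HALF_INF_BITS

def from_half_bits(bits: int) -> float:
    """Widen a binary16 bit pattern back to a Python float."""
    return struct.unpack("<e", struct.pack("<H", bits))[0]

def pack_pt(pt: float, clamp: bool) -> int:
    """Pack pt to 16 bits, optionally clamping to the largest finite value."""
    if clamp:
        pt = min(pt, HALF_MAX)
    return to_half_bits(pt)

crazy_pt = 81992.640  # the PF jet pt from the Scan above, in GeV

print(from_half_bits(pack_pt(crazy_pt, clamp=False)))  # inf
print(from_half_bits(pack_pt(crazy_pt, clamp=True)))   # 65504.0
```

Values below the limit go through `min` unchanged, so the clamp only acts when the limit is hit.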

rappoccio · 2019-09-13T13:47:32Z

Sure, that also works.

In response to cms-sw#27374, truncate the pt of PackedCandidate when the 32-bit value is beyond the 16-bit maximum (such values are represented by the "minifloat" as inf, as required by the standard; see [here](ftp://ftp.fox-toolkit.org/pub/fasthalffloatconversion.pdf)). This was causing problems downstream when the 16-bit "infinity" was widened back to 32 bits.
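The thread does not spell out where the infinite pt turns into the NaNs seen in the subjets. One plausible route, shown here purely as an illustration and under that assumption, is ordinary floating-point arithmetic on the unpacked candidate: once pt is inf, the Cartesian components and the energy are inf as well, and any difference of two infinities is NaN. The function name and the eta, phi and mass values below are made up.

```python
import math

def polar_to_cartesian(pt: float, eta: float, phi: float):
    """Convert (pt, eta, phi) to (px, py, pz)."""
    return pt * math.cos(phi), pt * math.sin(phi), pt * math.sinh(eta)

# Illustrative candidate: infinite pt, arbitrary direction, pion mass in GeV.
pt, eta, phi, mass = math.inf, 1.2, 0.5, 0.13957

px, py, pz = polar_to_cartesian(pt, eta, phi)
p2 = px * px + py * py + pz * pz
energy = math.sqrt(p2 + mass * mass)

# Recomputing the invariant mass squared subtracts inf from inf.
mass2 = energy * energy - p2

print(px, py, pz)  # inf inf inf
print(mass2)       # nan
```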

rappoccio · 2019-09-13T13:58:40Z

PR #27988

slava77 · 2019-12-03T22:45:42Z

@perrotta
it looks like it's time to close this

perrotta · 2020-01-13T15:10:36Z

+1

Fixed by Backport of #27988 (Truncate pt of PackedCandidate) #28043

cmsbuild · 2020-01-13T15:10:58Z

This issue is fully signed and ready to be closed.

cmsbuild added pending-signatures reconstruction-pending labels Jun 27, 2019

This was referenced Jun 28, 2019

Fix BoostedDoubleSVProd crashing with nan/inf daughters #27238

Merged

[Backport] Fix BoostedDoubleSVProd crashing with nan/inf daughters #27328

Merged

[Backport] Fix BoostedDoubleSVProd crashing with nan/inf daughters #27329

Merged

rappoccio mentioned this issue Sep 13, 2019

Truncate pt of PackedCandidate #27988

Merged

rappoccio mentioned this issue Sep 20, 2019

Backport of #27988 (Truncate pt of PackedCandidate) #28043

Merged

perrotta closed this as completed Jan 13, 2020

cmsbuild added fully-signed reconstruction-approved and removed pending-signatures reconstruction-pending labels Jan 13, 2020

slava77 mentioned this issue Oct 20, 2020

Tracks with ill-defined momenta within pat::PackedCands #31583

Closed
