[HCAL] Sanitize Mahi HCAL local reconstruction pulse arrival time values #22394

jaylawhorn · 2018-02-28T18:41:53Z

The current version of Mahi can return a pulse arrival time that is NaN or inf because there is no check for division by zero. Additionally, we limit the pulse arrival time to +/- 12.5 ns from the nominal time, because a hit that the fit places in the in-time sample (bx=0) should report a time that belongs to that sample. This removes large tails in the timing distribution induced by deviations from the default pulse shape template and/or by pulses in bunch crossings where the fit does not allow pulses.

For example, some recHits from the default release:

root [5] HcalTree->Scan("ieta:iphi:depth:mahiE:mahiT")
************************************************************************
*    Row   *      ieta *      iphi *     depth *     mahiE *     mahiT *
************************************************************************
*        0 *        -1 *         1 *         1 *         0 *       inf *
*        1 *        -1 *         7 *         1 *         0 *     -9999 *
*        2 *        -1 *         8 *         1 * 0.0201498 * -2.192569 *
*        3 *        -1 *         9 *         1 * 0.5976437 * -0.054388 *
*        4 *        -1 *        14 *         1 * 0.2618228 * 0.0405726 *
*        5 *        -1 *        19 *         1 * 0.2631363 * -3.694539 *
*        6 *        -1 *        23 *         1 *         0 *     -9999 *
*        7 *        -1 *        24 *         1 *         0 *       inf *
*        8 *        -1 *        25 *         1 *         0 *       inf *
*        9 *        -1 *        28 *         1 *         0 *       inf *
*       10 *        -1 *        29 *         1 *         0 *     -9999 *

become

root [3] HcalTree->Scan("ieta:iphi:depth:mahiE:mahiT")
************************************************************************
*    Row   *      ieta *      iphi *     depth *     mahiE *     mahiT *
************************************************************************
*        0 *        -1 *         1 *         1 *         0 *     -9999 *
*        1 *        -1 *         7 *         1 *         0 *     -9999 *
*        2 *        -1 *         8 *         1 * 0.0201498 * -2.192569 *
*        3 *        -1 *         9 *         1 * 0.5976437 * -0.054388 *
*        4 *        -1 *        14 *         1 * 0.2618228 * 0.0405726 *
*        5 *        -1 *        19 *         1 * 0.2631363 * -3.694539 *
*        6 *        -1 *        23 *         1 *         0 *     -9999 *
*        7 *        -1 *        24 *         1 *         0 *     -9999 *
*        8 *        -1 *        25 *         1 *         0 *     -9999 *
*        9 *        -1 *        28 *         1 *         0 *     -9999 *
*       10 *        -1 *        29 *         1 *         0 *     -9999 *

where -9999 is the default value corresponding to zero in-time energy.
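The behaviour described above can be sketched in a few lines of C++. This is only an illustration, not the code in this PR: the function name, the constant names, and the assumption that the time is a ratio with the in-time amplitude in the denominator are all hypothetical. Only the -9999 default and the 12.5 ns limit come from the description.

```cpp
#include <algorithm>
#include <cmath>

// Hypothetical names for illustration; they are not taken from the CMSSW source.
constexpr float kDefaultTime = -9999.f;  // default value for zero in-time energy
constexpr float kTimeLimit = 12.5f;      // half-width of the allowed window, in ns

// Assumes the raw time is numerator / inTimeAmplitude.
// Returns kDefaultTime when the amplitude is not positive or the ratio is
// not finite; otherwise returns the ratio limited to [-kTimeLimit, +kTimeLimit].
float sanitizedArrivalTime(float numerator, float inTimeAmplitude) {
  // The negated comparison also rejects a NaN amplitude.
  if (!(inTimeAmplitude > 0.f)) return kDefaultTime;

  const float t = numerator / inTimeAmplitude;
  if (!std::isfinite(t)) return kDefaultTime;

  // Truncate to the 25 ns window centered on the nominal arrival time.
  return std::max(-kTimeLimit, std::min(t, kTimeLimit));
}
```

With this sketch, a hit with zero in-time amplitude reports -9999 instead of inf, and a raw value of 100 ns is reported as 12.5 ns.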

The following plot demonstrates the truncation of the ~1% tails in the pulse shape arrival time:

@mariadalfonso @deguio @jaehyeok


cmsbuild · 2018-02-28T18:42:11Z

The code-checks are being triggered in jenkins.

cmsbuild · 2018-02-28T18:44:27Z

+code-checks

Logs: https://cmssdt.cern.ch/SDT/code-checks/cms-sw-PR-22394/3619

cmsbuild · 2018-02-28T18:44:44Z

A new Pull Request was created by @jaylawhorn (Jay Lawhorn) for master.

It involves the following packages:

RecoLocalCalo/HcalRecAlgos

@perrotta, @cmsbuild, @slava77 can you please review it and eventually sign? Thanks.
@mariadalfonso, @argiro this is something you requested to watch as well.
@davidlange6, @slava77, @fabiocos you are the release manager for this.

cms-bot commands are listed here

perrotta · 2018-02-28T22:03:14Z

please test

cmsbuild · 2018-02-28T22:05:00Z

The tests are being triggered in jenkins.
https://cmssdt.cern.ch/jenkins/job/ib-any-integration/26390/console Started: 2018/02/28 23:05

cmsbuild · 2018-02-28T23:48:34Z

+1
Tested at: 2df4e2a
https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-22394/26390/summary.html

cmsbuild · 2018-02-28T23:48:41Z

Comparison job queued.

cmsbuild · 2018-03-01T01:34:03Z

Comparison is ready
https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-22394/26390/summary.html

Comparison Summary:

No significant changes to the logs found
Reco comparison results: 96 differences found in the comparisons
DQMHistoTests: Total files compared: 29
DQMHistoTests: Total histograms compared: 2479021
DQMHistoTests: Total failures: 288
DQMHistoTests: Total nulls: 0
DQMHistoTests: Total successes: 2478557
DQMHistoTests: Total skipped: 176
DQMHistoTests: Total Missing objects: 0
DQMHistoSizes: Histogram memory added: 0.940000000177 KiB( 21 files compared)
Checked 118 log files, 9 edm output root files, 29 DQM output files

jaylawhorn · 2018-03-01T12:39:03Z

The differences are mostly as expected, impacting Hcal RecHit timing and higher level object timing that uses Hcal RecHits. I see one set of differences involving edmErrorSummaryEntries that aren't immediately obvious to me, here:
https://cmssdt.cern.ch/SDT/jenkins-artifacts/baseLineComparisons/CMSSW_10_1_X_2018-02-28-1100+22394/25315/validateJR/all_OldVSNew_TTbarPUwf25202p0/

for example:

Can someone confirm that these behave as expected once we stop returning NaN for RecHit timing? Maybe @abdoulline @deguio @igv4321 ?

slava77 · 2018-03-01T14:05:34Z

On 3/1/18 4:39 AM, Jay Lawhorn wrote:

> The differences are mostly as expected, impacting Hcal RecHit timing and higher level object timing that uses Hcal RecHits. I see one set of differences involving edmErrorSummaryEntries that aren't immediately obvious to me, here:
> https://cmssdt.cern.ch/SDT/jenkins-artifacts/baseLineComparisons/CMSSW_10_1_X_2018-02-28-1100+22394/25315/validateJR/all_OldVSNew_TTbarPUwf25202p0/
>
> for example: image <https://user-images.githubusercontent.com/6333978/36845020-52478994-1d55-11e8-9987-7d69094bd5f9.png>

this is very likely due to a difference in (randomized) PU test reads unrelated to this PR. [hopefully we can get rid of these false-positive differences at some point soon]

> I'm not sure if someone can confirm these behave as expected when we stop returning NaN for RecHit timing? Maybe @abdoulline @deguio @igv4321 ?

It's obvious to me that NaNs should be removed. The decision to truncate at +/-12.5 ns is less obvious. Is the value still meaningful for out-of-time pulse fits? If so, perhaps some broader coverage should be preserved (+/-50 ns maybe).

jaylawhorn · 2018-03-01T14:20:14Z

@slava77 Thanks for the clarification!

So if the fit is putting the pulse in the in-time bunch crossing, however large the residuals, it seems misleading to me to return a "time" value that corresponds to an out-of-time pulse. (Also, the time value is less and less meaningful the farther away from the nominal it gets because it is based on the local derivative of the pulse at the nominal value.) If it was an out-of-time pulse, it would be assigned to an out-of-time bunch crossing, which we don't return.

On a longer time scale we would like to fix this pulse arrival time to be useful and reasonable without any hard boundaries, either by including the arrival time as an explicit parameter in the fit with a Gaussian constraint, or by adding more pulse shapes to the fit (up to the # of bunch crossings), which would reduce the residuals problem. However, for now, we would prefer not to return information that could be misinterpreted.
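As a rough illustration of the first option, a Gaussian constraint on the arrival time would typically enter the fit as a penalty term. The form below is a simplified sketch with uncorrelated sample uncertainties, not the Mahi objective function, and all symbols are notation assumed for this example: $y_i$ are the measured samples, $A_b$ the amplitude in bunch crossing $b$, $p_b$ the pulse template for that crossing, $\delta t$ the shift from the nominal arrival time, $\sigma_i$ the sample uncertainties, and $\sigma_t$ the width of the constraint.

```latex
\chi^2(A_b, \delta t) =
  \sum_{i} \frac{\bigl( y_i - \sum_{b} A_b \, p_b(t_i - \delta t) \bigr)^2}{\sigma_i^2}
  + \frac{\delta t^2}{\sigma_t^2}
```

The second term pulls $\delta t$ toward zero smoothly, so poorly constrained low-energy hits would stay near the nominal time without a hard cutoff.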

In any case, @jaehyeok sees that the large time values come from low-energy RecHits (https://indico.cern.ch/event/708228/contributions/2907551/attachments/1605858/2547938/20180223_Jae_HCAL_MAHI.pdf slide 5).

slava77 · 2018-03-01T14:51:25Z

On 3/1/18 6:20 AM, Jay Lawhorn wrote:

> @slava77 Thanks for the clarification!
>
> So if the fit is putting the pulse in the in-time bunch crossing, however large the residuals, it seems misleading to me to return a "time" value that corresponds to an out-of-time pulse. (Also, the time value is less and less meaningful the farther away from the nominal it gets because it is based on the local derivative of the pulse at the nominal value.) If it was an out-of-time pulse, it would be assigned to an out-of-time bunch crossing, which we don't return.

OK

> On a longer time scale we would like to fix this pulse arrival time to be useful and reasonable without any hard boundaries, either by including the arrival time as an explicit parameter in the fit with a Gaussian constraint, or by adding more pulse shapes to the fit (up to the # of bunch crossings) which would reduce the residuals problem. However, for now, we would prefer to not return information that could be mis-interpreted.

Perhaps on a longer time scale we can even save OOT hits above some threshold in a separate collection.

…

> @jaehyeok sees anyways that the large time values come from low energy RecHits (https://indico.cern.ch/event/708228/contributions/2907551/attachments/1605858/2547938/20180223_Jae_HCAL_MAHI.pdf slide 5).

slava77 · 2018-03-06T17:02:59Z

+1

for #22394 2df4e2a

implementation is in line with the description: avoid NaNs and truncate the reported time.
jenkins tests pass and comparisons with baseline show differences only in HCAL time-related variables.

E.g. 136.788

the truncation is evident

cmsbuild · 2018-03-06T17:03:17Z

This pull request is fully signed and it will be integrated in one of the next master IBs (tests are also fine). This pull request will now be reviewed by the release team before it's merged. @davidlange6, @slava77, @smuzaffar, @fabiocos (and backports should be raised in the release meeting by the corresponding L2)

fabiocos · 2018-03-07T09:03:50Z

+1

jaylawhorn added 2 commits February 27, 2018 17:05

add check to prevent division by zero

2d2cdcf

limit Mahi pulse arrival time to 25 ns window centered on nominal arr…

2df4e2a

…ival time

cmsbuild added this to the CMSSW_10_1_X milestone Feb 28, 2018

cmsbuild added code-checks-pending comparison-pending orp-pending pending-signatures reconstruction-pending tests-pending labels Feb 28, 2018

cmsbuild added code-checks-approved and removed code-checks-pending labels Feb 28, 2018

cmsbuild added tests-started and removed tests-pending labels Feb 28, 2018

cmsbuild added tests-approved and removed tests-started labels Feb 28, 2018

cmsbuild added comparison-available and removed comparison-pending labels Mar 1, 2018

cmsbuild added fully-signed reconstruction-approved and removed pending-signatures reconstruction-pending labels Mar 6, 2018

cmsbuild added orp-approved and removed orp-pending labels Mar 7, 2018

cmsbuild merged commit 2e2579f into cms-sw:master Mar 7, 2018
