Fixing peaklet baseline bias #486

WenzDaniel · 2021-07-14T05:10:15Z

What is the problem / what does the code in this PR do
Our current treatment of the floating point part of our baseline leads to an area (and summed waveform) bias of several 10ish % in peaklets. The main reason for the positive bias is coming form the fact, that we are also adding the floating point part of the baseline in zero-padded regions. A more detailed note can be found here.

Can you briefly describe how it works?
The solution is rather simple: We make our sum_waveform hit dependent and only build peaklets over the regions in which we found hits before.

In terms of performance is the new algorithm ~ x1.5 slower compared to the old one.

This reverts commit 81fdc11.

JelleAalbers · 2021-07-25T07:37:40Z

Hi Daniel, Great work spotting this! I'm amazed such a large area bias could have slipped our validation tests on 1T data. CES peak positions below saturation territory agreed between pax and strax (e.g. xenon:xenonnt:analysis:strax_clustering_classification#s1_s2_events), but maybe that's driven mostly by the S2s? The 1PE peak also seemed about right in #261, at least if the gain calibration includes underamplified / sub-threshold hits.

With the 'zero-padded part' of the waveform, you mean the parts zeroed out by cut_outside_hits, right? There is also a part of the record data field beyond record['length'] (for the final record_i in a pulse), but that shouldn't do anything; no routine should even look at it.

Still, sorry.. I should have realized we can't zero an integer-valued waveform if the true zero level is a floating value. Ultimately I guess this is payback for not copying the record data into the hit datatype, as we did in pax, or as in your hitlets..

If the hit-dependent sum waveform somehow doesn't work out, an alternative might be to let cut_outside_hits assign not zero, but a special value way outside the dynamic range of the digitizer (e.g. define TRUE_ZERO = -32768), and have sum_waveform ignore samples with that value. To avoid reprocessing records, you could run the modified cut_outside_hits again after hitfinding in peaklets (it should be idempotent).

But that's just a random idea. I think your solution is to essentially merge cut_outside_hits with sum_waveform; that should certainly work too. It would make cut_outside_hits in PulseProcessing really only a disk space saver.

strax/processing/peak_building.py

This reverts commit 10ca471.

…trax into fix_peaklet_bias2

WenzDaniel · 2021-08-11T15:16:21Z

Okay I added Tianyu's comments. I run also all tests in straxen with XENONnT/straxen#601 and all tests pass. We should not merge this PR without merging XENONnT/straxen#601. Waiting for Jorans review now.

JoranAngevaare

Thanks Daniel, the changes look good!

Just a few minor comments from my side

strax/processing/peak_building.py

…trax into fix_peaklet_bias2

WenzDaniel and others added 13 commits July 13, 2021 10:59

Fixing peaklet baseline bias

b0ce5d9

Fix multi-record_i problem

81fdc11

Revert "Fix multi-record_i problem"

b5a937c

This reverts commit 81fdc11.

Fix record_i in multi-peaks

3a1cfac

Fixed bug of wrong area compared to wf

1dd3de4

Revert order for record_i and time check

8951217

Add found next start in case peak ends

2a76bdc

Changed beyond peak case for peak splitting within hit

9a8d53d

Merge branch 'master' into fix_peaklet_bias2

45f65de

Modified splitting test accordingly

db346aa

Extended tests according to change.

89a31c1

Rename todo to please codefactor...

7225799

Fix empty inputs

e08662e

WenzDaniel marked this pull request as ready for review July 22, 2021 16:02

WenzDaniel mentioned this pull request Jul 22, 2021

Fix peaklet area bias XENONnT/straxen#601

Merged

WenzDaniel marked this pull request as draft July 23, 2021 05:41

WenzDaniel closed this Jul 23, 2021

WenzDaniel and others added 11 commits July 23, 2021 08:05

Allow integration bounds beyond record

85df6cd

Make find_hit_integration_bounds non private.

e4c50e9

Unify return

3ab80c3

Remove return as things are modified inplace

6361656

Add test for hit integration bounds

b1904c7

Refactored summed waveform to include hit integration bounds.

29ce3e1

Updated splitting accordingly

feb88cb

forgot left_hit_i

5fd04f2

Minor fixes

130e896

Command in le/re bounds outside record

7ddab03

Fix peak area estimate

bfc08b4

WenzDaniel reopened this Jul 24, 2021

Merge branch 'master' into fix_peaklet_bias2

c64157a

WenzDaniel requested review from JoranAngevaare and zhut19 August 5, 2021 12:52

Merge branch 'master' into fix_peaklet_bias2

366304b

zhut19 reviewed Aug 10, 2021

View reviewed changes

strax/processing/peak_building.py Show resolved Hide resolved

zhut19 reviewed Aug 10, 2021

View reviewed changes

strax/processing/peak_building.py Outdated Show resolved Hide resolved

zhut19 reviewed Aug 10, 2021

View reviewed changes

strax/processing/peak_building.py Outdated Show resolved Hide resolved

WenzDaniel and others added 9 commits August 11, 2021 16:08

Remove print statements

8979d8f

Change hit_waveform to buffer

7a10f02

Updated doc string

10ca471

Small fix

b7ef9e7

Revert "Updated doc string"

2e0189e

This reverts commit 10ca471.

Add docs again

a3ad937

Merge branch 'master' into fix_peaklet_bias2

f4a4524

Updated doc string of peak splitting

537f9c8

Merge branch 'fix_peaklet_bias2' of https://github.com/AxFoundation/s…

1fc700f

…trax into fix_peaklet_bias2

zhut19 approved these changes Aug 13, 2021

View reviewed changes

Merge branch 'master' into fix_peaklet_bias2

992b5e6

JoranAngevaare approved these changes Aug 20, 2021

View reviewed changes

JoranAngevaare reviewed Aug 20, 2021

View reviewed changes

strax/processing/peak_building.py Show resolved Hide resolved

WenzDaniel and others added 6 commits August 20, 2021 10:16

Update doc-string

f86e8db

Merge branch 'fix_peaklet_bias2' of https://github.com/AxFoundation/s…

b00a385

…trax into fix_peaklet_bias2

Refactored function changed doc string removed todo

1131914

Merge branch 'master' into fix_peaklet_bias2

ed1958f

Remove comment

0eb1a04

Merge branch 'fix_peaklet_bias2' of https://github.com/AxFoundation/s…

c114788

…trax into fix_peaklet_bias2

JoranAngevaare merged commit d77b241 into master Aug 25, 2021

JoranAngevaare deleted the fix_peaklet_bias2 branch August 25, 2021 06:31

DCichon mentioned this pull request Oct 30, 2021

Add capability for building summed waveform over channel subset #565

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fixing peaklet baseline bias #486

Fixing peaklet baseline bias #486

WenzDaniel commented Jul 14, 2021 •

edited

JelleAalbers commented Jul 25, 2021

WenzDaniel commented Aug 11, 2021

JoranAngevaare left a comment

Fixing peaklet baseline bias #486

Fixing peaklet baseline bias #486

Conversation

WenzDaniel commented Jul 14, 2021 • edited

JelleAalbers commented Jul 25, 2021

WenzDaniel commented Aug 11, 2021

JoranAngevaare left a comment

Choose a reason for hiding this comment

WenzDaniel commented Jul 14, 2021 •

edited