Ranking statistic for live singles #4689

GarethCabournDavies · 2024-04-08T16:01:51Z

This PR does a couple of things:

Allows the ranking statistic mechanism to be used for single-detector events in pycbc-live (as is done for coincs in [pycbc live] Allowing the use of template fits in the pycbc live ranking statistic #4527)
Allow the live single significance fits to use ranking statistic rather than sngl-ranking
Fixes rank_stat_single for PhaseTDStatistic (which didn't work, but was unused before this)

…an sngl-ranking

…te with case where no triggers are found

GarethCabournDavies

Minor explanation comments

bin/live/pycbc_live_combine_single_significance_fits

bin/live/pycbc_live_plot_combined_single_significance_fits

GarethCabournDavies · 2024-04-10T12:05:39Z

examples/live/make_singles_significance_fits.py

@@ -6,7 +6,7 @@
 import h5py
 import numpy as np

-f = h5py.File('single_trigger_fits.hdf','w')
+f = h5py.File('single_significance_fits.hdf','w')


Renaming to clarify the two types of single fits

pycbc/events/single.py

GarethCabournDavies · 2024-04-10T12:09:50Z

pycbc/events/stat.py

@@ -682,7 +682,7 @@ def rank_stat_single(self, single_info,
        numpy.ndarray
            The array of single detector statistics
        """
-        return single_info[1]
+        return single_info[1]['snglstat']


This allows the phasetd rank_stat_single to actually work!

GarethCabournDavies · 2024-04-11T11:01:48Z

BTW - I requested reviews from both Arthur and Tito as I am aware that Arthur has the expertise in adding the statistic objects to Live, and Tito understands more of the significance fitting, so I don't expect you both to be able to review all of it

examples/live/make_fit_coeffs.py

GarethCabournDavies · 2024-04-11T11:05:34Z

pycbc/events/single.py

@@ -158,43 +198,62 @@ def check(self, trigs, data_reader):
        # Apply cuts to trigs before clustering
        # Cut on snr so that triggers which could not reach newsnr
        # threshold do not have newsnr calculated
+        if 'psd_var_val' in trigs:


See here for the psdvar conversions

pycbc/events/single.py

pycbc/events/stat.py

ArthurTolley

Just a single question from me but happy to approve. Glad to see the fits being included in the live example too!

ArthurTolley · 2024-04-18T11:14:33Z

pycbc/events/stat.py

@@ -711,8 +711,8 @@ def coinc_lim_for_thresh(self, sngls_list, thresh, limifo,
        if not self.has_hist:
            self.get_hist()

-        lim_stat = [b['snglstat'] for a, b in sngls_list if a == limifo][0]
-        s1 = thresh ** 2. - lim_stat ** 2.
+        fixed_stat = [b['snglstat'] for a, b in sngls_list if a != limifo][0]


Is this a change in logic (going from == to !=) or was this not working properly before?

This wasn't working properly before - the statistic from the limifo was being used instead of the fixed IFO network

Looking again, I don't think this will actually work, so I will check it again

It was broken, just in a different way.

The basic problem was this - the sngls_list passed to coinc_lim_for_thresh did not contain the limifo singles (here). As a result, the list comprehension was empty, and the [0] at the end of the line broke.

The problem with my fix was that by using only the first entry, this works for 2-ifo, but not for 3-ifo (but doesn't actually error).

As the list of sngls passed to coinc_lim_for_thresh doesn't contain the limifo, we can remove this check, but it should be a sum of squares rather than grabbing the zeroth entry.

GarethCabournDavies · 2024-04-24T08:23:07Z

Note: new commits do not affect the part of the code which Arthur has reviewed

GarethCabournDavies · 2024-05-20T08:23:56Z

poke @titodalcanton to look at the supervision scripts here

bin/live/pycbc_live_combine_single_significance_fits

bin/live/pycbc_live_plot_single_significance_fits

bin/live/pycbc_live_single_significance_fits

pycbc/events/single.py

titodalcanton · 2024-06-06T07:51:03Z

pycbc/events/single.py

+                    (trig_chisq <
                     self.thresholds['reduced_chisq']) & \
-                    (trigs['snr'] >
+                    (trig_snr >


This seems to effectively change the meaning of --reduced-chisq-threshold and --newsnr-threshold in case the PSD var statistic is used, i.e. "reduced chisq" and "newsnr" will no longer mean what people historically think they mean. Should we rename the two options then?

I think I'd like to change these to use the template-cuts and trigger-cuts module eventually, but for now I have updated single-newsnr-threshold to be call single-ranking-threshold, and and added a note to the help on the chisq threshold

Co-authored-by: Tito Dal Canton <tito.dalcanton@ijclab.in2p3.fr>

…t I am

bin/live/pycbc_live_combine_single_significance_fits

Co-authored-by: Tito Dal Canton <tito.dalcanton@ijclab.in2p3.fr>

titodalcanton

I think we can merge and start proper tests on this now, provided the CI passes.

* Allow the live single trigger fits to use ranking statistic rather than sngl-ranking * inbin is no longer all the events above threshold, plotting to indicate with case where no triggers are found * deal better with cases where there are no triggers * Use ranking statistic for single-detector events * Fix some errors * fix some statistics so they can produce single-detector events * Some codeclimate suggestions * get fit coeff files into CI, set a maximum IFAR for singles * alter the CI example run * Codeclimate suggestions * Line too long * minor tweaks * Used shared code * Fix broken fixing * missed that this needs the module * typo * calculate plotmax earlier and use it to decide the histogram bins * Update bin/live/pycbc_live_plot_single_significance_fits Co-authored-by: Tito Dal Canton <tito.dalcanton@ijclab.in2p3.fr> * TDC comments * Update threshold naming and description * update argument in example * Please do not look at the previous commit and see how much of an idiot I am * Update bin/live/pycbc_live_combine_single_significance_fits Co-authored-by: Tito Dal Canton <tito.dalcanton@ijclab.in2p3.fr> --------- Co-authored-by: Tito Dal Canton <tito.dalcanton@ijclab.in2p3.fr>

GarethCabournDavies added the low latency label Apr 8, 2024

GarethCabournDavies requested review from titodalcanton and ArthurTolley April 8, 2024 16:01

GarethCabournDavies added 10 commits April 10, 2024 03:49

Allow the live single trigger fits to use ranking statistic rather th…

312b685

…an sngl-ranking

inbin is no longer all the events above threshold, plotting to indica…

1af93ff

…te with case where no triggers are found

deal better with cases where there are no triggers

cb8b685

Use ranking statistic for single-detector events

d1d39a3

Fix some errors

15c0c9e

fix some statistics so they can produce single-detector events

0984664

Some codeclimate suggestions

a12d99f

get fit coeff files into CI, set a maximum IFAR for singles

ef29697

alter the CI example run

2de35b6

Codeclimate suggestions

4d11273

GarethCabournDavies force-pushed the ranking_statistic_in_live_fits branch from c1dd2ee to 4d11273 Compare April 10, 2024 10:49

GarethCabournDavies added 3 commits April 10, 2024 03:59

Line too long

0b7c91a

minor tweaks

0e9e065

Used shared code

5500809

GarethCabournDavies commented Apr 10, 2024

View reviewed changes

GarethCabournDavies commented Apr 11, 2024

View reviewed changes

GarethCabournDavies mentioned this pull request Apr 11, 2024

Add mechanism for re-loading the statistic files GarethCabournDavies/pycbc#5

Closed

ArthurTolley approved these changes Apr 18, 2024

View reviewed changes

GarethCabournDavies added 4 commits April 18, 2024 04:51

Fix broken fixing

914b00c

missed that this needs the module

698b4e4

typo

1601071

calculate plotmax earlier and use it to decide the histogram bins

f2124c3