
Add multiplicity cut #15

Closed
HealthyPear opened this issue Jul 30, 2020 · 13 comments
Labels
enhancement New feature or request

@HealthyPear
Member

Description:

This cut should be applied to DL2 data and should target the number of telescopes that were used to successfully reconstruct the event.
Both CTA-MARS and EventDisplay do this, but in different ways (see this document), so this should be a function which, depending on an argument read from the configuration file, applies the chosen method.

Notes:

Since we used protopipe.perf as the basis for the new DL3 generator pyirf, it is best to implement this feature directly there.
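As an illustration only, the cut described above could be sketched roughly as below. This is not the actual pyirf API: the function name `apply_multiplicity_cut`, the `method` values, and the event-table layout (a dict of equally long numpy arrays with a "multiplicity" column) are all hypothetical placeholders, and the real CTA-MARS/EventDisplay definitions would replace the simple global threshold shown here.

```python
import numpy as np

def apply_multiplicity_cut(events, method="global", min_multiplicity=4):
    # Hypothetical sketch: select events by telescope multiplicity.
    # "method" would be read from the configuration file and select
    # between the CTA-MARS-like and EventDisplay-like definitions;
    # only a simple global threshold is implemented here.
    if method == "global":
        mask = events["multiplicity"] >= min_multiplicity
    else:
        raise ValueError(f"unknown multiplicity-cut method: {method!r}")
    # apply the same boolean mask to every column of the event table
    return {name: column[mask] for name, column in events.items()}

# toy DL2-like table (made-up values for illustration)
events = {
    "multiplicity": np.array([2, 3, 4, 5, 6]),
    "energy_tev": np.array([0.1, 0.5, 1.0, 2.0, 5.0]),
}
selected = apply_multiplicity_cut(events, method="global", min_multiplicity=4)
```

With the toy table above, only the three events with multiplicity >= 4 survive the cut.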

@HealthyPear HealthyPear transferred this issue from cta-observatory/protopipe Sep 9, 2020
@HealthyPear HealthyPear added the enhancement New feature or request label Sep 9, 2020
@HealthyPear HealthyPear mentioned this issue Sep 9, 2020
6 tasks
@HealthyPear HealthyPear added this to To do in Next release via automation Sep 9, 2020
@HealthyPear HealthyPear added this to the 0.1.0 milestone Sep 9, 2020
@HealthyPear
Member Author

IMPORTANT

It seems that EventDisplay DL2 data (at least a gamma_onSource.S.3HB9-FD_ID0.eff-0 file) has the multiplicity cut already applied (i.e. no event with multiplicity less than 4 is present).

This means that we cannot effectively test this cut in pyirf with this version of the files, and we would need them to be produced with all multiplicities.

I tag @GernotMaier and @TarekHC here for reference.

@TarekHC

TarekHC commented Sep 18, 2020

@HealthyPear: Quick and very naive question: why do you need multiplicities less than 4 to test pyirf? If those multiplicities were not used to produce the public performance IRFs, then why do you need them?

In any case I guess we could eventually add them. @GernotMaier I understand 2-telescope events may be more problematic, but why are 3-telescope events not present in the files?

@HealthyPear
Member Author

HealthyPear commented Sep 18, 2020

@HealthyPear: Quick and very naive question: why do you need multiplicities less than 4 to test pyirf? If those multiplicities were not used to produce the public performance IRFs, then why do you need them?

This would be a good point if we were only trying to reproduce EventDisplay's current IRFs. Meanwhile, however, the pipelines' algorithms, arrays, etc. are evolving, so in general we need uncut DL2 data on which we can optimize quantities like the telescope multiplicity ourselves, since these (again, in general) depend on the science case or on the custom needs of an observational proposal.

So, to better answer your question: for the near-future steps (#4) we don't need it, but later we should be as general as possible.

@vuillaut
Member

If there are flat cuts applied a priori, it should be sufficient to know them and apply the same cuts for the comparison.

@HealthyPear
Member Author

If there are flat cuts applied a priori, it should be sufficient to know them and apply the same cuts for the comparison.

This is the problem (at least as I see it): we cannot apply it, because it has already been applied before the data enter pyirf.
Thus, we cannot check that we apply it correctly when we apply it to other, uncut data.

@vuillaut
Member

Sorry, I'll try to be more clear.
There are two things to achieve:

  1. get similar (ideally the same) IRFs with pyIRF as with EventDisplay, starting from the same event list.
    For that, you don't care (you don't even need to know) that some pre-selection cuts have been applied, because they have certainly been applied in both pipelines. That will validate pyIRF.
  2. get similar IRFs with (pyIRF + ctapipe) as with EventDisplay, starting from the same RAW data. That would validate the whole pipeline. In this case, we do indeed need to know whether there were pre-selection cuts. But we are not there yet imo.

Or I am missing something?

@HealthyPear
Member Author

Sorry, I'll try to be more clear.
There are two things to achieve:

1. get similar (ideally the same) IRFs with pyIRF as with EventDisplay, starting from the same event list.
   For that, you don't care (you don't even need to know) that some pre-selection cuts have been applied, because they have certainly been applied in both pipelines. That will validate pyIRF.

Yes

2. get similar IRFs with (pyIRF + ctapipe) as with EventDisplay, starting from the same RAW data. That would validate the whole pipeline. In this case, we do indeed need to know whether there were pre-selection cuts. But we are not there yet imo.

Yes, but since we are about to refactor/rewrite, it is better to remember that we need to check for (or even explicitly require) any pre-selection cuts.

@HealthyPear
Member Author

Sorry, it is likely that we are saying the same thing: these specific files are sufficient. I just wanted to report this, since this issue is about adding a cut that doesn't exist yet in the code.

@vuillaut
Member

Yes, but since we are about to refactor/rewrite, it is better to remember that we need to check for (or even explicitly require) any pre-selection cuts.

Ok, but this is not an issue for pyIRF imo; it is rather one for the pipeline producing the DL2 data.
There are other pre-selection cuts (e.g. cleaning) that are applied. If you want a pipeline-to-pipeline comparison, then you need to know and apply the exact same pre-selection cuts in both of them.

@TarekHC

TarekHC commented Sep 20, 2020

My comment was coming mainly from the bold, capitalized "IMPORTANT". I just wanted to understand why it is that important.

To test pyIRF by cross-checking the EventDisplay IRFs (which would be very relevant!), which is what I originally thought was the objective of producing these tables, I don't think adding those multiplicities is that important.

For other uses it may be, but I just wanted to understand it... So no pressure. Let's see what Gernot thinks.

@maxnoe
Member

maxnoe commented Sep 20, 2020

I also don't think this is relevant. If the problem is testing whether a multiplicity cut in pyirf works, we can just test a harder cut than what is present in the current files; all hell would have to break loose if the code broke for multiplicity > 2 but not for multiplicity > 4.
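This point can be illustrated with a tiny sketch (the multiplicity values and the helper `multiplicity_cut` are made up for illustration, not taken from the real files): since the existing files only contain events with multiplicity >= 4, a cut at a harder threshold still exercises the same selection logic.

```python
import numpy as np

# Toy multiplicities mimicking a file where the >= 4 cut was already
# applied upstream (made-up values).
multiplicity = np.array([4, 4, 5, 6, 7, 8])

def multiplicity_cut(mult, threshold):
    # keep only events seen by at least `threshold` telescopes
    return mult[mult >= threshold]

# A cut at the pre-applied threshold removes nothing...
assert len(multiplicity_cut(multiplicity, 4)) == len(multiplicity)
# ...while a harder cut still removes events, so the logic is exercised.
assert len(multiplicity_cut(multiplicity, 6)) == 3
```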

@GernotMaier
Collaborator

Being late here: I apply fixed multiplicity cuts in the analysis. This is non-ideal, as the multiplicity should obviously be one of the optimisation parameters, but I simply cannot change this at this point without putting a bit of work into some code changes.

But this also means that all the training of the g/h separators and the cut-optimisation steps are done after the application of the multiplicity cuts. I will prepare files with the other multiplicities, but the performance estimates will be different (as Tarek said: for smaller multiplicities, the precision is lower).

@HealthyPear
Member Author

Since this cut is as simple as doing e.g.

p["events"] = p["events"][p["events"]["multiplicity"] >= 4]

I will close this issue
