Add different options to compute stat_array on FluxPointsDatasets #5135

QRemy · 2024-02-29T16:24:19Z

Add different options to compute stat_array on FluxPointsDatasets :

chi2 : etimate from chi2 statistics.
profile : estimate from interpolation of the likelihood profile.
distrib : estimate from probability distributions,
assumes that flux points correspond to asymmetric gaussians and upper limits complemantary error functions.

Default is chi2, in that case upper limits are ignored and the mean of asymetrics error is used.
The distrib case provides an approximation if the profile is not available
which allows to take into accounts upper limits and asymetrics errors.

Updated /tutorials/analysis-1d/spectral_analysis.py to present the different cases:

Without ul the results are the same:

codecov · 2024-03-02T10:59:53Z

Codecov Report

Attention: Patch coverage is 97.36842% with 3 lines in your changes are missing coverage. Please review.

Project coverage is 94.46%. Comparing base (a048535) to head (a55f23c).
Report is 79 commits behind head on main.

Files	Patch %	Lines
gammapy/datasets/flux_points.py	97.46%	2 Missing ⚠️
gammapy/estimators/points/tests/test_lightcurve.py	95.45%	1 Missing ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##             main    #5135      +/-   ##
==========================================
+ Coverage   94.25%   94.46%   +0.20%     
==========================================
  Files         234      235       +1     
  Lines       35226    35493     +267     
==========================================
+ Hits        33204    33528     +324     
+ Misses       2022     1965      -57

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

AtreyeeS

Thanks @QRemy ! The profile is an elegant solution for considering upper limits during the fit!

I don't understand the implementation of the distrib option. Maybe you can add some information in gammapy.stat docs page?

Otherwise, only minor comments inline

examples/tutorials/analysis-1d/spectral_analysis.py

gammapy/datasets/flux_points.py

QRemy · 2024-03-06T16:11:44Z

Thanks @AtreyeeS I implemented most of the comments.

I don't understand the implementation of the distrib option. Maybe you can add some information in gammapy.stat docs page?

Assuming gaussian errors the likelihood is given by the probability density function of the normal distribution.
For the upper limit case it is necessary to marginalize over the unknown measurement so we integrate the normal distribution up to the upper limit value which gives the complementary error function.

see eq. C7 from https://iopscience.iop.org/article/10.1088/0004-637X/773/2/168/pdf
(in eq C6 they are doing something more complicated as they also marginalised over the unknown true value)

adonath · 2024-03-06T16:30:06Z

Thanks @QRemy, the statistical approach to handle the ULs looks reasonable to me. I was wonderign whether we can maybe find a more descriptive name. I think even "chi2-marginalize-ul" would be better. We should definitely give the paper reference in addition.

AtreyeeS

Thank you @QRemy ! Elegant indeed.
Maybe you can add a little info the documentation (what you wrote in the comment, and the paper reference).
And add stat_type option in the example here

gammapy/examples/tutorials/analysis-time/light_curve_simulation.py

Line 295 in fdd05df

dataset_fp.models = model

AtreyeeS

The distrib option is often returning a nan for some bins, leading to the total stat becoming nan and a failure of the minimisation. Any idea why?

gammapy/datasets/flux_points.py

gammapy/datasets/tests/test_flux_points.py

gammapy/datasets/flux_points.py

registerrier

Thanks @QRemy ! I have left some comments inline.

My main comment is that you could compute most of the necessary arrays in advance. I suppose that for large SEDs this might be much more efficient.

examples/tutorials/analysis-1d/spectral_analysis.py

gammapy/datasets/flux_points.py

examples/tutorials/analysis-1d/spectral_analysis.py

bkhelifi

Thanks @QRemy.. I have some questions for you.

gammapy/utils/interpolation.py

gammapy/datasets/tests/test_flux_points.py

gammapy/estimators/points/tests/test_lightcurve.py

bkhelifi

Thanks @QRemy . All good for me;)

registerrier

Thanks @QRemy . This looks good to me.

I have left a simple comment regarding not providing the dictionary of stat functions but simply their keys.

Otherwise, this would be much cleaner with a stat functions being defined as external objects. This can be a further PR though.

gammapy/datasets/flux_points.py

Signed-off-by: Quentin Remy <quentin.remy@mpi-hd.mpg.de>

Co-authored-by: Atreyee Sinha <asinha@ucm.es>

Signed-off-by: Quentin Remy <quentin.remy@mpi-hd.mpg.de>

Co-authored-by: Régis Terrier <regis.terrier@m4x.org>

Signed-off-by: Quentin Remy <quentin.remy@mpi-hd.mpg.de>

Co-authored-by: Bruno Khélifi <khelifi@in2p3.fr>

Signed-off-by: Quentin Remy <quentin.remy@mpi-hd.mpg.de>

QRemy added the feature label Feb 29, 2024

QRemy requested review from AtreyeeS and registerrier March 2, 2024 10:58

QRemy assigned registerrier Mar 2, 2024

QRemy added this to the 1.3 milestone Mar 2, 2024

QRemy marked this pull request as ready for review March 2, 2024 10:59

AtreyeeS requested changes Mar 6, 2024

View reviewed changes

QRemy commented Mar 6, 2024

View reviewed changes

gammapy/datasets/flux_points.py Outdated Show resolved Hide resolved

QRemy commented Mar 6, 2024

View reviewed changes

gammapy/datasets/flux_points.py Outdated Show resolved Hide resolved

AtreyeeS previously approved these changes Mar 6, 2024

View reviewed changes

QRemy dismissed AtreyeeS’s stale review via d7f4e9d March 7, 2024 12:29

AtreyeeS requested changes Mar 8, 2024

View reviewed changes

gammapy/datasets/flux_points.py Show resolved Hide resolved

gammapy/datasets/flux_points.py Show resolved Hide resolved

gammapy/datasets/tests/test_flux_points.py Show resolved Hide resolved

gammapy/datasets/flux_points.py Outdated Show resolved Hide resolved

registerrier reviewed Mar 8, 2024

View reviewed changes

QRemy commented Mar 8, 2024

View reviewed changes

gammapy/datasets/flux_points.py Outdated Show resolved Hide resolved

QRemy force-pushed the fp_from_sampling branch from a5dd99e to 54c5550 Compare May 3, 2024 13:07

bkhelifi reviewed May 3, 2024

View reviewed changes

examples/tutorials/analysis-1d/spectral_analysis.py Outdated Show resolved Hide resolved

bkhelifi reviewed May 3, 2024

View reviewed changes

examples/tutorials/analysis-1d/spectral_analysis.py Outdated Show resolved Hide resolved

QRemy force-pushed the fp_from_sampling branch from d210c98 to a2e5e9d Compare May 3, 2024 15:38

bkhelifi reviewed May 3, 2024

View reviewed changes

gammapy/utils/interpolation.py Show resolved Hide resolved

gammapy/datasets/tests/test_flux_points.py Show resolved Hide resolved

gammapy/estimators/points/tests/test_lightcurve.py Show resolved Hide resolved

bkhelifi previously approved these changes May 6, 2024

View reviewed changes

registerrier previously approved these changes May 16, 2024

View reviewed changes

QRemy commented May 17, 2024

View reviewed changes

gammapy/datasets/flux_points.py Outdated Show resolved Hide resolved

QRemy dismissed stale reviews from registerrier and bkhelifi via 8fe9907 May 17, 2024 09:06

QRemy force-pushed the fp_from_sampling branch from 9badffb to 8fe9907 Compare May 17, 2024 09:06

stat using sampled flux points

bd24ceb

Signed-off-by: Quentin Remy <quentin.remy@mpi-hd.mpg.de>

QRemy and others added 23 commits May 17, 2024 11:07

fix

cd33f19

Signed-off-by: Quentin Remy <quentin.remy@mpi-hd.mpg.de>

random_state for choice

4c2bc9f

Signed-off-by: Quentin Remy <quentin.remy@mpi-hd.mpg.de>

extrapolate option on interpolate profile

44142d2

Signed-off-by: Quentin Remy <quentin.remy@mpi-hd.mpg.de>

add stat_type=profile case

8d6d933

Signed-off-by: Quentin Remy <quentin.remy@mpi-hd.mpg.de>

draft example in tutorial

542f3b8

Signed-off-by: Quentin Remy <quentin.remy@mpi-hd.mpg.de>

restructure notebook

40307cb

Signed-off-by: Quentin Remy <quentin.remy@mpi-hd.mpg.de>

simplify

4e56cd7

Signed-off-by: Quentin Remy <quentin.remy@mpi-hd.mpg.de>

Apply suggestions from code review

c611841

Co-authored-by: Atreyee Sinha <asinha@ucm.es>

Update gammapy/datasets/flux_points.py

1558aa0

implement review suggestions and fix

e25d155

Signed-off-by: Quentin Remy <quentin.remy@mpi-hd.mpg.de>

add more tests

b7f2345

Signed-off-by: Quentin Remy <quentin.remy@mpi-hd.mpg.de>

mask_valid as property

12e82bc

Signed-off-by: Quentin Remy <quentin.remy@mpi-hd.mpg.de>

plot limits

b5f6091

Signed-off-by: Quentin Remy <quentin.remy@mpi-hd.mpg.de>

improve doc

8f8baf1

Signed-off-by: Quentin Remy <quentin.remy@mpi-hd.mpg.de>

Apply suggestions from code review

219a88a

Co-authored-by: Régis Terrier <regis.terrier@m4x.org>

Update gammapy/datasets/flux_points.py

73422f4

fix shape

4718664

Signed-off-by: Quentin Remy <quentin.remy@mpi-hd.mpg.de>

avoid loop in _stat_array_distrib

0cb4373

Signed-off-by: Quentin Remy <quentin.remy@mpi-hd.mpg.de>

fix mask shape

dbca32a

Signed-off-by: Quentin Remy <quentin.remy@mpi-hd.mpg.de>

fix temporal case if reference model has no temporal model

bb9eaba

Signed-off-by: Quentin Remy <quentin.remy@mpi-hd.mpg.de>

Apply suggestions from code review

f17b0ca

Co-authored-by: Bruno Khélifi <khelifi@in2p3.fr>

docstring

ab0a68d

Signed-off-by: Quentin Remy <quentin.remy@mpi-hd.mpg.de>

implement suggestion

7a2368d

Signed-off-by: Quentin Remy <quentin.remy@mpi-hd.mpg.de>

QRemy force-pushed the fp_from_sampling branch from 8fe9907 to 7a2368d Compare May 17, 2024 09:07

fix

a55f23c

Signed-off-by: Quentin Remy <quentin.remy@mpi-hd.mpg.de>

QRemy merged commit 813ec3e into gammapy:main May 17, 2024
16 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add different options to compute stat_array on FluxPointsDatasets #5135

Add different options to compute stat_array on FluxPointsDatasets #5135

QRemy commented Feb 29, 2024 •

edited

Loading

codecov bot commented Mar 2, 2024 •

edited

Loading

AtreyeeS left a comment

QRemy commented Mar 6, 2024

adonath commented Mar 6, 2024

AtreyeeS left a comment

AtreyeeS left a comment

registerrier left a comment

bkhelifi left a comment

bkhelifi left a comment

registerrier left a comment

Add different options to compute stat_array on FluxPointsDatasets #5135

Add different options to compute stat_array on FluxPointsDatasets #5135

Conversation

QRemy commented Feb 29, 2024 • edited Loading

codecov bot commented Mar 2, 2024 • edited Loading

Codecov Report

AtreyeeS left a comment

Choose a reason for hiding this comment

QRemy commented Mar 6, 2024

adonath commented Mar 6, 2024

AtreyeeS left a comment

Choose a reason for hiding this comment

AtreyeeS left a comment

Choose a reason for hiding this comment

registerrier left a comment

Choose a reason for hiding this comment

bkhelifi left a comment

Choose a reason for hiding this comment

bkhelifi left a comment

Choose a reason for hiding this comment

registerrier left a comment

Choose a reason for hiding this comment

QRemy commented Feb 29, 2024 •

edited

Loading

codecov bot commented Mar 2, 2024 •

edited

Loading