Fit class: what to do about changing the stats class? #2063

DougBurke · 2024-06-17T16:30:35Z

This has come out of the discussion at #2054 (comment)

It could be considered a bug, a design decision, a new enhancement, or ...

The issue is that when directly using the Fit class you can change the statistic in use, but an internally-controlled object doesn't know about this change:

>>> from sherpa.data import Data1D
>>> from sherpa.models.basic import Scale1D
>>> from sherpa.stats import Cash
>>> d = Data1D('', [1, 2, 3], [3, 7, 2])
>>> m = Scale1D()
>>> from sherpa.fit import Fit
>>> f = Fit(d, m)
>>> f.fit()
<Fit results instance>
>>> f.stat.name, f._iterfit.stat.name
('chi2gehrels', 'chi2gehrels')

You don't need to have called the fit method to show this. The _iterfit attribute is a "hidden" part of Fit.

Now, I can change the stat attribute

>>> f.stat = Cash()
>>> f.stat.name, f._iterfit.stat.name
('cash', 'chi2gehrels')

However, note that _iterfit has not been updated. I believe that it should be updated too (ideally we'd be able to source the information from the same location but the current design makes that hard, as would - I think - making _iterfit.stat a read-only attribute to make sure users can't say f.stat = CStat(); f._iterstat.stat = Chi2DataVar();).

However, I am not 100% sure about this (I note that we only have two tests which would be affected by this behaviour).

I do not believe this is a problem for the UI layer since in the UI layer when we need a Fit object we create one on the fly, via the internal _get_fit method, so we never get into this situation where the stat field can get changed.

The text was updated successfully, but these errors were encountered:

hamogu · 2024-06-17T16:50:00Z

I think it's a design problem and should be fixed, if possible with low effort.
We need find way for _iterfit to retain a reference to the "parent" Fit object so that we can just call whatever the statistic in that object is.

The issue at hand is that if you *change* the stat method of the Fit structure then you will get a mis-match with the actual fitted statistic, which remains using the _iterfit.stat method (i.e. the original statistic). We only have two test cases that expose this difference, so take advantage of the new "write the fit results to StingIO" capability and show that the stat value has changed (i.e. all but the last row use the original statistic, in this case Chi2DataVar, so the stat value is ~ 8500, and only the last row uses the Cash or CStat value). This is intended as a regression test.

We can make sure that changing the Fit stat also changes the associated _iterfit.stat field, which seems like it should be the correct thing to do. There are only two tests where this is an issue, and it's unclear whether the changes are good, bad, or not significant, since a) the tests appear to be regression tests b) we don't know how the test data was created, so we don't know what the true values are Note that the parameter values do not appear to have changed "hugely", but I haven't looked into the results too deeply. The tests do show the change - in that the statistic value reported in the "fit(outfile=...)" option now remains consistent (i.e. matches the expected value, assuming our new interpretation of the stat field of the Fit object is correct [which it should be, I claim]).

We can just call the callback routne after updating the model values, rather than manually setting the thawed pars, calculating the stat, and writing out the values to a file. This is ony possible now that sherpa#2063 has been addressed.

DougBurke · 2024-06-17T20:23:38Z

I ended up with a "simple" fix - when you change the stat attribute of Fit then you also change the _iterfit.stat method. It doesn't catch all cases but is a minimal fix. Ideally we'd only create the _iterfit object when needed (ie make it throwaway) but there's design issues that make this hard and I don't really want to spend too much time here as this is nerd-sniping myself to the nth degree...

I made the changes to #2054

The issue at hand is that if you *change* the stat method of the Fit structure then you will get a mis-match with the actual fitted statistic, which remains using the _iterfit.stat method (i.e. the original statistic). We only have two test cases that expose this difference, so take advantage of the new "write the fit results to StingIO" capability and show that the stat value has changed (i.e. all but the last row use the original statistic, in this case Chi2DataVar, so the stat value is ~ 8500, and only the last row uses the Cash or CStat value). This is intended as a regression test.

We can make sure that changing the Fit stat also changes the associated _iterfit.stat field, which seems like it should be the correct thing to do. There are only two tests where this is an issue, and it's unclear whether the changes are good, bad, or not significant, since a) the tests appear to be regression tests b) we don't know how the test data was created, so we don't know what the true values are Note that the parameter values do not appear to have changed "hugely", but I haven't looked into the results too deeply. The tests do show the change - in that the statistic value reported in the "fit(outfile=...)" option now remains consistent (i.e. matches the expected value, assuming our new interpretation of the stat field of the Fit object is correct [which it should be, I claim]).

We can just call the callback routne after updating the model values, rather than manually setting the thawed pars, calculating the stat, and writing out the values to a file. This is ony possible now that sherpa#2063 has been addressed.

DougBurke added type:other area:code labels Jun 17, 2024

DougBurke mentioned this issue Jun 17, 2024

Allow a file handle or Path object to be sent to the outfile parameter of fit #2054

Merged

DougBurke linked a pull request Jun 17, 2024 that will close this issue

Allow a file handle or Path object to be sent to the outfile parameter of fit #2054

Merged

wmclaugh closed this as completed in #2054 Jun 27, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fit class: what to do about changing the stats class? #2063

Fit class: what to do about changing the stats class? #2063

DougBurke commented Jun 17, 2024

hamogu commented Jun 17, 2024

DougBurke commented Jun 17, 2024

Fit class: what to do about changing the stats class? #2063

Fit class: what to do about changing the stats class? #2063

Comments

DougBurke commented Jun 17, 2024

hamogu commented Jun 17, 2024

DougBurke commented Jun 17, 2024