Emulated termination criteria #363

yallup · 2024-02-29T11:56:40Z

Description

Aim is to introduce a termination criteria, such that given a set of chains read in to anesthetic we can determine if the nested sampling runs satisfactorily terminated.

This is achieved by introducing a boolean is_terminated function, which calls the more informative critical_ratio. The aim of this is to return the ratio of something in the live points to the same something contained in the dead points. For first pass this is either the ratio of evidence or the ratio of KL divergences for these two point populations. The former then emulates the PC and MN termination criteria, so the default call of is_terminated is the exact polychord setup.

Checklist:

I have performed a self-review of my own code
My code is PEP8 compliant (flake8 anesthetic tests)
My code contains compliant docstrings (pydocstyle --convention=numpy anesthetic)
New and existing unit tests pass locally with my changes (python -m pytest)
I have added tests that prove my fix is effective or that my feature works
I have appropriately incremented the semantic version number in both README.rst and anesthetic/_version.py

yallup · 2024-02-29T12:01:12Z

I would suggest black as a pre-commit hook, apologies I tangled some opinionated code formatting up in this, the code style not being enforced as a pre-commit seems vulnerable to such things!

codecov · 2024-02-29T12:03:41Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 100.00%. Comparing base (80769c5) to head (5ef5cd9).

❗ Current head 5ef5cd9 differs from pull request most recent head 27f5ddb. Consider uploading reports for the commit 27f5ddb to get more accurate results

Additional details and impacted files

@@            Coverage Diff            @@
##            master      #363   +/-   ##
=========================================
  Coverage   100.00%   100.00%           
=========================================
  Files           36        34    -2     
  Lines         3032      2990   -42     
=========================================
- Hits          3032      2990   -42

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

AdamOrmondroyd · 2024-03-01T23:52:53Z

I would suggest black as a pre-commit hook, apologies I tangled some opinionated code formatting up in this, the code style not being enforced as a pre-commit seems vulnerable to such things!

There is an (optional) hook to check the formatting, but there isn't a formatter. Personally I've grown to like having to do the anesthetic formatting by hand as it tends to look nicer and sometimes it forces me to think again about an ugly bit of code

williamjameshandley

In the first instance, you need to revert the black adjustments for now -- the wholesale filechange makes it hard to review. Could you make an issue to discuss code formatters (e.g. switching to black).

…termination

williamjameshandley

Excellent. Thanks for taking charge of this @yallup -- I've wanted something like this in anesthetic for some time.

Apologies in advance -- this is going to be quite a picky review. I'd like to make sure that this will generalise to other stopping criteria, and I think the most straightforward way to get there is through a few adjacent iterations.

To make this obvious, we should implement one more stopping criteria. The simplest of these is one suggested to me by John Skilling last year, which is the same as the evidence-based one, but instead operating on the DKL (This is what John Skilling's in-house nested sampling algorithms tend to use). This can be thought of as measuring when the D_KL 'levels off'. John claims anecdotally that this is a more robust criterion.

With this in mind, I suggest a switch of 'Z' and 'D_KL' as strings to pick between criteria for now, with kwarg being conditionally popped depending on that first switch.

Other stopping criteria we could implement in future are 'logX', 'logLmax', 'ndead'.

anesthetic/samples.py

williamjameshandley · 2024-03-05T09:52:27Z

anesthetic/samples.py

+        """
+        logL = self.contour(logL)
+        i_live = ((self.logL >= logL) & (self.logL_birth < logL)).to_numpy()
+        i_dead = ((self.logL < logL)).to_numpy()


In hindsight, do we need the dead points?
self.logX()[i_live[0]] + logZ_live vs self.logZ() would probably also do the trick.

Not sure I see this, if I am taking the ratio, self.logZ() is the union whereas I want the self.iloc[dead].logZ()? So I think I need dead idx? I will implement all the other suggestions and resend for comments

anesthetic/samples.py

lukashergt

Introduces a PC termination criteria to check on live points

Would be nice if we could make the description a bit more detailed. Currently this leaves me at a bit of a loss what this is exactly about.

Does this addition deserve an entry in our docs?
https://anesthetic.readthedocs.io/en/latest/

…termination

yallup · 2024-03-07T14:17:24Z

Introduces a PC termination criteria to check on live points

Would be nice if we could make the description a bit more detailed. Currently this leaves me at a bit of a loss what this is exactly about.

Does this addition deserve an entry in our docs? https://anesthetic.readthedocs.io/en/latest/

Added a more complete description of the goal here to the original pr.

@williamjameshandley Reworked in a slightly unpleasant way to have the two functions declared only in scope, with some currently unnecessary code duplication as it stands, but I wanted to separate them in case the logX was added in a different way.

As this currently stand the tests are failing for the D_KL version as the run reports being terminated far too early (if I truncate at an early logL), I suspect my sums may be the wrong way round

williamjameshandley

OK, I've refactored this into something which now gets DKL correctly, and reorganises into something which impacts the samples class a little more gently. I've also implemented a few more criteria which shows why this layout is slightly more general.

The DKL was actually quite subtle, and after some thought it was not obvious how you could easily re-use the calculation from the evidence material.

Updates to documentation/feedback/tests welcome

yallup · 2024-03-25T17:02:49Z

OK, I've refactored this into something which now gets DKL correctly, and reorganises into something which impacts the samples class a little more gently. I've also implemented a few more criteria which shows why this layout is slightly more general.

The DKL was actually quite subtle, and after some thought it was not obvious how you could easily re-use the calculation from the evidence material.

Updates to documentation/feedback/tests welcome

Thanks Will, much cleaner in namespace when a separate module. My only request, and I am happy to make these changes is to be able to access the value as well as the criteria. As not all are expressed as a ratio my suggestion was to make all functions in terminate return a tuple of (Bool, float) representing the criteria and the underlying value

If that sounds reasonable I will make those changes to this PR

add pc term criteria

387ff78

yallup requested a review from williamjameshandley February 29, 2024 11:56

yallup and others added 11 commits February 29, 2024 12:32

cover no dead points

8fd2ac1

sign change

ad1d358

Added read_csv for weighted pandas

01ce2a9

Added labelled pandas testing

365b85c

Added weighted_labelled_pandas read_csv

9708866

Added read_csv for NestedSamples

17505fa

Added read_csv to anesthetic

4ee2b73

Updated pydocstyle

6fc79b5

bump version to 2.7.1

4016ccf

bump version to 2.8.0

43fa01d

updated documentation

2888a4f

williamjameshandley added 5 commits March 2, 2024 00:04

Removed inheritance from documentation

f9d3e68

Merge branch 'master' into read_csv

41ada6c

Merge branch 'master' into read_csv

77e6cd9

Merge branch 'master' into termination

97e248e

bump version to 2.8.0

849d629

williamjameshandley requested changes Mar 2, 2024

View reviewed changes

williamjameshandley and others added 4 commits March 2, 2024 00:42

Merge branch 'master' into read_csv

0ec4a2c

remove formatting

999526f

Merge branch 'termination' of github.com:handley-lab/anesthetic into …

51097bd

…termination

remove formatting

75d8628

yallup requested a review from williamjameshandley March 4, 2024 10:09

hallucinated ref

c662252

yallup mentioned this pull request Mar 4, 2024

Opinionated code formatting #367

Open

Merge branch 'read_csv' into termination

8a1117f

williamjameshandley and others added 4 commits March 4, 2024 10:58

Merge branch 'master' into termination

5ef5cd9

Updated to include chain reading

0fec842

Merge remote-tracking branch 'origin/read_csv' into termination

fdb5b24

Merge branch 'termination' of github.com:handley-lab/anesthetic into …

8dc9b87

…termination

williamjameshandley requested changes Mar 5, 2024

View reviewed changes

yallup added 3 commits March 5, 2024 10:33

Merge branch 'master' into termination

e414992

use inbuilts

a34b3d6

Merge branch 'master' into termination

26c0603

lukashergt reviewed Mar 7, 2024

View reviewed changes

yallup added 2 commits March 7, 2024 14:13

split out criteria

4a5141c

Merge branch 'termination' of github.com:handley-lab/anesthetic into …

bf4c30c

…termination

yallup requested a review from williamjameshandley March 7, 2024 14:18

williamjameshandley added 2 commits March 18, 2024 12:19

Suggested refactor

a878699

Updated documentation

68a3043

williamjameshandley reviewed Mar 18, 2024

View reviewed changes

williamjameshandley added 3 commits March 18, 2024 12:27

Merge branch 'master' into termination

9a70d46

bump version to 2.9.0

b1cda3a

Updated out-of-date docs

27f5ddb

williamjameshandley mentioned this pull request Jul 11, 2024

tracking convergence of the chain PolyChord/PolyChordLite#91

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Emulated termination criteria #363

Emulated termination criteria #363

yallup commented Feb 29, 2024 •

edited

Loading

yallup commented Feb 29, 2024

codecov bot commented Feb 29, 2024 •

edited

Loading

AdamOrmondroyd commented Mar 1, 2024 •

edited

Loading

williamjameshandley left a comment

williamjameshandley left a comment

williamjameshandley Mar 5, 2024

yallup Mar 7, 2024

lukashergt left a comment

yallup commented Mar 7, 2024

williamjameshandley left a comment

yallup commented Mar 25, 2024

Emulated termination criteria #363

Are you sure you want to change the base?

Emulated termination criteria #363

Conversation

yallup commented Feb 29, 2024 • edited Loading

Description

Checklist:

yallup commented Feb 29, 2024

codecov bot commented Feb 29, 2024 • edited Loading

Codecov Report

AdamOrmondroyd commented Mar 1, 2024 • edited Loading

williamjameshandley left a comment

Choose a reason for hiding this comment

williamjameshandley left a comment

Choose a reason for hiding this comment

williamjameshandley Mar 5, 2024

Choose a reason for hiding this comment

yallup Mar 7, 2024

Choose a reason for hiding this comment

lukashergt left a comment

Choose a reason for hiding this comment

yallup commented Mar 7, 2024

williamjameshandley left a comment

Choose a reason for hiding this comment

yallup commented Mar 25, 2024

yallup commented Feb 29, 2024 •

edited

Loading

codecov bot commented Feb 29, 2024 •

edited

Loading

AdamOrmondroyd commented Mar 1, 2024 •

edited

Loading