Adding Diganostics to Ecdf Plots in the spirit of TARP #261

arrjon · 2024-11-27T13:23:35Z

This pull request focuses on enhancing the functionality of the ECDF plotting functions plot_sbc_ecdf. The changes include adding new rank computation methods in the spirit of Lemos, Pablo, et al. "Sampling-based accuracy testing of posterior estimators for general inference.". Feedback is welcome!

The idea is to compute ranks based on distances to the origin or a random point instead of the marginal fractional ranks. This way one can analyse the joint calibration of the model instead of looking at single parameters. If the random reference has some dependency on the data, this can reveal biases which would not be detected by the standard ECDF plots (e.g. when the prior equals the posterior the ECDF indicate a well calibrated model; when we have a tutorial on the diagnostics as discussed in #236, I would add an examples there). One can also pass a distance function as an argument, which could be for instance a distance based on the log-probability of the approximator.

Furthermore, I fixed some minor bugs in the function plot_posterior_2d.

paul-buerkner · 2024-11-27T14:45:59Z

That looks cool, thank you! @jerrymhuang would you mind reviewing this PR since you have been working on the diagnostics module previously?

stefanradev93 · 2024-12-02T04:09:28Z

@jerrymhuang Can you fix the conflicts due to the very recent changes in dev diagnostics?
@paul-buerkner As an author of the original ECDF paper, can you approve the new interface and functionality?

jerrymhuang

Nice changes. Thank you @arrjon !

@stefanradev93 @paul-buerkner The conflict is fixed to align with the recent partitioning scheme for the diagnostics module. The distance rank is also tested in the Linear Regression notebook.

paul-buerkner

Thank you for adding this feature! I had only a little time to check the paper but from what I understand, your implementation matches what they propose. I would have to investigate the properties of the method in more detail to fully understand it. But since we are currently having the new feature only optional with the standard SBC as default, I think we can safely merge this PR and improve the feature later on if we figure out that is needed for some yet unknown reason.

arrjon · 2024-12-02T09:32:27Z

Before implementing the feature, I considered the following: The key requirement is that the rank of the true parameter with respect to samples from the posterior distribution is uniformly distributed, regardless of how the parameters are transformed (e.g. just looking at the marginals or apply a norm). With this condition satisfied, everything, including the simultaneous confidence bands, works out fine.

In the paper by Lemos et al., they basically showed that you can even use a rank based on a distance to random reference point to compute valid coverage statistics. So instead of ECDFs (as we do here) they computed expected coverages based on these ranks.

paul-buerkner · 2024-12-02T09:43:23Z

That makes sense, thank you! We could later on generalize or extend this feature to allow for other kind of metrics that a distance, including the (log) likelihood density as target, which we suggested in https://arxiv.org/abs/2211.02383. But that is subject to a separate PR.

arrjon · 2024-12-02T09:49:28Z

That makes sense, thank you! We could later on generalize or extend this feature to allow for other kind of metrics that a distance, including the (log) likelihood density as target, which we suggested in https://arxiv.org/abs/2211.02383. But that is subject to a separate PR.

This is already possible by constructing a distance based on the likelihood density and passing it with the argument distance to ranks_kwargs. But we could certainly implement it in way, so the user can just apply it!

paul-buerkner · 2024-12-02T12:09:55Z

Good point! And I agree, some more convenience functionaly for common use cases would be cool down the line.

paul-buerkner · 2024-12-02T12:10:15Z

@stefanradev93 you can merge if you are happy with the PR as well.

stefanradev93 · 2024-12-02T22:33:37Z

Thank you all!

arrjon added 19 commits November 25, 2024 18:45

ecdf with random points

c5ec374

single axis

8672abe

single axis

bcb7b5a

add comments

abcdf3c

clean

ac2e239

clean

55823e3

title fix

22e13a8

docstring

f1519c8

posterior 2d

83e6ed0

posterior 2d fix

d3f50e8

posterior 2d fix

3bb2250

posterior 2d fix

6c71e89

fix reference

3c922a3

clean up

29cedf0

add comment

47b6d18

add tests

74f510a

pass kwargs

a5cee6d

fix title

9483782

make more customizable

0b09c09

paul-buerkner requested a review from jerrymhuang November 27, 2024 14:45

stefanradev93 requested a review from paul-buerkner December 2, 2024 04:04

jerrymhuang added 2 commits December 1, 2024 23:33

Merge branch 'dev' into ecdf_random

25495ef

fix conflict

9f23593

jerrymhuang approved these changes Dec 2, 2024

View reviewed changes

paul-buerkner approved these changes Dec 2, 2024

View reviewed changes

stefanradev93 merged commit 24e70aa into bayesflow-org:dev Dec 2, 2024
13 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Adding Diganostics to Ecdf Plots in the spirit of TARP #261

Adding Diganostics to Ecdf Plots in the spirit of TARP #261

Uh oh!

arrjon commented Nov 27, 2024 •

edited

Loading

Uh oh!

paul-buerkner commented Nov 27, 2024

Uh oh!

stefanradev93 commented Dec 2, 2024

Uh oh!

jerrymhuang left a comment •

edited

Loading

Uh oh!

paul-buerkner left a comment

Uh oh!

arrjon commented Dec 2, 2024

Uh oh!

paul-buerkner commented Dec 2, 2024

Uh oh!

arrjon commented Dec 2, 2024

Uh oh!

paul-buerkner commented Dec 2, 2024

Uh oh!

paul-buerkner commented Dec 2, 2024

Uh oh!

Uh oh!

stefanradev93 commented Dec 2, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Adding Diganostics to Ecdf Plots in the spirit of TARP #261

Adding Diganostics to Ecdf Plots in the spirit of TARP #261

Uh oh!

Conversation

arrjon commented Nov 27, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

paul-buerkner commented Nov 27, 2024

Uh oh!

stefanradev93 commented Dec 2, 2024

Uh oh!

jerrymhuang left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

paul-buerkner left a comment

Choose a reason for hiding this comment

Uh oh!

arrjon commented Dec 2, 2024

Uh oh!

paul-buerkner commented Dec 2, 2024

Uh oh!

arrjon commented Dec 2, 2024

Uh oh!

paul-buerkner commented Dec 2, 2024

Uh oh!

paul-buerkner commented Dec 2, 2024

Uh oh!

Uh oh!

stefanradev93 commented Dec 2, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

arrjon commented Nov 27, 2024 •

edited

Loading

jerrymhuang left a comment •

edited

Loading