
Added dynamic rhat #77

Merged
merged 1 commit into rlouf:master on Jul 29, 2021

Conversation

sidravi1
Contributor

@sidravi1 sidravi1 commented Feb 7, 2021

Overview

Displays Rhat values in the progress bar. Partially addresses #8.

Details

Here it is in action:

rhat_dynamic.mov

To do

  1. Probably should add variable names to Rhat
  2. Probably move it to a new line else will get messy with a lot of vars
  3. Basic offline Rhat and warning if too high (like pymc3)
  4. More testing - especially for mvn 🙊

@rlouf
Owner

rlouf commented Feb 8, 2021

The code looks great and I really like the result!

  1. Probably should add variable names to Rhat
  2. Probably move it to a new line else will get messy with a lot of vars

Since this is a rough indicator, I thought we could only display the "worst" value of Rhat among all variables (in terms of distance to 1). Other values can be shown in the inference summary. The ideal would be to update a graph with all the values of Rhat over time, but that's a project in itself.
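That "worst value" selection can be sketched as follows; the variable names and Rhat values are purely illustrative, not mcx's actual data structures:

```python
# Hypothetical sketch: pick the "worst" Rhat (farthest from 1) among all
# variables, to show a single rough indicator in the progress bar.
rhats = {"sigma": 1.01, "rho": 1.12, "coeffs": 0.97}

worst_var = max(rhats, key=lambda name: abs(rhats[name] - 1.0))
worst_rhat = rhats[worst_var]

print(f"worst Rhat: {worst_var}={worst_rhat:.2f}")  # → worst Rhat: rho=1.12
```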

  3. Basic offline Rhat and warning if too high (like pymc3)

As discussed, implementing the rank-normalized Rhat for the inference summary would be best. Adding a warning if a value is too high is a good idea, and it is even better if that warning is actionable: what can I do as a modeler with this information?
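An actionable warning along those lines could look like this minimal sketch; the 1.01 threshold and the wording of the advice are assumptions, not mcx's implementation:

```python
# Hypothetical sketch of an actionable Rhat warning: name the offending
# variables and suggest what the modeler can do about it.
import warnings

def check_rhat(rhats: dict, threshold: float = 1.01) -> None:
    """Warn when any variable's Rhat exceeds the threshold."""
    bad = {name: r for name, r in rhats.items() if r > threshold}
    if bad:
        warnings.warn(
            f"Rhat > {threshold} for {sorted(bad)}: the chains may not have "
            "mixed. Consider drawing more samples or reparametrizing the model."
        )

check_rhat({"sigma": 1.2, "rho": 1.0})  # warns about 'sigma'
```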

Where should it be displayed? After the progress bar or do we print (at least part of) the inference summary first?

  4. More testing - especially for mvn 🙊

Indeed, multivariate random variables are more error-prone :) The best is probably to take examples from the paper and check that computing Rhat on these chains gives the expected result.
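For reference, the classic split-Rhat (not the rank-normalized variant discussed above) can be sketched and sanity-checked like this; the `(n_chains, n_samples)` chain layout is an assumption:

```python
# A sketch of the classic split-Rhat diagnostic, with a quick sanity check:
# well-mixed chains should give ~1.0, while shifted chains should not.
import numpy as np

def split_rhat(chains: np.ndarray) -> float:
    """Split each chain in half, then compute the potential scale reduction."""
    n_chains, n_samples = chains.shape
    half = n_samples // 2
    splits = np.concatenate([chains[:, :half], chains[:, half : 2 * half]])
    m, n = splits.shape
    chain_means = splits.mean(axis=1)
    B = n * chain_means.var(ddof=1)           # between-chain variance
    W = splits.var(axis=1, ddof=1).mean()     # within-chain variance
    var_hat = (n - 1) / n * W + B / n         # marginal posterior variance estimate
    return float(np.sqrt(var_hat / W))

rng = np.random.default_rng(0)
good = split_rhat(rng.normal(size=(4, 1000)))                          # ~1.0
bad = split_rhat(rng.normal(size=(4, 1000)) + np.arange(4)[:, None])   # >> 1.0
print(good, bad)
```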

@rlouf
Owner

rlouf commented Feb 9, 2021

Btw for the sake of making incremental changes it would be better to address (3) in a separate PR.

@rlouf rlouf force-pushed the master branch 3 times, most recently from f8f3e6b to 965f6dd on February 23, 2021 11:28
@rlouf
Owner

rlouf commented Apr 12, 2021

Hey @sidravi1 what's the status on this PR?

@sidravi1
Contributor Author

sidravi1 commented Apr 13, 2021 via email

@rlouf
Owner

rlouf commented Apr 15, 2021

No problem, this is open source, not paid work 🙂

@sidravi1
Contributor Author

Hi @rlouf - I've got dynamic rhat working, though using set_postfix slows performance down by ~50%.

Tested it with this mvnormal model as well.

@mcx.model
def linear_regression_mvn(x, lmbda=1.):
    sigma <~ dist.Exponential(lmbda)
    sigma2 <~ dist.Exponential(lmbda)
    rho <~ dist.Uniform(-1, 1)
    cov = jnp.array([[sigma**2, rho*sigma*sigma2],[rho*sigma*sigma2, sigma2**2]])
    coeffs_init = jnp.ones(x.shape[-1])
    coeffs <~ dist.MvNormal(coeffs_init, cov)
    y = jnp.dot(x, coeffs.T)
    preds <~ dist.Normal(y, sigma)
    return preds

sampler = mcx.sampler(
    rng_key,
    linear_regression_mvn,
    (x_data_mvn,),
    {'preds': y_data_mvn},
    HMC(10),
)
posterior = sampler.run()

If all looks good, I'll clean up the commit history before the merge.

@sidravi1 sidravi1 requested a review from rlouf June 13, 2021 20:33
@sidravi1 sidravi1 changed the title [WIP] Added dynamic rhat Added dynamic rhat Jun 13, 2021
@rlouf
Owner

rlouf commented Jun 14, 2021

Great! Could you try using the mininterval flag and setting it to something like .5s or 1s and report the slowdown then? (https://github.com/tqdm/tqdm/blob/master/tqdm/std.py#L873-L880)
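The suggestion above can be sketched as follows; the Rhat value is a stand-in for mcx's real online computation, which is what actually dominated the runtime in the end:

```python
# Sketch: throttle progress-bar redraws with tqdm's `mininterval` flag so
# the postfix (Rhat) display refreshes at most every 0.5 seconds.
from tqdm import tqdm

progress = tqdm(range(1000), mininterval=0.5)  # redraw at most every 0.5s
for i in progress:
    worst_rhat = 1.0 + 1.0 / (i + 1)  # stand-in for an online Rhat update
    # refresh=False defers drawing to tqdm's own mininterval schedule
    progress.set_postfix(rhat=f"{worst_rhat:.3f}", refresh=False)
progress.close()
```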

@sidravi1
Contributor Author

sidravi1 commented Jun 14, 2021

Great! Could you try using the mininterval flag and setting it to something like .5s or 1s and report the slowdown then? (https://github.com/tqdm/tqdm/blob/master/tqdm/std.py#L873-L880)

mininterval doesn't seem to help much. The bottleneck is actually the rhat updating, not tqdm. What are your thoughts on making it optional? We could also use a pattern where you can register callbacks to run a bunch of other online stats.

[screenshot: timing comparison showing multiple progress bars]

I should also point out that the bottleneck is most noticeable when the model is simple (the linear example); when it's more complex (the multivariate example), the relative slowdown is much smaller.

@rlouf
Owner

rlouf commented Jun 15, 2021

It's all a question of user interface. The original idea was that, since we spend 99% of our time debugging models, the sample function would be interactive by default: it displays as much information as possible to see when issues arise and can be interrupted at any time to diagnose these issues. compile=True would show nothing but the progress bar and would correspond to situations where we need inference to be as fast as possible; we could also define a fast_sample function for that purpose.

Now, if you have to wait an extra few seconds for simple models but it does not affect large models, it is not really a problem.

Nevertheless, I like your idea of designing these online metrics as callbacks. This would allow users to customize the metrics being displayed and/or follow their own metrics. It is also cleaner from a code perspective. This way sample would be called with callbacks=[rhat, ess, divergences] by default.
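The callback design described above could look like this minimal sketch; `rhat`, `ess`, and `sample_loop` here are hypothetical stand-ins, not mcx's actual API:

```python
# Minimal sketch of the online-metrics callback pattern: each callback
# receives the sampler state and returns metrics to display.
from typing import Callable, Dict, List

def rhat(state: Dict) -> Dict:
    # Stand-in for an online Rhat update based on the sampler state.
    return {"rhat": 1.0 + 1.0 / state["num_steps"]}

def ess(state: Dict) -> Dict:
    # Stand-in for an online effective-sample-size estimate.
    return {"ess": 0.8 * state["num_steps"]}

def sample_loop(num_steps: int, callbacks: List[Callable]) -> Dict:
    metrics: Dict = {}
    for step in range(1, num_steps + 1):
        state = {"num_steps": step}
        for callback in callbacks:  # each callback contributes its own metrics
            metrics.update(callback(state))
    return metrics

metrics = sample_loop(10, callbacks=[rhat, ess])
print(metrics)  # → {'rhat': 1.1, 'ess': 8.0}
```

Users could then pass their own functions in the `callbacks` list to follow custom metrics, which keeps the display logic out of the sampler itself.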

PS: is the multiple progress bar a bug?

@sidravi1
Contributor Author

Ok. Makes sense.

Should we merge this in and switch to callback design pattern in another PR (when we implement ESS or divergences) or do you want me to update this one?

The multiple progress bars are because of the %%timeit cell magic on top. Just runs it multiple times to get the average run time.

@rlouf
Owner

rlouf commented Jun 15, 2021

Would you mind updating this one?

@sidravi1
Contributor Author

Yep! Can do :)

@sidravi1
Contributor Author

Thanks for your patience @rlouf - I've made those changes. Let me know what you think.

mcx/sample.py (comment on lines 532 to 533):
call_backs:
The functions to run after each state update
Owner


This does not appear in the function's signature

Contributor Author


Oops. Fixed! I'll also squash all the commits so it's ready to merge in.

Owner

@rlouf rlouf left a comment


Apart from my small comment on a docstring, everything is perfect. Ready to merge once that's fixed.

allows online metrics to be passed to sample_loop
@sidravi1
Contributor Author

@rlouf - Thanks for reviewing. I've made that one docstring fix and squashed all the commits.

@rlouf rlouf merged commit af1adc9 into rlouf:master Jul 29, 2021
@rlouf
Owner

rlouf commented Jul 29, 2021

Great work, the code was really clean and self-explanatory!
