
initialize chains from estimated posterior samples #1655

Merged: 3 commits into pymc-devs:master on Jan 18, 2017

Conversation

aloctavodia (Member)

For advi and nuts initialization, when njobs > 1 the chains' starting points are sampled from the estimated posterior; otherwise the estimated posterior mean is used.

```python
if njobs > 1:
    start = pm.variational.sample_vp(v_params, njobs, progressbar=False,
                                     hide_transformed=False)
else:
    start = v_params.means
```
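A self-contained sketch of that branching logic, using a NumPy stand-in for the fitted ADVI approximation (`sample_vp` and `v_params` are PyMC3 names; the toy `means`/`stds` values here are purely illustrative):

```python
import numpy as np

rng = np.random.default_rng(42)

# Toy stand-in for ADVI's fitted approximation: a diagonal Gaussian
# with per-parameter means and standard deviations (v_params analogue).
means = np.array([0.5, -1.0, 2.0])
stds = np.array([0.1, 0.2, 0.05])

def sample_starts(njobs):
    """Mimic the PR logic: one posterior draw per chain when njobs > 1,
    otherwise the approximate posterior mean."""
    if njobs > 1:
        return means + stds * rng.standard_normal((njobs, means.size))
    return means

starts = sample_starts(njobs=4)
print(starts.shape)  # (4, 3): one distinct starting point per chain
```

This gives each parallel chain its own starting point, rather than starting all chains from the identical mean vector.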
Review comment (Member):
I think we always want a sample from the posterior to start in the "typical set". The mean can be far away from that, which is counter-intuitive, but true for high-dimensional models: https://www.youtube.com/watch?v=pHsuIaPbNbY
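The typical-set point is easy to check numerically: draws from a d-dimensional standard normal concentrate on a thin shell of radius about sqrt(d) around the mean, so the mean itself (at radius 0) is nowhere near a typical draw. A quick illustration:

```python
import numpy as np

rng = np.random.default_rng(0)

# In high dimensions, draws from a standard normal concentrate on a
# shell of radius ~sqrt(d); the mean (radius 0) lies far outside it.
for d in (2, 100, 1000):
    draws = rng.standard_normal((10_000, d))
    radii = np.linalg.norm(draws, axis=1)
    print(d, radii.mean(), radii.std())
```

For d = 1000 the mean radius is close to sqrt(1000) with a standard deviation under 1, so essentially no draw is anywhere near the mean vector.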

@aloctavodia (Member Author) commented Jan 7, 2017 via email

ok. I will revert that. I guess nuts (when njobs = 1) should also start from a posterior sample and not from the mean of the posterior, right?

@twiecki (Member) commented Jan 7, 2017

Exactly.

@aloctavodia (Member Author)

That's right, but the discarded values only affect the starting points; the whole array is then used for the actual sampling.

@aloctavodia (Member Author)

@twiecki @ColCarroll any (new) thoughts on this?

@ColCarroll (Member)

Ah, just checked, and advi's random_seed implementation uses None as a default. I also think it might cause trouble using MRG_RandomStreams to set the seed (have to check whether that shares state with numpy.random). There might also be another subtle bug, but I will open a separate issue for that.

If you want to land this now, we'll have to remember to change this when the random_seed fix lands.

@ColCarroll (Member)

(so, this all looks good, but I think there's a subtle problem elsewhere!)

@ColCarroll (Member)

Oh, heh, #1656 is the problem!


```python
init_trace = pm.sample(step=pm.NUTS(), draws=n_init,
                       random_seed=random_seed)[n_init//2:]
start = {varname: np.mean(init_trace[varname])
         for varname in init_trace.varnames}
```
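A runnable sketch of the same idea, with a plain dict of draw arrays standing in for a PyMC3 MultiTrace (variable names are illustrative): the first half of the short init run is treated as warm-up and discarded only when computing the start values.

```python
import numpy as np

rng = np.random.default_rng(1)

n_init = 200
# Stand-in for a short NUTS init run: a dict of per-variable draw
# arrays (a real PyMC3 trace would be indexed the same way).
init_trace = {
    "mu": rng.normal(0.0, 1.0, size=n_init),
    "sigma": rng.lognormal(0.0, 0.5, size=n_init),
}

# Discard the first half as warm-up *only* for choosing start values;
# the full arrays remain available for anything else.
start = {name: np.mean(draws[n_init // 2:])
         for name, draws in init_trace.items()}
print(sorted(start))  # ['mu', 'sigma']
```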
Review comment (Member):

wrong indent.

Review comment (Member):

@aloctavodia once these minor issues are fixed we can start to merge this I think.

```python
    cov = np.power(model.dict_to_array(v_params.stds), 2)
elif init == 'advi_map':
    start = pm.find_MAP()
    v_params = pm.variational.advi(n=n_init, start=start,
                                   random_seed=random_seed)
```
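The cov line above builds a diagonal covariance estimate for the sampler by squaring ADVI's per-parameter posterior standard deviations. A minimal stand-in (a flat array replaces `model.dict_to_array(v_params.stds)`; the values are illustrative):

```python
import numpy as np

# Stand-in for model.dict_to_array(v_params.stds): ADVI's fitted
# per-parameter posterior standard deviations, flattened to a vector.
stds = np.array([0.1, 0.2, 0.05])

# Squaring the stds gives the diagonal covariance used to scale NUTS.
cov = np.power(stds, 2)
print(cov)
```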
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

wrong indent

@twiecki merged commit b6827a6 into pymc-devs:master on Jan 18, 2017
@twiecki (Member) commented Jan 18, 2017

Thanks @aloctavodia!
