Added iterative sampling. #433

twiecki · 2013-12-31T00:16:50Z

I moved most of sample() into iter_sample() (open to name suggestions). iter_sample() can be used in a for-loop. This is useful for convergence checking and animated plotting during sampling.

sample() now just loops over iter_sample(). I'll wait for the tests to see if that impaired performance somehow.

jsalvatier · 2013-12-31T22:29:27Z

pymc/sample.py

-            if progressbar:
-                progress.update(i)
+            trace.record(point)
+            yield trace
    except KeyboardInterrupt:


Looks like you can remove this try statement

jsalvatier · 2013-12-31T22:30:26Z

Seems fine, but I don't think I get the purpose. Why does it help with convergence checking/animated plotting?

twiecki · 2013-12-31T22:33:24Z

well, a pattern could be that you loop through iter_sample() and inside the for block check if some criterion is reached. Or you could make a call to update the plot with the most recent samples in the for-block (as I do).

jsalvatier · 2013-12-31T22:51:42Z

Is it that you want to make different versions of sample (some that do continuous updating some, that check a criteria) but you want to abstract out the shared part?

How come you can just add some statements in the for loop?

I'm not opposed to this, I just don't quite get the benefit. It does look pretty.

twiecki · 2013-12-31T22:56:48Z

Certainly that's possible with this. Not sure we can come up with a method that works for everyone to include in pymc but you could imagine sampling until you got 1000 samples with a geweke score < X.

My specific use case was real-time plotting during sampling.

jsalvatier · 2013-12-31T23:03:58Z

Ooh, okay, more for external users if they want to do their own intersampling logic. I get it now.

twiecki · 2013-12-31T23:08:34Z

Right, exactly.

jsalvatier · 2013-12-31T23:13:19Z

Would definitely be cool to see real-time plotting too at some point.

(lets merge this once it passes and check to make sure its not slowing things greatly)

twiecki · 2013-12-31T23:15:20Z

@jsalvatier yeah, I have a notebook that's pretty sweet. Will upload a blog post soon.

twiecki · 2014-01-01T17:17:32Z

OK, doesn't seem like there's a major performance regression.

Added iterative sampling.

jsalvatier · 2014-01-01T20:48:27Z

Great!

twiecki · 2014-01-02T15:19:09Z

Here are some visualizations:
http://twiecki.github.io/blog/2014/01/02/visualizing-mcmc/

fonnesbeck · 2014-01-02T18:56:01Z

Very cool.

Regarding using this for convergence, it is probably more robust to use Gelman-Rubin with multiple chains than Geweke. In general, we should be working towards running multiple chains by default, since every modern machine will be multicore. This would facilitate (on-the-fly) R-hat calculation.

twiecki · 2014-01-02T18:59:43Z

Agreed. Doing parallel sampling iteratively would require a bit more work but shouldn't be too hard. Will probably need to fix the parallel pickling issues we're having now (do those still exist @jsalvatier?).

jsalvatier · 2014-01-02T19:27:21Z

I don't think we're getting errors when doing things in parallel now, but
you have to be careful in what you do.

On Thu, Jan 2, 2014 at 10:59 AM, Thomas Wiecki notifications@github.comwrote:

Agreed. Doing parallel sampling iteratively would require a bit more work
but shouldn't be too hard. Will probably need to fix the parallel pickling
issues we're having now (do those still exist @jsalvatierhttps://github.com/jsalvatier
?).

—
Reply to this email directly or view it on GitHubhttps://github.com//pull/433#issuecomment-31475215
.

Before, `iter_sample` returned a single-chain trace object, not a `MultiTrace` instance like `sample`. This is an issue for functions that rely on a `MultiTrace`-like interface. See #632.

twiecki added 3 commits December 30, 2013 18:44

Added iterative sampling.

66f7baf

Add unittest for iter_sample.

0473a26

Updated doc string for iter_trace.

5e4f7b7

jsalvatier reviewed Dec 31, 2013
View reviewed changes

Removed try/except around sample loop.

e7a587d

jsalvatier added a commit that referenced this pull request Jan 1, 2014

Merge pull request #433 from pymc-devs/iter_sample

5a4bdfa

Added iterative sampling.

jsalvatier merged commit 5a4bdfa into master Jan 1, 2014

twiecki deleted the iter_sample branch January 2, 2014 15:19

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Added iterative sampling. #433

Added iterative sampling. #433

twiecki commented Dec 31, 2013

jsalvatier Dec 31, 2013

twiecki Dec 31, 2013

jsalvatier commented Dec 31, 2013

twiecki commented Dec 31, 2013

jsalvatier commented Dec 31, 2013

twiecki commented Dec 31, 2013

jsalvatier commented Dec 31, 2013

twiecki commented Dec 31, 2013

jsalvatier commented Dec 31, 2013

twiecki commented Dec 31, 2013

twiecki commented Jan 1, 2014

jsalvatier commented Jan 1, 2014

twiecki commented Jan 2, 2014

fonnesbeck commented Jan 2, 2014

twiecki commented Jan 2, 2014

jsalvatier commented Jan 2, 2014

Added iterative sampling. #433

Added iterative sampling. #433

Conversation

twiecki commented Dec 31, 2013

jsalvatier Dec 31, 2013

Choose a reason for hiding this comment

twiecki Dec 31, 2013

Choose a reason for hiding this comment

jsalvatier commented Dec 31, 2013

twiecki commented Dec 31, 2013

jsalvatier commented Dec 31, 2013

twiecki commented Dec 31, 2013

jsalvatier commented Dec 31, 2013

twiecki commented Dec 31, 2013

jsalvatier commented Dec 31, 2013

twiecki commented Dec 31, 2013

twiecki commented Jan 1, 2014

jsalvatier commented Jan 1, 2014

twiecki commented Jan 2, 2014

fonnesbeck commented Jan 2, 2014

twiecki commented Jan 2, 2014

jsalvatier commented Jan 2, 2014