added MCMC posterior_predictive_branch sampler by danielturek · Pull Request #1086 · nimble-dev/nimble

danielturek · 2020-12-12T03:06:32Z

@perrydv @paciorek Added checking for jointly posterior predictive networks of nodes and a new posterior_predictive_branch sampler to be assigned to them.

I've tested this fairly thoroughly and believe it's sound, although admittedly no new tests were added. Code review or further testing is welcome.

danielturek · 2020-12-12T13:52:21Z

@perrydv @paciorek I've reviewed all the failures carefully, because I was curious if the sampler assignment logic held up.

The first failures are cases where the new posterior_predictive_branch sampler is now (correctly) assigned, but the tests check (usually) for the presence of either RW or conjugate samplers, to make sure the model inspection and sampler assignment work.

The first is in test-trunc.R, in the test labelled "Test that MCMC with truncation avoids conjugate samplers". I encourage anyone interested to look at this test and understand what's going on with the new assignment of the posterior_predictive_branch sampler. The best fix might be to make all the y terms data.

The next failure is in test-dynamicIndexing.R. There are two tests labelled "Testing multivariate normal-Wishart dependency conjugacy detection with dynamic indexing", and all the failures occur there. Maybe a better fix for this one would be to specify y as "data" in both tests? Then we'd still check for (and should assign) conjugate samplers, which is what the test is trying to verify. As is, with no data in the models, these nodes get the new posterior_predictive_branch sampler.

Finally, there are a number of failures against the "gold file" of exact MCMC results, I think all from test-mcmc.R. I haven't looked carefully at all of these, but of course it makes sense that we'd be getting different samples now.

Thoughts on how to proceed?

danielturek · 2020-12-12T20:19:30Z

@perrydv @paciorek I fixed the testing failures.

In tests that check for certain sampler assignments, I added "data" to the models, so the old checking for particular conjugate or non-conjugate relationships still applies (rather than the new posterior_predictive_branch sampler).
In tests checking for exact sample results, or matching the MCMC gold file, I added a new system option MCMCusePosteriorPredictiveBranchSampler which disables use of this new sampler. This option is turned off for the test-mcmc.R tests, and it now matches old results.

I'd welcome any review of this. @perrydv, your careful eye for efficiency would be especially welcome.

danielturek · 2020-12-12T22:47:53Z

@perrydv I'm going to modify this PR to use node graph IDs instead.

danielturek · 2020-12-13T14:50:06Z

@perrydv Unfortunately, some quick efficiency tests have shown this approach to be unusable. Back to the drawing board.

I have an alternate idea for a "bottom-up" approach to the problem, using getParents.

danielturek · 2020-12-14T12:26:32Z

@perrydv This checking and assignment is now "fast", and doesn't bog down the MCMC configuration process.

I'd welcome any code review, or we could merge this into devel.

paciorek · 2020-12-15T22:24:58Z

Many of our options are framed as "actions", i.e., they have a verb in them. If "Sample" here is a verb, then it should be "jointlySample" not "jointSample". Obviously minor, but my two cents.

danielturek · 2020-12-16T11:53:44Z

@paciorek I added your idea of skipping the check entirely when there's no possibility of any posterior predictive branches. Indeed, in some cases this eliminates the checking time altogether. Good idea.

perrydv · 2021-02-02T20:26:00Z

@danielturek Here are some suggestions on the code:

It would be nice to do model$getDependencies(thisCandNodeID, dataOnly = TRUE, downstream = TRUE, returnType = 'ids') once and re-use the results. It looks like it is done up to five times redundantly. One of those times has self = FALSE, which would need to be dealt with.
Could the three for loops be combined (which would help on the previous point)? It looks like the indToKeep and nodeIDs steps could be done in the first for loop.
I think the direct use of the maps internals could be replaced with model$expandNodeNamesFromGraphIDs.
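The first suggestion can be sketched in base R as follows. This is only an illustration under stated assumptions, not the PR's code: `query_data_deps` is a hypothetical stand-in for the expensive `getDependencies` call, the node names are made up, and the point is just that the `self = FALSE` result can be derived from a single query with `setdiff` instead of querying again.

```r
# Hypothetical stand-in for an expensive dependency query. It returns the
# data nodes at or below a node (the node itself is included if it is data)
# and counts how often it is called.
n_calls <- 0
downstream_data <- list(a = "y1", b = character(0), y1 = "y1")
query_data_deps <- function(node) {
  n_calls <<- n_calls + 1
  downstream_data[[node]]
}

candidates <- c("a", "b", "y1")
is_predictive <- logical(length(candidates))
no_data_below <- logical(length(candidates))

# Single loop: one query per candidate, with every later use derived from it.
for (i in seq_along(candidates)) {
  node <- candidates[i]
  deps <- query_data_deps(node)                 # the only query for this node
  deps_excluding_self <- setdiff(deps, node)    # replaces a self = FALSE query
  is_predictive[i] <- length(deps) == 0         # not data, and no data below
  no_data_below[i] <- length(deps_excluding_self) == 0
}

print(n_calls)
print(candidates[is_predictive])
```

Whether the real loop can be restructured this way depends on how the later steps (indToKeep, nodeIDs) use the results, which is not shown in this thread.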

perrydv · 2021-02-02T20:46:05Z

@danielturek Here is a possibly challenging benchmark case:

mu ~ dnorm(0, 1) # prior
y ~ dnorm(mu, 1) # data
for(i in 1:10000) {
  ypred[i] ~ dnorm(mu, 1) # predictive depth 1
  zpred[i] ~ dnorm(ypred[i], 1) # predictive depth 2
}

I think this will give 10000 posterior predictive branches, each with a (ypred[i], zpred[i]) pair.
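To check that count on a small scale, here is a base R sketch of the same graph with N = 3 instead of 10000. It does not use nimble and is not the detection code from this PR. It assumes a node is "predictive" when it is not data and has no data downstream, and that a branch starts at a predictive node with no predictive parent. The helper names are hypothetical.

```r
# Toy version of the benchmark graph with N = 3 (base R only).
# The graph is a named list mapping each node to its children.
N <- 3
ypred <- paste0("ypred[", 1:N, "]")
zpred <- paste0("zpred[", 1:N, "]")

children <- c(
  list(mu = c("y", ypred), y = character(0)),
  setNames(as.list(zpred), ypred),              # ypred[i] -> zpred[i]
  setNames(rep(list(character(0)), N), zpred)   # zpred[i] has no children
)
data_nodes <- "y"

# All nodes reachable from `node` by following child edges.
descendants <- function(node) {
  out <- character(0)
  todo <- children[[node]]
  while (length(todo) > 0) {
    out <- union(out, todo)
    todo <- setdiff(unlist(children[todo], use.names = FALSE), out)
  }
  out
}

# Assumed definition: not data, and no data anywhere downstream.
is_predictive <- function(node) {
  !(node %in% data_nodes) && !any(descendants(node) %in% data_nodes)
}
predictive <- Filter(is_predictive, names(children))

# A branch root is a predictive node none of whose parents is predictive.
has_predictive_parent <- function(node) {
  is_parent <- vapply(children, function(ch) node %in% ch, logical(1))
  any(names(children)[is_parent] %in% predictive)
}
roots <- Filter(Negate(has_predictive_parent), predictive)

# Each branch is a root together with everything below it.
branches <- lapply(setNames(roots, roots), function(r) c(r, descendants(r)))
print(length(branches))
```

Under these assumptions mu is excluded because the data node y depends on it, and each ypred[i] roots its own branch containing the matching zpred[i], so the number of branches equals N.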

danielturek · 2021-02-05T00:37:30Z

@perrydv I've made some efficiency improvements to identifying posterior predictive branches. Notably, I've combined the 3 loops into a single loop, which now only contains a single call to getDependencies for each candidate predictive branch node. I've also added testing for the sampler assignments to predictive branches.

Also, for the "possibly challenging benchmark case" you presented, the MCMC configuration time is nearly identical with and without the option MCMCjointlySamplePredictiveBranches: each takes about 22 or 23 seconds on my machine, so I don't see much of a performance hit.

Thanks for the careful code review. I'd like to say this is ready to merge, but I welcome any further feedback.

danielturek · 2021-02-06T12:45:50Z

A few more small changes, but again I consider this ready to merge.

perrydv · 2021-02-10T15:07:02Z

👍 Merging this is good with me.

Here is an idea for where we could go with this and also to avoid conditioning on any posterior predictive nodes in MCMC.

We could give the maps a predictive_IDs similar to (and not mutually exclusive of) top_IDs, latent_IDs and end_IDs. Then getNodeNames could get a predictiveOnly and/or includePredictive option. getDependencies already has an omit option, so one could omit predictive nodes using that. Or it could get a new omitPredictive option to avoid what could become a lot of redundant steps otherwise. Then the calls to getDependencies for every sampler could exclude predictive nodes.

This is still different from grouping predictive nodes into shared branches as this PR does.
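As a rough illustration of the exclusion idea only (plain integer vectors, not the maps object; the IDs and variable names below are hypothetical), precomputing a set of predictive IDs turns the per-sampler exclusion into a set difference:

```r
# Hypothetical graph IDs: 1 = mu, 2 = y (data), 3 = ypred, 4 = zpred.
predictive_IDs <- c(3L, 4L)

# Dependencies a sampler on mu would normally calculate: mu, y and ypred.
deps_of_mu <- c(1L, 2L, 3L)

# Excluding predictive nodes leaves only mu and y in the calculation.
calc_IDs <- setdiff(deps_of_mu, predictive_IDs)
print(calc_IDs)
```

This only drops predictive nodes from a sampler's calculations. It does not group them into branches.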

danielturek · 2021-02-10T16:20:39Z

@perry I fully support the idea of adding a predictiveNode flag to the maps. The determination of that could use (via refactoring) the detection logic which this PR adds to the MCMC. For now, I'm merging this PR.

paciorek · 2021-02-10T22:19:07Z

I also like the predictiveIDs idea. Should we start an NCT issue?

added MCMC posterior_predictive_branch sampler

15cf5fe

danielturek added 2 commits December 12, 2020 09:04

added data to tests, to recover old sampler assignments

3a88324

added nimble option to prevent post_pred_branch sampler, so testing a…

11bc7e8

…grees

danielturek added 2 commits December 12, 2020 23:03

checks for post pred braches using nodeIDs

7f1b94e

fixed mistake in defining variable

b6adade

danielturek added 4 commits December 13, 2020 14:24

made finding posterior predictive branches faster

0e2cdae

updated posterior predictive branches slightly

dbd44c3

minor logic adjustment

9f2a5a0

fixed typos in NEWS

06b07c4

changed option name to MCMCjointSamplePredictiveBranches

8d30be8

danielturek added 2 commits December 15, 2020 18:16

changed option name to MCMCjointlySamplePredictiveBranches

901e368

added additional checking

32dd10e

danielturek added 6 commits February 3, 2021 10:53

improving posterior predictive branch efficiency

0e6f051

Merge branch 'devel' into posterior_predictive_branches

3bd5d4f

added a blank line to trigger testing

cbcb9b2

added testing for posterior predictive branch detection

9e64239

made option true for testing

464e10f

efficieny improvements in finding posterior predictive branches

1177f0a

danielturek added 2 commits February 5, 2021 11:50

replaced direct map use with model

b6a69ab

removed commented lines

3dedb22

danielturek merged commit 8e84f42 into devel Feb 10, 2021

danielturek deleted the posterior_predictive_branches branch February 10, 2021 16:20

paciorek mentioned this pull request Oct 7, 2021

fix 0 or 1 as constants in conjugacy processing #1172

Merged
