wrap BNP cluster samplers so only used if cluster has observations by paciorek · Pull Request #839 · nimble-dev/nimble

paciorek · 2019-01-03T01:50:24Z

Do not merge: this is for @danielturek to review @paciorek's approach to not running MCMC samplers for BNP cluster parameters when a given cluster has no constituent observations.

This work builds off @perrydv's reversible jump sampler, extending it to wrap whatever sampler configureMCMC assigns to the cluster parameter.
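As a rough illustration of the wrapping idea (a sketch in plain R, not the code in this PR): a wrapper holds the sampler assigned to one cluster parameter and calls it only when at least one observation currently belongs to that cluster. The names `make_cluster_wrapper`, `wrapped_run`, `cluster_id`, and `get_xi` are hypothetical, and an ordinary R closure stands in for the actual nimbleFunction sampler.

```r
# Hypothetical illustration: wrap a sampler so it is skipped for empty clusters.
# `wrapped_run` stands in for the run function of whatever sampler was assigned
# to the cluster parameter; `get_xi` returns the current cluster memberships.
make_cluster_wrapper <- function(wrapped_run, cluster_id, get_xi) {
    function() {
        if (any(get_xi() == cluster_id)) {
            wrapped_run()      # cluster has observations: sample as usual
            TRUE
        } else {
            FALSE              # empty cluster: skip the wrapped sampler
        }
    }
}

# Toy state: five observations, currently assigned to clusters 1 and 2 only.
state <- new.env()
state$xi <- c(1, 1, 2, 1, 2)
state$calls <- integer(3)      # how often each cluster's sampler actually ran

wrappers <- lapply(1:3, function(k) {
    make_cluster_wrapper(
        wrapped_run = function() {
            state$calls[k] <- state$calls[k] + 1L
        },
        cluster_id  = k,
        get_xi      = function() state$xi
    )
})

# One sweep over the cluster parameters; cluster 3 is empty and is skipped.
ran <- vapply(wrappers, function(w) w(), logical(1))
```

In this toy setup only the wrappers for clusters 1 and 2 invoke their wrapped sampler, which is the behaviour the PR is aiming for with the real samplers.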

* defensive WAIC model handling * undid changes to tr travis testing again * changed to model$calculate() * changed to currentVals2 * commented out currentVals <- values(model) * added back in someVals <- values(model) * added back in someVals <- values(model) * changed to using model$values() syntax * using values(model, sampledNodes) again.... * wont work * different approach: storing logProb values

* monkeying with ignore.stderr * augmented failed to create shared library msg * full rework of compilation output to print errors under failure * update compiler error output so only printed upon user request if showCompilerOutput is FALSE

* fix bug in compareFilesByLine for gold checking * updated test-graphStructure to use std goldFile system * updated filtering goldFile in light of 0.6-12 changes; now that goldFile testing is fixed * minor edit to test-dynIdx to suppress output from sapply * minor edits to various test files to suppress sapply output so goldfile results match * updated user goldfile in light of changes s * reinstantiated user-supplied fxn tests in test-user; not clear why they were turned off * clean up error msg in initialize model regarding determ nodes * updated mcmc goldfile * update stripping of testthat deprecation message * clean up typo in test-mcmc.R * fixed extraneous stuff in test-mcmc.R * trying to fix goldfile comparisons * debug dynIdx test failures * try again with dynIdx test * another try with dynIdx * another try with dynIdx * fix dynIdx goldfile

… control param to RW_block. (#772) * set up RW and RW_block to calculate prior log prob first. Added tries control param to RW_block. * make correct separation between first and second stage of dependency calculations in sampler_RW_block * set jump <- FALSE on -Inf prior logProb value * put in unnecessary calls to jump() in order to trigger runif() to get same sampling sequence for testing * fix typo in sampler_RW_block * in inst/CppCpde/Utils.cpp decide(), force RNG use even when input is NaN, purely for test comparison purposes. * Also modify R version of decide() * remove code designed for matching chains to calcPriorFirst-devel-proxy * updated test-dynamicIndexing in light of calcPriorFirst (changes to "basic mixture model with conjugacy") * monkey with exact mcmc output for calcPriorFirst * monkey with exact mcmc output for calcPriorFirst(2) * update test-trunc in light of calcPriorFirst * remove stray line from decide in calcPriorFirst machinations * fix bug in compareFilesByLine that was causing no goldFile comparison to be made; ever since Sep. 2017 * update mcmc/trunc goldfiles in light of calcPriorFirst changes to RNG sequence * updated tolerance for interval cens test

paciorek · 2019-01-03T01:55:48Z

@danielturek ignore the line

clusterNodeInfo <- list(clusterVars = 'muTilde', clusterNodes = list( paste0('muTilde[', 1:50, ']')))

in configureMCMC.R. That is only there because there is some work on a separate branch that would need to get merged in for us to be able to automatically create clusterNodeInfo. At the moment, I'm just hard-coding in what clusterNodeInfo looks like for a specific example model I was using when developing this.

In case you want to actually be able to run the code, here is the example model:

library(nimble)

code <- nimbleCode({
    xi[1:n] ~ dCRP(conc0, n)                # cluster memberships
    for(i in 1:n) {
        mu[i] <- muTilde[xi[i]]
        y[i] ~ dnorm(mu[i], 1)
    }
    for(i in 1:n) {
        muTilde[i] ~ dnorm(mu0, sd = s0)    # cluster parameters
    }
})

n <- 50
model <- nimbleModel(code,
                     constants = list(n = n),
                     data = list(y = rnorm(n)),
                     inits = list(xi = rep(1, n), conc0 = 1, muTilde = 1:n, mu0 = 0, s0 = 1))
conf <- configureMCMC(model)
mcmc <- buildMCMC(conf)
cmodel <- compileNimble(model)
cmcmc <- compileNimble(mcmc, project = model)

paciorek · 2019-01-04T01:34:51Z

Also, @danielturek just flagging that this is now multiple times that I've revised configureMCMC such that I ignore your work on samplerAssignmentRules. I wanted to mention that as I'm not sure where that stands and want to make sure I am not messing anything up.

danielturek · 2019-01-07T17:13:32Z

@paciorek Thanks for flagging this for review.

I've taken a pretty careful look over it, and the approach looks sound. There's only one thing I wasn't sure is correct. Could a model have more than one dCRP node, so that the code you added for clusterNodeInfo, etc., would be invoked twice? In that case, since the additional code for modifying the samplers to "wrap" these (beginning on line 271 of MCMC_configuration.R) appears outside of the main loop beginning on line 201, it would only catch one of the occurrences of the dCRP node, right? clusterNodeInfo and depNode would have been overwritten by then. Is that right? Is this situation possible?

If it's right, then I understand why the "wrapping" code must appear later, to remove the samplers from all the clusterNodes. But it could still easily be fixed by maintaining clusterNodeInfo as a list of lists, where each element contains the clusterNodeInfo of one dCRP node.
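A minimal sketch of the list-of-lists suggestion in plain R, under stated assumptions: the second dCRP node `zeta[1:20]`, its cluster variable `thetaTilde`, and the helper `find_cluster_info` are made up for illustration and do not come from MCMC_configuration.R. The point is only that each dCRP node gets its own entry, so nothing is overwritten before the later wrapping step.

```r
# Hypothetical sketch: keep one clusterNodeInfo entry per dCRP node
# instead of overwriting a single object inside the main loop.
dcrp_nodes <- c("xi[1:50]", "zeta[1:20]")   # made-up node names

# Stand-in for whatever logic determines the cluster nodes of one dCRP node.
find_cluster_info <- function(node) {
    if (node == "xi[1:50]") {
        list(clusterVars  = "muTilde",
             clusterNodes = list(paste0("muTilde[", 1:50, "]")))
    } else {
        list(clusterVars  = "thetaTilde",
             clusterNodes = list(paste0("thetaTilde[", 1:20, "]")))
    }
}

# Main loop: store the info under the dCRP node's name.
clusterNodeInfo <- list()
for (node in dcrp_nodes) {
    clusterNodeInfo[[node]] <- find_cluster_info(node)
}

# Later, outside the loop, the cluster nodes of every dCRP node are still available.
all_cluster_nodes <- unlist(
    lapply(clusterNodeInfo, function(info) info$clusterNodes),
    use.names = FALSE
)
```

With this layout the wrapping code can iterate over `clusterNodeInfo` and treat each dCRP node separately.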

Of course, please disregard this if I'm missing some part of the big picture, and this situation isn't possible.

Thanks for the heads up about not maintaining samplerAssignmentRules. That's fine. If that system ever comes into use, I would need to bring it up-to-date anyway.

I pushed one commit to this PR, only with stylistic changes.

paciorek · 2019-01-07T18:41:24Z

Thanks, @danielturek. I think you are right that we need to handle the possibility of two dCRP nodes. I will fix that when I come back to this (hopefully soon).

* added case for 5-dimensional nimArray to NimArr.h * adding missing "int" to 5d nimArray implementation * updated NimArrBase.h to handle up to 5-dimensional nimArrays * added a test for 5-dimensional nimArrays * fixed type 4 becomes 5 in NimArr.h * added a failing 5d nimArray test to blockTests in test-coreR.R * Filled in tests. Fixed two identified and one new bug.

paciorek · 2019-02-09T21:52:03Z

OK, I am fixing the issue with the possibility of having two dCRP nodes in one model, and also making use of nimbleFunctionList in the wrapped sampler in case one model has two types of wrapped samplers. For reasons not worth going into, I'm removing this branch and replacing it with nosample_empty_clusters2.

danielturek and others added 19 commits November 21, 2018 15:38

added warning msg about liu-west

a3e0826

added cautionary notes about WAIC and what is theta; (#800)

2c4b967

also fixed bug where $waic not $WAIC used in demo code

grammar edits to Liu-West msgs to trigger travis

59ceb72

edit comment to trigger Travis

3141c1e

User manual entry for clearCompiled

b3ee1f3

change version num to 0.6.13 (#806)

643e093

minor manual change

4c02998

fixed typo in AF_slice roxygen

3c92c99

fix dimension handling in matrix2VecNimArr (#813)

0ce1a38

* fix dimension handling in matrix2VecNimArr * added test. removed C++ diagnostic code. Supported only matrix input.

added inherits=FALSE to exists where seems safe (#825)

bba275b

added warning about multiply-defined nodes in genExpandedNode... (#822)

74a52f0

* added warning about multiply-defined nodes in genExpandedNode... * fix check of redefined nodes for temporary unknownIndex declarations

updated initializeModel to avoid warnings from dependent determinisit…

829731f

…ic nodes (#828)

avoid flag multiply defined nodes when node is lifted from RHS (#826)

9531d29

* avoid flag multiply defined nodes when node is lifted from RHS * fix car_proper test model and omit stray browser

full draft of avoiding sampling cluster parameters

26cdb28

for empty clusters

cleanup printSamplers to avoid all deps of CRP_cluster_wrapper

9d08fdd

paciorek requested a review from danielturek January 3, 2019 01:50

stylisitic changes only

5adc555

danielturek and others added 5 commits January 8, 2019 16:43

add ddexp/dlaplace distribution (#840)

1d4eb32

* added inherits=FALSE to exists where seems safe * full ddexp with tests * align parameterization of ddexp with Gelman/BUGS; fix tests; add conjugacy

updated NEWS for 0.6.13

381e1a1

add location,var alt for ddexp;

585360f

fix incorrect param names in manual tables for ddexp

fix typo

e9351f5

paciorek and others added 26 commits January 24, 2019 07:41

add DOI

cb6c877

minor updates to REL_INST

c2d2ae0

fix CITATION syntax

306bf94

make sampler_categorical and sampler_binary separate logProb prior fr…

bd68b43

…om dependencies

Revert "make sampler_categorical and sampler_binary separate logProb …

4fac540

…prior from dependencies" This reverts commit bd68b43.

Merge branch 'devel' of https://github.com/nimble-dev/nimble into devel

017c758

make sampler_categorical and sampler_binary use a prior/dependencies …

70882c5

…split in calculations (#845) * make sampler_categorical and sampler_binary use a prior/dependencies split in calculations * fix underflow in sampler_categorical

Merge branch 'devel' of https://github.com/nimble-dev/nimble into devel

af7918f

update NEWS

2251bc1

update REL_INST

aa2ffd7

update REL_INST

6101423

change initializeModel approach to be faster (#849)

7bb7b51

* change initializeModel approach to be faster * speed up by using maps objects more directly

fix some PROTECT issues (#850)

35ef332

fix log(2) issue for Solaris

ecd3eec

minor cleanup for 0.7.0

294e013

new Rd files for 0.6.13/0.7.0

042f02f

updated manual for 0.7.0

815bf56

change to 0.7.0 number in html manual

b582953

update README for 0.7.0

0276ce3

update version number

87a2da0

update REL_INST

4a16e4d

minor change to REL_INST

43dfb66

fix minor issue in test-mcmc

438607a

handle multiple dcrp nodes in functionality for

9d749ef

not sampling empty clusters

dirty merge

4be8fbb

Revert "handle multiple dcrp nodes in functionality for"

71880be

This reverts commit 9d749ef.

paciorek closed this Feb 9, 2019

paciorek mentioned this pull request Feb 9, 2019

avoid sampling parameters of empty clusters in BNP #855

Merged

paciorek deleted the nosample_empty_clusters branch February 23, 2019 16:31

paciorek wants to merge 64 commits into BNP from nosample_empty_clusters

3 participants
