Added argument to initialize Euclidean metric for samplers by bbbales2 · Pull Request #576 · stan-dev/cmdstan

bbbales2 · 2017-10-14T22:43:56Z

Submission Checklist

Run tests: ./runCmdStanTests.py src/test
Declare copyright holder and open-source license: see below

Summary:

This is the CmdStan interface component of: stan-dev/stan#2260

This should allow initialization of Euclidean metric for samplers with an R dump file

Questions:

Where should the metric_file argument go? I think it makes sense under "init" but I didn't put it there cause that would mean breaking a bunch of other stuff (that depends on passing in init=whatever arguments). I just made it a high level argument for now.
I only added check-that-it's-working tests. Lemme know if I should add check-if-it's-failing tests. I figured the Stan tests should handle most of that though (I didn't really add any functionality beyond the interface)
Should the unit_e samplers allow initialization of their Euclidean metrics? The interface isn't there for them.
Nomenclature-wise, are we setting the metric? Or are we initializing the Euclidean metric? Or are we setting a Euclidean metric? Or what is the verbiage equivalent to "setting the mass matrix"?

How to Verify:

./runCmdStanTests.py src/test/interface/metric_test.cpp

Side Effects:

Documentation:

I still need to do it

Reviewer Suggestions:

@mitzimorris or @sakrejda, whoever feels inclined

Copyright and Licensing

Please list the copyright holder for the work you are submitting (this will be you or your assignee, such as a university or company): University of California, Santa Barbara

By submitting this pull request, the copyright holder is agreeing to license the submitted work under the following licenses:

Code: BSD 3-clause (https://opensource.org/licenses/BSD-3-Clause)
Documentation: CC-BY 4.0 (https://creativecommons.org/licenses/by/4.0/)

mitzimorris · 2017-10-15T22:12:24Z

the challenge in adding this, as you've discovered, is the way that the command.hpp code sets up the cascade of dependent arguments. if you look at the current cmdstan manual (which needs to be updated as part of the PR), the set of arguments for which passing in a metric file would be valid are the ones for "sample algorithm = hmc":

method = sample (Default)
        sample
          num_samples = 1000 (Default)
          num_warmup = 1000 (Default)
          save_warmup = 0 (Default)
          thin = 1 (Default)
          adapt
            engaged = 1 (Default)
            gamma = 0.050000000000000003 (Default)
            delta = 0.80000000000000004 (Default)
            kappa = 0.75 (Default)
            t0 = 10 (Default)
            init_buffer = 75 (Default)
            term_buffer = 50 (Default)
            window = 25 (Default)
          algorithm = hmc (Default)
            hmc
              engine = nuts (Default)
                nuts
                  max_depth = 10 (Default)
              metric = diag_e (Default)
              stepsize = 1 (Default)
              stepsize_jitter = 0 (Default)

given this, you could add argument "metric_file" as a subargument to argument "hmc" - you should be able to check that when the metric is "diag_e" the metric file is diagonal, and when the metric is "dense_e" the metric file is dense, and then you should be able to make this feature more general.
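The shape check suggested above can be sketched as follows. This is an illustrative sketch only, not the CmdStan implementation: the function name `check_inv_metric_shape` and its arguments are hypothetical, and rejecting every metric other than `diag_e` and `dense_e` is an assumption based on the `unit_e` question in the PR description.

```python
def check_inv_metric_shape(metric, dims, num_params):
    """Return None if `dims` fits `metric`, otherwise an error message.

    Hypothetical helper: `dims` stands for the .Dim attribute read from
    the metric file, `num_params` for the number of model parameters.
    """
    if metric == "diag_e":
        expected = (num_params,)             # vector of diagonal elements
    elif metric == "dense_e":
        expected = (num_params, num_params)  # full square matrix
    else:
        # Assumption: other metrics (e.g. unit_e) take no metric file.
        return f"metric '{metric}' does not take a metric file"
    if tuple(dims) != expected:
        return f"metric '{metric}' expects .Dim={expected}, got {tuple(dims)}"
    return None


print(check_inv_metric_shape("diag_e", [3], 3))   # shape matches
print(check_inv_metric_shape("dense_e", [3], 3))  # shape mismatch
```

Returning a message instead of raising keeps the sketch easy to plug into whatever error reporting the argument parser already uses.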

bbbales2 · 2017-10-15T22:34:13Z

valid are only "sample algorithm = hmc"

Hmm, that does make sense.

which needs to be updated as part of the PR

Will do. Was just being lazy.

bbbales2 · 2017-10-21T15:39:12Z

This is ready for review

sakrejda · 2017-10-23T03:24:19Z

I read through this but I'd like to do one more pass before calling it good. I'll do that tomorrow.

sakrejda

Only comment I had was maybe say in the doc that metric is used as a starting point for adaptation when both are specified (assuming this is what happens?). Looks good otherwise.

sakrejda · 2017-10-23T18:02:32Z

src/docs/cmdstan-guide/running.tex

+metric, \code{inv\_metric} should be a positive-definite square matrix with
+number of rows and columns equal to the number of parameters in the model.
+The file pointed at by \code{metric\_file} should have the same format as
+the input data. This option can be used with and without adaptation enabled.


It would be helpful to add what happens if it's used with adaptation enabled.
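For the dense case, the requirement in the doc text above (a square, positive-definite matrix) can be checked before a run. The sketch below uses only the standard library and attempts a Cholesky factorization; `is_positive_definite` and its tolerance are hypothetical names chosen for illustration, not part of Stan or CmdStan.

```python
import math


def is_positive_definite(matrix, tol=1e-12):
    """Return True if `matrix` (list of rows) is square, symmetric and
    positive-definite, tested by attempting a Cholesky factorization."""
    n = len(matrix)
    if n == 0 or any(len(row) != n for row in matrix):
        return False  # empty or not square
    for i in range(n):
        for j in range(i):
            if abs(matrix[i][j] - matrix[j][i]) > tol:
                return False  # not symmetric
    lower = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + 1):
            partial = sum(lower[i][k] * lower[j][k] for k in range(j))
            if i == j:
                pivot = matrix[i][i] - partial
                if pivot <= tol:
                    return False  # non-positive pivot: not positive-definite
                lower[i][i] = math.sqrt(pivot)
            else:
                lower[i][j] = (matrix[i][j] - partial) / lower[j][j]
    return True


print(is_positive_definite([[2.0, 1.0], [1.0, 2.0]]))
print(is_positive_definite([[1.0, 2.0], [2.0, 1.0]]))
```

For a diagonal metric the same condition reduces to every element being strictly positive.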

bbbales2 · 2017-10-23T20:18:07Z

Only comment I had was maybe say in the doc that metric is used as a starting point for adaptation when both are specified (assuming this is what happens?)

Good point. I ran methods with adaptation and they didn't fail, but I need to make sure the adaptation isn't simply ignoring the provided initial matrix and re-adapting a new one. I'll check that and update the doc to make it clear.

mitzimorris · 2017-10-24T20:27:55Z

now that we've got this all plumbed through, I'm still trying to make sense of the use cases for this feature:

(1) the model is fully converged, and we want to run the fitted model to generate more samples. in this case, the additional param adapt engaged=0 will stop adaptation and go straight to sampling, and the fully consistent config requires:

random seed=<seed>
metric_file=<filename>
stepsize=<stepsize>

(2) the model is taking a long time to converge, and we want to checkpoint where it's at and restart.

running tests for case (2), I'm not sure adaptation is respecting stepsize.

here's my test metric file - same as used for stan unit test:

inv_metric <- structure(c(0.787405, 0.884987, 1.19869), .Dim=c(3))
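A metric file in this R dump form can be generated from a list of values. The sketch below is a hypothetical helper (`format_inv_metric_rdump` is not part of CmdStan) that reproduces the one-line layout shown above; for a dense matrix the values would need to be listed in column-major order, which is how R's `structure(..., .Dim=...)` fills arrays.

```python
def format_inv_metric_rdump(values, dims):
    """Format `values` as an R dump entry named inv_metric.

    `dims` is the .Dim attribute: [n] for a diagonal metric,
    [n, n] for a dense one (values in column-major order).
    """
    expected = 1
    for d in dims:
        expected *= d
    if expected != len(values):
        raise ValueError(f"expected {expected} values, got {len(values)}")
    body = ", ".join(repr(float(v)) for v in values)
    dim = ", ".join(str(d) for d in dims)
    return f"inv_metric <- structure(c({body}), .Dim=c({dim}))"


line = format_inv_metric_rdump([0.787405, 0.884987, 1.19869], [3])
print(line)
```

Writing the returned string to a file such as `e_diag_3D.R` gives the input used in the command below.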

using model stan/src/test/test-models/good/mcmc/hmc/common/gauss3D.stan, which I copied into the cmdstan directory cmdstan/src/test/test-models, same filename.

trying this command sequence:

./src/test/test-models/gauss3D  random seed=12345 sample num_samples=200 num_warmup=199 algorithm=hmc metric_file=e_diag_3D.R stepsize=0.001
 grep -iA3 "step" output.csv

I set the stepsize smaller and smaller - if I run this with just one iteration, stepsize changes a lot - should it?

mitzimorris · 2017-10-24T20:32:24Z

regarding my previous comment, fix for problems w/r/t stepsize and adaptation should go in at the stan services level - not a cmdstan problem.

bob-carpenter · 2017-10-24T20:35:29Z

Use case (1) and (2), definitely. A third case is external algorithms---Aki needs that for something, I believe.

sakrejda · 2017-10-24T20:38:43Z

Use case 3: 1) Fit the model to data A; 2) pretend like fitting the model to data A + B where B is much much smaller won't change posterior geometry much; 3) fit the model to data A + B without adaptation in parallel since now there's no adaptation to worry about (you could still do multiple chains with different starting points).

mitzimorris · 2017-10-24T20:41:48Z

w/r/t testing - do we have tests for use cases 2 and 3?
I've edited my comment above re usecases w/ my attempts to convince myself that this feature works for use case 2, and I'm not sure it does.

sakrejda · 2017-10-24T21:02:17Z

Shouldn't tests be in stan::services? Or do you mean in general showing that they are worthwhile use cases?

mitzimorris · 2017-10-24T21:46:49Z

yes, tests should be in stan::services.
current tests in stan::services test that sampler initializes itself with pre-specified metric - a unit test at the feature level, not a functional test at the use case level.

bbbales2 · 2017-10-24T21:59:24Z

Only comment I had was maybe say in the doc that metric is used as a starting point for adaptation when both are specified (assuming this is what happens?)

I checked this. The way adaptation works, the provided metric is tossed if adaptation is enabled. So the way I have the docs written now is wrong. If someone provides a metric, adaptation of that metric should be disabled (otherwise it's misleading). I can make these options mutually exclusive.

Is there a way to separately turn off timestep and metric adaptation? For my use case for this I was hoping I could just leave the timestep adaptation to Stan (and just provide the metric).

Looking at the code it seems like either they both happen or neither happen: https://github.com/stan-dev/stan/blob/develop/src/stan/mcmc/hmc/nuts/adapt_diag_e_nuts.hpp#L31

Should we add this in? Or make it so that if someone provides a metric, they're liable for the timestep as well?

Either way, don't merge this pull :P.

mitzimorris · 2017-10-24T22:07:08Z

hmm - if that's what the code is doing, that contradicts what Michael said here:

The stepsize parameter defines the initial step size from which the
algorithm begins. If adaptation is engaged then this is quickly modified,
but it can be helpful to start with a small step size on particularly nasty
problems to facilitate adaptation.

(https://groups.google.com/forum/#!topic/stan-users/O-PNZhzVjTI)

mitzimorris · 2017-10-24T22:16:20Z

OTOH, the current Stan manual says this:

Stan can be configured with a user-specified step size or it can estimate an optimal step size during warmup using dual averaging

mitzimorris · 2017-10-24T22:28:36Z

@bbbales2 - you're misreading the code -

Looking at the code it seems like either they both happen or neither happen: https://github.com/stan-dev/stan/blob/develop/src/stan/mcmc/hmc/nuts/adapt_diag_e_nuts.hpp#L31

the variable named adapt_flag_ gets changed by the sampler during its run.

the stepsize can be set at initialization:

https://github.com/stan-dev/stan/blob/476975dacfc13ff7a1aec4cf23ff0fd11a64caea/src/stan/mcmc/hmc/base_hmc.hpp#L152

mitzimorris · 2017-10-24T23:14:18Z

this PR looks good, and I believe the stan code for the samplers will do the right thing. however, it would be good for the docs to spell out the 3 use cases and appropriate config:
(1) specify stepsize, metric, set "adapt engaged=0". the appropriate "num_samples" is determined by desired precision of your quantity of interest which in turn depends on "N_eff" for that QoI.
(2) specify stepsize, metric and any other non-default settings used in initial run
(3) depends on external algorithm
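The configurations for use cases (1) and (2) can be sketched as a small argument builder. This is a hedged illustration: the argument names (`adapt engaged=0`, `algorithm=hmc`, `metric_file`, `stepsize`, `random seed`) are taken from this thread, while the function `build_sample_args` and the ordering of the arguments are assumptions that have not been checked against the CmdStan parser.

```python
def build_sample_args(model_exe, seed, metric_file, stepsize, adapt_engaged):
    """Build a CmdStan-style argument list for rerunning a model.

    adapt_engaged=False corresponds to use case (1): reuse the step size
    and metric as-is. adapt_engaged=True corresponds to use case (2):
    the step size and metric only initialize adaptation.
    """
    args = [model_exe, "sample"]
    if not adapt_engaged:
        args += ["adapt", "engaged=0"]
    args += ["algorithm=hmc", f"metric_file={metric_file}", f"stepsize={stepsize}"]
    args += ["random", f"seed={seed}"]  # assumed position for the seed argument
    return args


print(" ".join(build_sample_args("./gauss3D", 12345, "e_diag_3D.R", 0.001, False)))
```

The resulting list can be passed to `subprocess.run` unchanged; any other non-default settings from the initial run would be appended in the same way.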

betanalpha · 2017-10-25T01:35:12Z

This is correct: unless you set adapt engaged=0, the step size and metric will be used only to initialize the adaptation routine. If they are good values then they should not change much. This is the desired behavior and especially useful if people want to use this feature to set informed guesses for the inv_metric on new runs, as opposed to just restarting.

mitzimorris

all good - many thanks!

mitzimorris · 2017-10-25T19:09:04Z

doc looks great! very clear.

bbbales2 · 2017-10-25T19:11:34Z

Np thanks for the review

mitzimorris · 2017-12-12T21:43:29Z

#563

Added argument to initialize Euclidean metric for samplers

747a57d

bbbales2 added 3 commits October 18, 2017 20:39

Moved command line arguments and added docs for specifying custom metric

00cb4d4

Changed good value for metric_file argument to ""

67ec5a9

Merge branch 'develop' into feature/set-mass-matrix

9ea3522

sakrejda approved these changes Oct 23, 2017

Added more detail on the interaction between enabling/disabling adaptation and specifying a metric_file

2f89cdf

mitzimorris approved these changes Oct 25, 2017

mitzimorris merged commit 87b0446 into stan-dev:develop Oct 26, 2017
