-
-
Notifications
You must be signed in to change notification settings - Fork 366
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
sampler testing framework #318
Comments
We have to be careful because, by construction, MCMC is stochastic and not exactly amenable to unit tests Mean/variance estimation: Assuming a Monte Carlo CLT we'll still have to worry about the expected randomness. Running an ensemble That said, iid gaussian and a correlated gaussian are natural first tests. Adaptation: Some distributions undercut the usual optimization criteria that we use for adaptation. Hierarchical models like The interaction between the distributions and adaptation would require sampler-specific tests, not happy generic tests. Speed regression tests: Depends on the machine running the tests, so we can't just define definite thresholds. Is it possible to build up the |
On 10/23/13 4:45 PM, Michael Betancourt wrote:
Right. That's why, for example, the RNG tests that Peter This is similar to what Andrew calls the "Cook-Gelman-Rubin" approach.
Right. That's what we're doing for the RNGs, but those
We mostly want to have tests in place to make sure we didn't mess
We already have tests that vary configuration (e.g, for number
I don't see why not. Daniel's a wizard with Jenkins. For the foreseeable future, the machine running the tests will And we can test on our own machines.
|
It's not a matter of varying the parameters but figuring out how they need to be varied. Just warning that because of these On Oct 23, 2013, at 10:22 PM, Bob Carpenter notifications@github.com wrote:
|
I'm ok with individually-tuned tests for each sampler. |
Testing framework proposed in https://github.com/stan-dev/stan/tree/feature/stat_valid_test -- currently needs to be updated so that the tests can be run without depending on CmdStan. |
@bob-carpenter, this is what we were talking about doing. This will depend on #1751, so I'll branch from there as I start working. |
We can't get new patches into samplers because there aren't any reliable tests.
We need tests for the samplers for
We also want to test things that Michael has suggested for HMC like
We have to make all these sensitive to the fact that we have MCMC.
The text was updated successfully, but these errors were encountered: