access to iteration number and chain id within model #1166

bob-carpenter · 2014-12-10T18:01:44Z

Sean O'Riordain suggested on stan-users that it would be useful to be able to access the iteration number in a Stan program so that you could write something like this to print theta every 100th iteration:

if (mod(iteration.count,100) == 0) print(theta);

It would also be nice to get the chain_id in case things are running in parallel.

This brings up the issue of what the values should be when, for instance, we're running diagnostics, just evaluating the log probability function, running optimization, etc. Maybe just -1 values everywhere?

There is also the issue of which blocks this should work in. Should prints in the transformed data block get a value of iteration=0 as the value? The chain_id would still work.

The text was updated successfully, but these errors were encountered:

betanalpha · 2014-12-10T18:04:25Z

The way the models are abstracted from the sampling code I
can’t think of any way to get at this information any time log_prob
is called.

On Dec 10, 2014, at 1:01 PM, Bob Carpenter notifications@github.com wrote:

Sean O'Riordain suggested on stan-users that it would be useful to be able to access the iteration number in a Stan program so that you could write something like this to print theta every 100th iteration:

if (mod(iteration.count,100) == 0) print(theta);
It would also be nice to get the chain_id in case things are running in parallel.

This brings up the issue of what the values should be when, for instance, we're running diagnostics, just evaluating the log probability function, running optimization, etc. Maybe just -1 values everywhere?

There is also the issue of which blocks this should work in. Should prints in the transformed data block get a value of iteration=0 as the value? The chain_id would still work.

—
Reply to this email directly or view it on GitHub.

bob-carpenter · 2014-12-10T18:07:50Z

It would need to be a separate argument passed through.

I'm torn, because on the one hand, I agree that it'd be useful
to have on the model, but on the other hand, I agree it breaks the
standalone log density abstraction.

Given that it would also complicate the code, which is always
a big negative, I haven't brought this up before (though I've thought
about it many times for exactly the same reason that Sean O'Riordain
brought up).

Bob

On Dec 10, 2014, at 1:04 PM, Michael Betancourt notifications@github.com wrote:

The way the models are abstracted from the sampling code I
can’t think of any way to get at this information any time log_prob
is called.

On Dec 10, 2014, at 1:01 PM, Bob Carpenter notifications@github.com wrote:

Sean O'Riordain suggested on stan-users that it would be useful to be able to access the iteration number in a Stan program so that you could write something like this to print theta every 100th iteration:

if (mod(iteration.count,100) == 0) print(theta);
It would also be nice to get the chain_id in case things are running in parallel.

This brings up the issue of what the values should be when, for instance, we're running diagnostics, just evaluating the log probability function, running optimization, etc. Maybe just -1 values everywhere?

There is also the issue of which blocks this should work in. Should prints in the transformed data block get a value of iteration=0 as the value? The chain_id would still work.

—
Reply to this email directly or view it on GitHub.

—
Reply to this email directly or view it on GitHub.

bob-carpenter · 2015-02-07T17:30:32Z

If we did do this, we'd probably want to do it via functions:

  int meta_iteration_num();
  int meta_chain_id();
  int meta_is_warming_up();
  int meta_is_sampling();

I'd want to print a warning with any use of such a meta function at the very least. And probably call it something other than "meta", which tends to get overused.

bob-carpenter · 2015-02-09T22:08:18Z

Joshua N Pritikin points out on stan-dev:

Maybe a mcmc_ prefit would be more descriptive since Stan also does
BGFS-esque optimization.

bob-carpenter · 2015-08-04T05:36:27Z

If iteration number is really just for print control every N-th iteration, we could control that from the outside by passing in a NULL ostream.

And maybe we should always print the chain ID before any printed output?

peleschramm · 2017-05-20T01:35:28Z

I would also like access to iteration number, but for hacking purposes (some Simulated Annealing may be possible this way, for example).

For just printing values, the solution I use is to have Stan periodically update the output csv file (including warmup), and run a script that periodically reads the csv file and plots the samples for whatever I'm interested in. That way you can view the entire trajectory as stan samples. If using matlabstan, the function "mstan.read_stan_csv" is helpful for this.

bob-carpenter · 2017-05-20T19:05:53Z

I just looked back through this issue. We've gone back and forth on whether we should allow this.

@betanalpha and I don't like it because it breaks the nice abstraction of an instantiated model as an immutable log density function. Making things immutable is powerful for reasoning about program behavior and writing correct code.

We could break this abstraction by either (1) storing iteration and chain number in special mutable variables, or (2) treating them like data and allowing data to be non-constant (mutable) in general. I'm inclined toward the latter because we may want to do this for data streaming or parallel algorithms like stochastic gradient descent for optimization (penalized MLE) or variational inference (VB) or data-parallel expectation propagation (EP) or gradient-based marginal optimization (penalized MML).

syclik · 2017-05-20T23:26:40Z

Add one more vote for no iteration number within the language. If you're really inclined, there are ways to use that information from an algorithm written in C++. You might also be able to wrap the generated model class with another class that has a mutable iteration number. If you're able to demonstrate a real need for that in the language with a model and an algorithm that utilizes it, I'd reconsider. But burden of proof is on a prototype and a real use case before we break immutability.

…

On May 20, 2017, at 3:05 PM, Bob Carpenter ***@***.***> wrote: I just looked back through this issue. We've gone back and forth on whether we should allow this. @betanalpha and I don't like it because it breaks the nice abstraction of an instantiated model as an immutable log density function. Making things immutable is powerful for reasoning about program behavior and writing correct code. We could break this abstraction by either (1) storing iteration and chain number in special mutable variables, or (2) treating them like data and allowing data to be non-constant (mutable) in general. I'm inclined toward the latter because we may want to do this for data streaming or parallel algorithms like stochastic gradient descent for optimization (penalized MLE) or variational inference (VB) or data-parallel expectation propagation (EP) or gradient-based marginal optimization (penalized MML). — You are receiving this because you are subscribed to this thread. Reply to this email directly, view it on GitHub, or mute the thread.

bob-carpenter · 2017-05-22T16:08:02Z

On May 20, 2017, at 7:26 PM, Daniel Lee ***@***.***> wrote: Add one more vote for no iteration number within the language. If you're really inclined, there are ways to use that information from an algorithm written in C++.

You can manipulate the entire log density at this point, but can't get inside it.

You might also be able to wrap the generated model class with another class that has a mutable iteration number.

Same problem---won't be able to access it in the language unless you hack a function into Stan which returns a static that the algorithm sets.

If you're able to demonstrate a real need

The first suggested usage in this issue is printing: if (iteration() % 100 == 0) print(...); The second suggestion was to do some kind of annealing; I could see that wanting to do different things to the likelihood and prior in a Bayesian setting.

for that in the language with a model and an algorithm that utilizes it, I'd reconsider. But burden of proof is on a prototype and a real use case before we break immutability.

The problem with requesting a prototype is that this is a user-level request, but building a prototype is a developer-level problem. There's way to build a prototype with user-facing tools we have now.

syclik · 2017-05-22T16:22:54Z

Good points. Especially about the prototype and it's a developer-level problem. If it's just about printing, it seems like a lot of effort. Probably worth it in the long run to figure it out, but it's still a lot of effort. The second use case was simulated annealing and controlling a temperature parameter based on iteration? Maybe a little more thought into how that's written out in the language, what sorts of things are propagated as data / parameters or double / vars would help. On Mon, May 22, 2017 at 12:08 PM, Bob Carpenter <notifications@github.com> wrote:

…

> On May 20, 2017, at 7:26 PM, Daniel Lee ***@***.***> wrote: > > Add one more vote for no iteration number within the language. > > If you're really inclined, there are ways to use that information from an algorithm written in C++. You can manipulate the entire log density at this point, but can't get inside it. > You might also be able to wrap the generated model class with another class that has a mutable iteration number. Same problem---won't be able to access it in the language unless you hack a function into Stan which returns a static that the algorithm sets. > If you're able to demonstrate a real need The first suggested usage in this issue is printing: if (iteration() % 100 == 0) print(...); The second suggestion was to do some kind of annealing; I could see that wanting to do different things to the likelihood and prior in a Bayesian setting. > for that in the language with a model and an algorithm that utilizes it, I'd reconsider. But burden of proof is on a prototype and a real use case before we break immutability. The problem with requesting a prototype is that this is a user-level request, but building a prototype is a developer-level problem. There's way to build a prototype with user-facing tools we have now. — You are receiving this because you commented. Reply to this email directly, view it on GitHub <#1166 (comment)>, or mute the thread <https://github.com/notifications/unsubscribe-auth/AAZ_F1mS45flCLY1LX06dXIVO57d2tofks5r8bLigaJpZM4DGxC2> .

betanalpha · 2017-05-22T16:30:05Z

I am unconvinced on printing (what does striped printing by you? You can always manipulate the output directly in any of the environments). But I am doubly unconvinced on the algorithm side. The issue is that all of these ideas are trying to break the abstraction of the Stan Modeling Language specifying a function proportional to the posterior density and nothing more. Because there is a fundamental prior/likelihood separation, for example, any algorithm that relies on that separation would be an awkward hack. Stan was not built for algorithm development of this sort. If we wanted to support it then we would have to allow users to specify prior and likelihood separately and then expose various algorithmic manipulations like optimize and Markov transition. I’m not saying that we shouldn’t do that (okay, I would but that’s an orthogonal conversation) just that breaking out current abstraction to make an infinitesimal and fragile step towards that is a bad idea.

…

On May 22, 2017, at 9:22 AM, Daniel Lee ***@***.***> wrote: Good points. Especially about the prototype and it's a developer-level problem. If it's just about printing, it seems like a lot of effort. Probably worth it in the long run to figure it out, but it's still a lot of effort. The second use case was simulated annealing and controlling a temperature parameter based on iteration? Maybe a little more thought into how that's written out in the language, what sorts of things are propagated as data / parameters or double / vars would help. On Mon, May 22, 2017 at 12:08 PM, Bob Carpenter ***@***.***> wrote: > > > On May 20, 2017, at 7:26 PM, Daniel Lee ***@***.***> > wrote: > > > > Add one more vote for no iteration number within the language. > > > > If you're really inclined, there are ways to use that information from > an algorithm written in C++. > > You can manipulate the entire log density at this point, but can't > get inside it. > > > You might also be able to wrap the generated model class with another > class that has a mutable iteration number. > > Same problem---won't be able to access it in the language unless > you hack a function into Stan which returns a static that the > algorithm sets. > > > If you're able to demonstrate a real need > > The first suggested usage in this issue is printing: > > if (iteration() % 100 == 0) print(...); > > The second suggestion was to do some kind of annealing; I could > see that wanting to do different things to the likelihood and prior in > a Bayesian setting. > > > for that in the language with a model and an algorithm that utilizes it, > I'd reconsider. But burden of proof is on a prototype and a real use case > before we break immutability. > > > The problem with requesting a prototype is that this is a user-level > request, but building > a prototype is a developer-level problem. There's way to build a prototype > with user-facing tools we have now. > > — > You are receiving this because you commented. > Reply to this email directly, view it on GitHub > <#1166 (comment)>, or mute > the thread > <https://github.com/notifications/unsubscribe-auth/AAZ_F1mS45flCLY1LX06dXIVO57d2tofks5r8bLigaJpZM4DGxC2> > . > — You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub <#1166 (comment)>, or mute the thread <https://github.com/notifications/unsubscribe-auth/ABdNlg--tY28kpsjw9xyrsJi8ey3g6lHks5r8bZggaJpZM4DGxC2>.

bob-carpenter added feature language labels Dec 10, 2014

bob-carpenter added this to the Future milestone Dec 10, 2014

bob-carpenter mentioned this issue Nov 24, 2016

print_progress() should input / output the chain ID #1348

Closed

alashworth mentioned this issue Mar 12, 2019

access to iteration number and chain id within model alashworth/test-issue-import#50

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

access to iteration number and chain id within model #1166

access to iteration number and chain id within model #1166

bob-carpenter commented Dec 10, 2014

betanalpha commented Dec 10, 2014

bob-carpenter commented Dec 10, 2014

bob-carpenter commented Feb 7, 2015

bob-carpenter commented Feb 9, 2015

bob-carpenter commented Aug 4, 2015

peleschramm commented May 20, 2017 •

edited

bob-carpenter commented May 20, 2017

syclik commented May 20, 2017 via email

bob-carpenter commented May 22, 2017 via email

syclik commented May 22, 2017 via email

betanalpha commented May 22, 2017 via email

access to iteration number and chain id within model #1166

access to iteration number and chain id within model #1166

Comments

bob-carpenter commented Dec 10, 2014

betanalpha commented Dec 10, 2014

bob-carpenter commented Dec 10, 2014

bob-carpenter commented Feb 7, 2015

bob-carpenter commented Feb 9, 2015

bob-carpenter commented Aug 4, 2015

peleschramm commented May 20, 2017 • edited

bob-carpenter commented May 20, 2017

syclik commented May 20, 2017 via email

bob-carpenter commented May 22, 2017 via email

syclik commented May 22, 2017 via email

betanalpha commented May 22, 2017 via email

peleschramm commented May 20, 2017 •

edited