DIC with multiple chains #648

jseabold · 2014-11-13T18:13:30Z

The way the deviance information criterion (DIC) code is written in 2.x right now, IIUC only the last chain is used, because the default argument for nchains in the Trace objects is -1. Is this intentional? I guess in practice it may not end up mattering much if you're reasonably happy with the sampling from the posterior in the last chain, but if I'm running multiple chains from overdispersed starting values, wouldn't I want to compute the DIC from the traces of all the chains?

fonnesbeck · 2014-11-13T18:27:18Z

It was intentional, but I'm happy to rethink it. I figured one chain would be enough to get a reasonable DIC estimate, and if it varied a lot from chain to chain, then you would not be happy with the set of non-converged samples anyhow, so you wouldn't need DIC.

At present it is a property, but we could make it a function that takes a model and a chain index as arguments:

dic_all_chains = pymc.dic(my_model, chain=None)
dic_current_chain = pymc.dic(my_model, chain=-1)
dic_other_chain = pymc.dic(my_model, chain=2)

Would that be a better implementation?
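For readers who want to see what the `chain` argument would change, here is a minimal sketch in plain Python rather than PyMC code. It uses the standard definition DIC = 2 * mean(D) - D(theta_bar), where D is the deviance and theta_bar is the posterior mean of the parameter. The names `dic`, `select_chains`, and `normal_deviance`, the single scalar parameter, and the data are all assumptions made for illustration; none of this is the PyMC 2 API.

```python
import math
from statistics import fmean

Y = [0.0, 1.0, 2.0]  # made-up observations, for illustration only


def normal_deviance(mu, data=Y, sigma=1.0):
    """Deviance (-2 * log-likelihood) of a normal model with known sigma."""
    n = len(data)
    sq = sum((y - mu) ** 2 for y in data)
    return sq / sigma ** 2 + n * math.log(2 * math.pi * sigma ** 2)


def select_chains(chains, chain=None):
    """Return every chain if chain is None, else only the indexed chain."""
    if chain is None:
        return list(chains)
    return [chains[chain]]


def dic(param_chains, deviance_fn, chain=None):
    """Hypothetical DIC helper: pool the selected chains, then compute
    2 * (mean deviance) - (deviance at the posterior mean)."""
    samples = [theta for c in select_chains(param_chains, chain) for theta in c]
    mean_deviance = fmean(deviance_fn(theta) for theta in samples)
    theta_bar = fmean(samples)
    return 2 * mean_deviance - deviance_fn(theta_bar)


# Two short, made-up chains of sampled values for mu.
chains = [[0.0, 2.0], [1.0, 1.0]]
dic_all_chains = dic(chains, normal_deviance, chain=None)
dic_last_chain = dic(chains, normal_deviance, chain=-1)
```

In this toy case the pooled value differs from the last-chain value because the first chain is more spread out. With well-mixed chains the two would be expected to be close.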

jseabold · 2014-11-13T18:49:50Z

Thanks. I agree that the current implementation is pretty reasonable given what you say. I don't have a sense of whether there would be any real difference in the computed value given that you're happy with convergence. I've sometimes seen preference given in the literature to one model over another based on pretty small (subjectively) differences in DIC, but that's a methodological issue.

A maybe more important sub-question: what about also adding a start/burn keyword? Maybe my workflow needs some changing, but I've been running some pretty time-consuming models and then deciding on burn-in afterwards. For the most part this is accommodated, but the DIC is an exception to this.

fonnesbeck · 2014-11-13T19:41:39Z

PyMC 2 is very much geared towards a burn-at-sampling workflow, given the burn and thin arguments for sample (PyMC 3, however, follows the workflow you suggest). I am usually conservative and throw away 80-90% of my samples at sampling time, so I don't have to worry about it later. Implemented as a property, it obviously can't take arguments, but we can implement a function or method if that helps.
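As a rough illustration of the post-sampling workflow discussed here, the sketch below applies burn-in and thinning to chains that are already stored and then computes DIC from what is left. It is plain Python under stated assumptions: `trim`, `dic_after_burn`, and `sum_sq_deviance` are hypothetical names, the model has a single scalar parameter, and the deviance drops its additive constant. It is not the PyMC 2 API and not the solution mentioned later in the thread.

```python
from statistics import fmean

Y = [0.0, 1.0, 2.0]  # made-up observations, for illustration only


def sum_sq_deviance(mu, data=Y):
    """Normal deviance with sigma = 1, additive constant dropped."""
    return sum((y - mu) ** 2 for y in data)


def trim(chain, burn=0, thin=1):
    """Drop the first `burn` samples, then keep every `thin`-th one."""
    return chain[burn::thin]


def dic_after_burn(param_chains, deviance_fn, burn=0, thin=1):
    """Hypothetical helper: trim each chain, pool what is left, and return
    2 * (mean deviance) - (deviance at the posterior mean)."""
    kept = [theta for c in param_chains for theta in trim(c, burn, thin)]
    if not kept:
        raise ValueError("burn/thin removed every sample")
    mean_deviance = fmean(deviance_fn(theta) for theta in kept)
    return 2 * mean_deviance - deviance_fn(fmean(kept))


# Each made-up chain starts far from the bulk of the posterior.
chains = [[10.0, 0.0, 2.0], [-10.0, 1.0, 1.0]]
dic_raw = dic_after_burn(chains, sum_sq_deviance)              # keeps the start values
dic_trimmed = dic_after_burn(chains, sum_sq_deviance, burn=1)  # drops them
```

Because the trimming happens on stored samples, the burn-in can be chosen after inspecting the traces, without rerunning the sampler.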

jseabold · 2014-11-13T19:48:52Z

I had that impression but surprisingly found that for the most part I could get by with post-sampling adjustments. I just rolled my own as a solution. Up to you whether you want to add the convenience function. Feel free to close this as you see fit.

I only tried briefly to install pymc 3. Will probably switch to it after I wrap up this project and continue to do more statistics by simulation.

twiecki · 2015-02-24T07:22:39Z

Should this issue be moved to pymc?

jseabold · 2015-02-24T14:25:23Z

Splitting up the repos? 👍

fonnesbeck · 2015-02-26T03:39:56Z

Moving this over to #2

sammosummo · 2016-04-15T20:44:59Z

@jseabold What was your solution?

twiecki added the v.2 label Feb 24, 2015

fonnesbeck closed this as completed Feb 26, 2015

fonnesbeck mentioned this issue Feb 26, 2015

Calculate DIC based on all chains pymc-devs/pymc2#2

Closed
