Explain sample_stats naming convention #1063

nitishp25 · 2020-02-10T18:03:00Z

Description

Checklist

Does the PR follow official PR format?
Is the code style correct (follows pylint and black guidelines)?
Is the change listed in changelog?

nitishp25 · 2020-02-10T18:04:54Z

@OriolAbril I have updated the descriptions and added a few missing variables. Are there any more variables to be added?

doc/schema/schema.md

arviz/data/io_pystan.py

OriolAbril · 2020-02-11T16:39:45Z

I was thinking about it and I think we should either rename it (preferably) or not have it in sample stats. I have two concerns. First, as I understand it, one of the goals of ArviZ is to ease comparison between different inference libraries. We do not use this variable for now, but our users may, and accessing a value in an InferenceData object should not depend on the inference library. Second, we may have a use for it at some point (right now I can't think of anything other than including the acceptance probability warning in our summary, but who knows), and naming differences may block or delay the new feature. This second concern is why I prefer renaming over removing it.

nitishp25 · 2020-02-12T02:48:26Z

I see. I will rename it once you confirm @OriolAbril @ahartikainen

sethaxen · 2020-02-20T20:30:01Z

I also support renaming.

ahartikainen · 2020-02-21T05:51:41Z

For Stan the outputs for sample_stats are

| original name | current name | meaning | source |
| --- | --- | --- | --- |
| lp__ | lp | the log posterior density (up to a constant) ¹ | src1 |
| accept_stat__ | accept_stat | the average acceptance probabilities of all possible samples in the proposed tree. | src2 |
| stepsize__ | stepsize | the step size used by NUTS in its Hamiltonian simulation. | src2 |
| treedepth__ | treedepth | the depth of tree used by NUTS, which is the log (base 2) of the number of leapfrog steps taken during the Hamiltonian simulation. | src2 |
| n_leapfrog__ | n_leapfrog | the actual number of leapfrog steps computed | src3 |
| divergent__ | diverging | the number of leapfrog transitions with diverging error. Because NUTS terminates at the first divergence this will be either 0 or 1 for each iteration. | src2 |
| energy__ | energy | the value of the Hamiltonian (up to an additive constant) at each iteration. | src2 |

¹ The lp__ value also represents the potential energy in the Hamiltonian system and is rate bounded by the randomly supplied kinetic energy each iteration, which follows a Chi-square distribution in the number of parameters.
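For illustration, the current renaming in the table above can be sketched in a few lines. This is only a sketch under assumptions, not the converter code in arviz/data/io_pystan.py: the helper name `rename_stan_stat` is made up here, and only the name pairs themselves come from the table.

```python
# Hypothetical helper; only the name pairs come from the table above.
SPECIAL_CASES = {"divergent__": "diverging"}


def rename_stan_stat(name):
    """Map a Stan sampler output name to the current ArviZ name."""
    if name in SPECIAL_CASES:
        return SPECIAL_CASES[name]
    # Every other entry in the table only loses its trailing "__".
    return name[:-2] if name.endswith("__") else name


stan_names = [
    "lp__", "accept_stat__", "stepsize__", "treedepth__",
    "n_leapfrog__", "divergent__", "energy__",
]
renamed = {name: rename_stan_stat(name) for name in stan_names}
```

Names without the trailing double underscore (for example model parameters) pass through unchanged, so the same function could in principle be applied to every column.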

ahartikainen · 2020-02-21T05:55:58Z

Then there are also other settings; see for example the CmdStan output:

# stan_version_major = 2
# stan_version_minor = 18
# stan_version_patch = 0
# model = eight_schools_nc_model
# method = sample (Default)
#   sample
#     num_samples = 100
#     num_warmup = 1000 (Default)
#     save_warmup = 0 (Default)
#     thin = 1 (Default)
#     adapt
#       engaged = 1 (Default)
#       gamma = 0.050000000000000003 (Default)
#       delta = 0.80000000000000004 (Default)
#       kappa = 0.75 (Default)
#       t0 = 10 (Default)
#       init_buffer = 75 (Default)
#       term_buffer = 50 (Default)
#       window = 25 (Default)
#     algorithm = hmc (Default)
#       hmc
#         engine = nuts (Default)
#           nuts
#             max_depth = 10 (Default)
#         metric = diag_e (Default)
#         metric_file =  (Default)
#         stepsize = 1 (Default)
#         stepsize_jitter = 0 (Default)
# id = 0 (Default)
# data
#   file = eight_schools.data.R
# init = 2 (Default)
# random
#   seed = 779839997
# output
#   file = eight_schools_output1.csv
#   diagnostic_file =  (Default)
#   refresh = 100 (Default)

These are probably needed as well, and for the other libraries too.
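A rough sketch of how such a header could be collected into a dictionary of settings. The function name `parse_cmdstan_header` is hypothetical and this is not how any existing converter does it; it only assumes the `# key = value (Default)` layout shown above.

```python
def parse_cmdstan_header(lines):
    """Collect `# key = value` comment lines into a flat dict of strings."""
    settings = {}
    for line in lines:
        if not line.startswith("#"):
            continue
        body = line.lstrip("#").strip()
        if "=" not in body:
            continue  # section headers such as "sample" or "adapt"
        key, _, value = body.partition("=")
        value = value.strip()
        # Drop the "(Default)" marker appended to default values.
        if value.endswith("(Default)"):
            value = value[: -len("(Default)")].strip()
        settings[key.strip()] = value
    return settings


header = [
    "# model = eight_schools_nc_model",
    "# method = sample (Default)",
    "#   sample",
    "#     num_samples = 100",
    "#       delta = 0.80000000000000004 (Default)",
    "#         metric_file =  (Default)",
]
sampler_settings = parse_cmdstan_header(header)
```

Note that a flat dictionary is too simple for the full header: `file` appears under both `data` and `output`, so a real implementation would need nested or prefixed keys.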

sethaxen · 2020-02-21T06:08:04Z

> Then there are also other settings; see for example the CmdStan output

These would probably go under attributes though, right?

ahartikainen · 2020-02-21T06:10:33Z

Yes, I would put them under attributes.

Maybe under sampler_settings or something similar.

sethaxen · 2020-02-21T06:59:43Z

> Yes, I would put them under attributes.
>
> Maybe under sampler_settings or something similar.

That sounds good. Perhaps that should be a separate issue.

For Turing.jl, we have the following sample stats for HMC:

acceptance_rate: MH stats, i.e. sum of MH accept prob for all leapfrog steps (src)
hamiltonian_energy: value of the Hamiltonian energy for the accepted proposal, to within an additive constant
hamiltonian_energy_error: difference in the Hamiltonian energy between the initial point and the proposed point
is_adapt: boolean, whether the current sample is part of adaptation
max_hamiltonian_energy_error: energy in tree with largest absolute difference from initial energy (src)
n_steps: total number of leapfrog steps, i.e. phase points in a trajectory (src)
numerical_error: termination due to large energy deviation from starting (possibly numerical errors) (src)
lp/log_density: log probability to within an additive constant
tree_depth: the number of tree doublings in the balanced binary tree
step_size: current integration step size (src)
nom_step_size: the nominal integration step size. The current integration step size may differ from this, for example if the step size is jittered. The nominal step size is usually used in adaptation. (src)

For SMC:

le: The log evidence retrieved from the particle
weight: The weight of the particle the sample was retrieved from.

Others that are supposedly parameters but I've never seen used:

elapsed
eval_num
lf_eps

OriolAbril · 2020-02-21T18:16:16Z

I'll try to summarize the information gathered grouping equivalent sampler stats from different libraries. Descriptions and sources should still be checked in the original comment. Feel free to edit (I think members should have permission).

HMC

| Stan | Turing.jl | PyMC3 | Pyro | NumPyro | Unified name |
| --- | --- | --- | --- | --- | --- |
| lp__ | lp/log_density | model_logp | - | potential_energy | lp |
| accept_stat__ | acceptance_rate | mean_tree_accept | acceptance rate | accept_prob | acceptance_rate |
| stepsize__ | step_size | step_size | - | adapt_state.step_size | step_size |
| - | nom_step_size | - | - | | |
| - | - | step_size_bar | - | | |
| treedepth__ | tree_depth | depth | - | | tree_depth |
| n_leapfrog__ | n_steps | tree_size | - | num_steps | n_steps |
| divergent__ | numerical_error | diverging | divergences | diverging | diverging |
| energy__ | hamiltonian_energy | energy | - | energy | energy |
| - | hamiltonian_energy_error | energy_error | - | | energy_error |
| - | max_hamiltonian_energy_error | max_energy_error | - | | max_energy_error |
| - | ~~is_adapt~~ | ~~tune~~ | - | | removed (see #1126) |

@fehiepsi could you please check Pyro and NumPyro names? I think I am on the right track but not completely sure.

I don't know enough about tfp as to include anything here. Is there anybody we could tag that comes to mind?

SMC

any thoughts @aloctavodia ?

| Turing.jl | PyMC3 | Unified name |
| --- | --- | --- |
| le | | |
| weight | | |

MH?

Notes:

I think the only sampler stats currently used (in plotting or in stats) are diverging and energy.

PyMC3 reference: src1, (MH related, not really sure they are relevant: src2, src3, src4)
Pyro reference: src
NumPyro reference: src
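To make the point about library-independent access concrete, here is a minimal sketch of how the HMC table could be applied. The dictionary `UNIFIED_NAMES` and the function `unify_sample_stats` are hypothetical names for this example, only the Stan and PyMC3 columns are included to keep it short, and the sample values are made up.

```python
# Library-specific name -> proposed unified name, copied from the HMC table.
# Rows without a unified name (nom_step_size, step_size_bar) are left out.
UNIFIED_NAMES = {
    "stan": {
        "lp__": "lp",
        "accept_stat__": "acceptance_rate",
        "stepsize__": "step_size",
        "treedepth__": "tree_depth",
        "n_leapfrog__": "n_steps",
        "divergent__": "diverging",
        "energy__": "energy",
    },
    "pymc3": {
        "model_logp": "lp",
        "mean_tree_accept": "acceptance_rate",
        "step_size": "step_size",
        "depth": "tree_depth",
        "tree_size": "n_steps",
        "diverging": "diverging",
        "energy": "energy",
        "energy_error": "energy_error",
        "max_energy_error": "max_energy_error",
    },
}


def unify_sample_stats(stats, library):
    """Rename the keys of a sample stats dict; unknown names are kept as-is."""
    mapping = UNIFIED_NAMES[library]
    return {mapping.get(name, name): values for name, values in stats.items()}


# Made-up values, only to show that both results expose the same keys.
stan_stats = {"lp__": [-4.2, -3.9], "divergent__": [0, 1]}
pymc3_stats = {"model_logp": [-4.0, -4.1], "diverging": [False, True]}
unified_stan = unify_sample_stats(stan_stats, "stan")
unified_pymc3 = unify_sample_stats(pymc3_stats, "pymc3")
```

After the renaming, downstream code can look up `diverging` or `lp` without knowing which library produced the samples.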

fehiepsi · 2020-02-21T18:56:27Z

Thanks @OriolAbril, these are the correct names.

ahartikainen · 2020-02-22T07:50:45Z

What does TFP use?

junpenglao · 2020-02-22T08:18:47Z

TFP does not have an internal naming convention: the stats are function outputs (tensors or arrays) and users are free to name them whatever they want. I was mapping them manually, e.g.: https://colab.research.google.com/github/tensorflow/probability/blob/master/tensorflow_probability/examples/jupyter_notebooks/Modeling_with_JointDistribution.ipynb#scrollTo=4qQdOPk90f7t

ahartikainen · 2020-02-22T08:38:46Z

Stan hmc has these (I need to verify)

accept_stat__ stepsize__ int_time__

aloctavodia · 2020-03-19T11:46:27Z

Currently PyMC3's SMC does not return any statistics, but I should fix that.

OriolAbril · 2020-04-16T01:00:22Z

I updated my previous comment with the table summary to try to restart the discussion. I tried to use the names that I felt had the most consensus (e.g. more than one library had similar or equal names). Please comment; I actually don't know the reasons behind any of the naming conventions chosen by each library.

The ones I am having trouble with are the step_size related parameters. After warmup (whose sample stats should be stored in warmup groups, see #1126), step_size should have converged to a given value, so we are basically storing repeated values unless the step size is jittered (which seems to be available only in Turing). It feels unnecessary to store step_size in an (nchains, ndraws) array, but given the jitter possibility it may be the simplest way to accommodate all the libraries in a single naming convention (with nom_step_size generally missing).

Regarding step_size_bar, I thought it could be removed from sample_stats: the PyMC3 docs say that after tuning step_size is set to step_size_bar, but that does not seem to be the case. All stored ArviZ InferenceData objects have different values for step_size and step_size_bar (each is constant across all draws, but the two differ from each other).
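The redundancy described above is easy to check. The following is a sketch with made-up numbers stored as nested lists of shape (nchains, ndraws); `constant_per_chain` is a hypothetical helper, not an ArviZ function.

```python
def constant_per_chain(values):
    """Return True if every chain holds a single repeated value across draws."""
    return all(len(set(chain)) == 1 for chain in values)


# Made-up (nchains, ndraws) values for illustration.
step_size = [[0.41, 0.41, 0.41], [0.37, 0.37, 0.37]]
jittered = [[0.41, 0.39, 0.43], [0.37, 0.36, 0.38]]

# Without jitter the array could be collapsed to one value per chain.
per_chain = (
    [chain[0] for chain in step_size] if constant_per_chain(step_size) else None
)
```

With jitter the check fails, which is the case that argues for keeping the full (nchains, ndraws) shape in the convention.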

nitishp25 · 2020-04-20T06:37:21Z

Btw, the renaming work is to be done in this PR itself, right? Or should this one be left only for the descriptions of the existing sample_stats?

canyon289 · 2020-06-07T13:06:13Z

Checking in on old PRs. Any possibility of bringing this over the line or should we close?

OriolAbril · 2020-06-07T13:12:06Z

This one depends on core contributors having some consensus on the names to use. Maybe a lab meeting could help in finishing this?

canyon289 · 2020-06-14T20:47:38Z

Sounds good. I'll add it as a topic for the next lab meeting.

sethaxen

Mostly clarifying questions. Also, your table above is super useful. I got a question just this week about the meaning of these parameters, and it would be great if this table were available somewhere. Perhaps ArviZ's docs are the best place for it. Would you like to include it?

doc/schema/schema.md

OriolAbril · 2020-09-30T17:26:52Z

We have agreed on the names, see table in previous comment, but we still need to update the PR to match said table and add the definitions. There are some definitions in previous comments when describing Stan and Turing sample stats.

I think now would be a good time to implement the convention so sample stats can be used in https://github.com/arviz-devs/arviz_dashboard independently of the sampling backend

ahartikainen · 2020-09-30T18:08:35Z

An explicit name is better; then let's add a description somewhere of what is what.

E.g. could the attrs field contain something?

nitishp25 · 2020-10-03T19:41:40Z

Okay, so will you be working on this PR now? I don't have much knowledge about the definitions but I can update if you want

ahartikainen

Do we want to mention sample_stats_prior, which is analogous to sample_stats but has its one-to-one correspondence with the prior rather than the posterior?

doc/source/schema/schema.md

ahartikainen · 2020-11-25T06:36:17Z

Oh, did we miss the int_time__ --> https://mc-stan.org/docs/2_25/cmdstan-guide/mcmc-intro.html

int_time__ - total integration time (static HMC sampler)

Also lp__ is mentioned as the total log probability density (up to an additive constant) at each sample

OriolAbril · 2020-11-27T02:07:21Z

> Oh, did we miss the int_time__

Looks like it. int_time looks like a good name, unless someone wants to propose an alternative.

added definition proposals.

Co-authored-by: Ari Hartikainen <ahartikainen@users.noreply.github.com>

mjhajharia · 2021-04-04T20:05:56Z

> I'll try to summarize the information gathered grouping equivalent sampler stats from different libraries. Descriptions and sources should still be checked in the original comment. Feel free to edit (I think members should have permission).


Maybe I missed something, but I saw perf_counter_diff, process_time_diff and perf_counter_start in trace.sample_stats.

OriolAbril · 2021-04-04T22:02:26Z

These were added after this comment was written. Let's continue this in pymc-devs/pymc-examples#95 instead.

nitishp25 requested a review from OriolAbril February 10, 2020 18:03

OriolAbril reviewed Feb 10, 2020

ahartikainen reviewed Feb 11, 2020

sethaxen mentioned this pull request Feb 28, 2020

Improve Stan conversion and docs arviz-devs/ArviZ.jl#31

Closed

nitishp25 force-pushed the sample-stats-schema branch 2 times, most recently from 000776a to af4834f Compare April 27, 2020 15:47

OriolAbril mentioned this pull request Jul 3, 2020

Remove technical debt in from_pymc3 and move to pymc3 repo #1278

Closed

sethaxen requested changes Jul 7, 2020

OriolAbril force-pushed the sample-stats-schema branch from 7690699 to c7ff582 Compare November 25, 2020 02:21

OriolAbril requested review from sethaxen and ahartikainen November 25, 2020 02:21

OriolAbril approved these changes Nov 25, 2020

ahartikainen approved these changes Nov 25, 2020

OriolAbril added this to In progress in Documentation via automation Jan 8, 2021

OriolAbril moved this from In progress to Review in progress in Documentation Jan 8, 2021

OriolAbril changed the title ~~[WIP] Explain sample_stats naming convention~~ Explain sample_stats naming convention Jan 13, 2021

nitishp25 and others added 8 commits January 16, 2021 03:04

add variable explanations

9acbe47

add max_treedepth sample stat

5df8602

remove max_treedepth

70cccdf

update schema to reflect agreed on convention

2140adb

added definition proposals.

add a couple notes

f12d839

Improvements from code review

09e2dca

Co-authored-by: Ari Hartikainen <ahartikainen@users.noreply.github.com>

add int_time

104532c

add to changelog

97e6ba5

OriolAbril force-pushed the sample-stats-schema branch from dd3a59b to 97e6ba5 Compare January 16, 2021 01:05

Merge branch 'master' into sample-stats-schema

d8298ba

OriolAbril merged commit ae8a613 into arviz-devs:master Jan 16, 2021

Documentation automation moved this from Review in progress to Done Jan 16, 2021

OriolAbril mentioned this pull request Jan 21, 2021

Update from_xyz converters to follow schema conventions #1514

Closed

7 tasks

utkarsh-maheshwari mentioned this pull request Feb 7, 2021

Update from_cmdstan converter to follow schema convention #1541

Merged

3 tasks

sethaxen mentioned this pull request Feb 22, 2021

Update MCMCChains sample_stats names to match scheme arviz-devs/ArviZ.jl#112

Merged

OriolAbril mentioned this pull request Apr 4, 2021

sampler stats pymc-devs/pymc-examples#95

Merged

sethaxen mentioned this pull request May 27, 2021

Adopt AbstractMCMC.jl interface TuringLang/AdvancedHMC.jl#259

Merged

2 tasks
