Sampling is inaccurate especially for log-normal distributions #184

michaeldickens · 2016-03-14T18:31:14Z

I refreshed a model four times and got these four different results. The results vary by more than an order of magnitude. I believe this tends to happen more with log-normal distributions than with other kinds of distributions.

OAGr · 2016-03-14T19:51:54Z

Basically, what seems to be happening isn't a specific bug as much as it is a challenge of having long tails with monte carlo sampling, especially for relatively small sample counts. The numbers visible on this dashboard the means, which vary greatly depending on outliers. If you investigate this further I would bet that the medians are closer together. Also, it looks like the confidence intervals don't change much.

I think this presents a few options:

Show medians, not means on this page.
More samples everywhere.
More samples, specifically in these situations.
Warnings where the value is highly outlier-sensitive.

OAGr · 2016-03-14T19:54:57Z

There are likely a few things to consider regarding this model.

BEWARE when dividing be a wide normal distribution. This typically presents cases where you divide by 0, or divide by something close to 0, which is a big outlier. Changing spending and hens per human to be lognormal probably makes more sense, then much of this problem seems fixed.

OAGr · 2016-06-16T23:48:27Z

We now use a 'X to Y' syntax, which addresses this issue.

OAGr closed this as completed Jun 16, 2016

saurabharch mentioned this issue Nov 5, 2022

[Snyk] Fix for 1 vulnerabilities saurabharch/guesstimate-app#44

Open

snyk-bot mentioned this issue Dec 26, 2022

[Snyk] Fix for 1 vulnerabilities saurabharch/guesstimate-app#52

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Sampling is inaccurate especially for log-normal distributions #184

Sampling is inaccurate especially for log-normal distributions #184

michaeldickens commented Mar 14, 2016

OAGr commented Mar 14, 2016

OAGr commented Mar 14, 2016

OAGr commented Jun 16, 2016

Sampling is inaccurate especially for log-normal distributions #184

Sampling is inaccurate especially for log-normal distributions #184

Comments

michaeldickens commented Mar 14, 2016

OAGr commented Mar 14, 2016

OAGr commented Mar 14, 2016

OAGr commented Jun 16, 2016