Llprior #56

Merged
merged 8 commits into master from llprior on Jan 2, 2017

Conversation

jake-westfall
Collaborator

This changes the implementation of the default priors so that they are based entirely on the log-likelihood function, never referencing the multiple regressions of Y and X on the covariates as in the previous implementation. For Normal response models the results are essentially, but not exactly, the same as before, because the new implementation relies on a quadratic approximation to the log-likelihood function (which is close but not perfect). Non-Normal response models are now handled exactly the same way as Normal response models. However, the quadratic approximation to the log-likelihood is not as good for non-Normal response models, so the interpretation of the priors in terms of the standard deviation of the implied partial correlation should be treated as a fairly rough approximation. Nevertheless, the default priors for non-Normal models are now much more intuitively sensible than before. The same implementation should also work almost entirely "as-is" for other link functions / response distributions that we have not explicitly implemented; all we really need to do is point to the appropriate statsmodels distribution family in priors.py.

We might be able to upgrade the quadratic approximation to a quartic approximation, which should give even better results, but I have not worked this out yet. In any case, this is ready for production for now.
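A minimal sketch of the idea (not the code in this PR): fit a GLM with statsmodels, profile the log-likelihood on a grid around the MLE of a single coefficient, and fit a quadratic whose curvature yields a rough standard deviation that a default prior scale could be based on. The data, variable names, and grid width below are hypothetical.

```python
import numpy as np
import statsmodels.api as sm

# Toy data (hypothetical, illustration only): one predictor plus an intercept.
rng = np.random.default_rng(0)
X = sm.add_constant(rng.normal(size=(200, 1)))
y = X @ np.array([0.5, 0.3]) + rng.normal(size=200)

model = sm.GLM(y, X, family=sm.families.Gaussian())
mle = model.fit().params

# Profile the log-likelihood over a grid around the MLE of the slope
# (index 1), holding the other coefficient at its MLE.
grid = mle[1] + np.linspace(-0.5, 0.5, 21)
lls = []
for b in grid:
    beta = mle.copy()
    beta[1] = b
    lls.append(model.loglike(beta))

# Quadratic fit LL(b) ~ a*b**2 + c*b + d. The curvature -2*a plays the role
# of an observed-information term; its inverse square root is a rough
# standard deviation on which a default prior scale could be based.
a, c, d = np.polyfit(grid, lls, deg=2)
implied_sd = 1.0 / np.sqrt(-2.0 * a)
```

Swapping in a different family (e.g. sm.families.Binomial()) is what lets the same recipe carry over to non-Normal models, which is why pointing at the right statsmodels family is most of the remaining work.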

jake-westfall added 8 commits November 7, 2016 11:42
Seems to work in ideal cases, but does not pass tests yet. In particular, need to modify the NA-checking code. Will do in future commit.

Previously when dropna=True the user was warned that NAs would be dropped, but they never actually got dropped. This also fixes the LL prior implementation for slope-only models.

Also removes the calls to pd.stats.ols that were giving unsightly warnings during model building.

We now numerically approximate the 2nd derivative manually rather than using GLM.hessian() from statsmodels.
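A sketch of what a manual central-difference second derivative can look like in place of GLM.hessian(); the logistic setup, step size, and names below are hypothetical, not the code in this commit.

```python
import numpy as np
import statsmodels.api as sm

# Hypothetical logistic-regression setup, for illustration only.
rng = np.random.default_rng(1)
X = sm.add_constant(rng.normal(size=(200, 1)))
y = rng.binomial(1, 0.5, size=200)

model = sm.GLM(y, X, family=sm.families.Binomial())
mle = model.fit().params

def second_deriv(index, eps=1e-4):
    """Central-difference estimate of the 2nd derivative of the
    log-likelihood with respect to beta[index], evaluated at the MLE."""
    lo, hi = mle.copy(), mle.copy()
    lo[index] -= eps
    hi[index] += eps
    return (model.loglike(hi) - 2.0 * model.loglike(mle) + model.loglike(lo)) / eps ** 2

# The negative inverse curvature roughly approximates the coefficient's
# sampling variance (ignoring off-diagonal terms of the full Hessian, which
# this one-coefficient-at-a-time version does not touch).
approx_var = -1.0 / second_deriv(1)
```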

coveralls commented Jan 2, 2017


Coverage increased (+0.1%) to 96.839% when pulling 93a369c on llprior into 038984b on master.

@tyarkoni tyarkoni merged commit ddc19c8 into master Jan 2, 2017
@jake-westfall jake-westfall deleted the llprior branch January 3, 2017 00:31
jake-westfall pushed a commit that referenced this pull request Sep 29, 2017