-
Notifications
You must be signed in to change notification settings - Fork 2k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Mixed Effects/Hierarchical GLM support #9121
Comments
Wendy commented: In mixed-effect GLM, the response variable is a function of x^TB+zgi^Tbi+ei for group i. This means all groups share the common coefficients B. For each subgroup, it has a set of coefficients bi belong to that group. Hence, we need to estimate the coefficients that are common to all groups and also the coefficients bi for each group. At a first glance, this will involve a lot of data movement. I need to group the data as a whole unit to estimate B, then, into many subgroups to estimate bi. This only involves a bunch of calls to manipulate the data Frame. As usual, we need to derive the loglikelihood, get the gradient/hessian, and try to solve for how to change the coefficients. This is not going to be trivial at all because we have one global coefficient sets and several local coefficients. |
Wendy commented: Information from customers: Example analysis with open data: [https://ourcodingclub.github.io/2017/03/15/mixed-models.html#two|https://ourcodingclub.github.io/2017/03/15/mixed-models.html#two] Example implementation in R: [https://cran.r-project.org/web/packages/lme4/index.html|https://cran.r-project.org/web/packages/lme4/index.html] Example implementation in a distributed environment: [https://github.com/linkedin/photon-ml|https://github.com/linkedin/photon-ml|smart-link] |
Wendy commented: There are several distribution combinations for HGLM and they are:
However, learning from Michalk, I am going to break each implementation into a JIRA. |
William Prucknic commented: Could tweedie be included? |
Wendy commented: William: Tweedie be added to family or rand.family? Please let me know. Wendy |
William Prucknic commented: I noticed tweedie missing from the list of family's above and when using GLMs I use Tweedie often when analyzing insurance data. That’s why I made the suggestion. However, I don’t have any experience w/ mixed effects models, so I know I am out of my element here. |
JIRA Issue Migration Info Jira Issue: PUBDEV-6509 |
No description provided.
The text was updated successfully, but these errors were encountered: