Join GitHub today
GitHub is home to over 20 million developers working together to host and review code, manage projects, and build software together.
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
Already on GitHub? Sign in to your account
lmer vs lme/SAS PROC MIXED; fixed estimate in random-slopes model #376
Comments
bbolker
added a commit
that referenced
this issue
May 4, 2016
|
|
bbolker |
c93fab8
|
|
findings so far:
|
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
bbolker commentedMay 4, 2016
•
edited
via e-mail from Georg Ferber:
data set here and my beginnings of exploration here
To give you a brief background: We perform a simulation study where we generate studies investigating the relationship of the QT-interval (or rather its change from a baseline value) and the concentration of a hypothetical drug. In one of the scenarios that we investigated, we observed that in about 10 % of the cases, the estimate for the fixed effect for concentration, which, in this case, should be around 0, was much larger and there was an outlier in the random effects. When we tried to identify the reason for this, we discovered that with the same dataset, the estimates obtained with the lme function in the nlme package gave us much more "reasonable" (i.e. a slope closer to the expected value of 0) results and that SAS proc mixed also gave these "reasonable" results. It should be noted that lme at times did not converrge, Likewise, by using a model with a smaller number of random effects, we could get values we expected. We think that in the meantime we could identify the element in the data that caused the problem to occur, but we still do not understand what happens.
More precisely, we fit this model:
where
trtandTPare factors, whileCis a continuous covariate) and we are interested (among others) in the fixed effect forC.In the dataset attached (
Oddsets1.Rdata), you find a list of 10 data.frames that cause these strange results together with a set of results, where the row named "Slope" is the one of primary interest. Note that this table is not a directly based on thelmermodel, but uses the packagelsmeansto come up with a Kenward-Rogers approximation. However, the estimate for the coefficient for C is unchanged.Below is an overview over the results (fixed effect for C) using the three programs.
Upon furhter investigation, we found that the large values for the coefficient disappear, when the model is changed to
while they remain essentially unchanged if we set
REML=F.Some more results are summarized in the attached word document.
We are currently thinking about strategies to cope with the problem above, but, of course, would be very glad to better understand what happens and how to avoid the problem.
We hope that you are also interested in feedback from the user community and that you find this problem interesting.