-
Notifications
You must be signed in to change notification settings - Fork 1
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
instrumental variables: can they play a role in dynamite? #81
Comments
This is an interesting question, I'm not sure how much we can do with IV in dynamite outside of the classical linear Gaussian setting, as we would have to most likely model the correlation structure (similar to what brms does with
|
Thank you very much, I will try it. Anderson-Hsiao, Arellano-Bond and System GMM estimators are often used for models with the lagged dependent variables (for example: y_t~y_t-1 + x) and a small number of T (times of the panel) to avoid bias in the estimates. It is possible that dynamite could show a similar bias without some similar kind of correction for small T, maybe it is possible to check if that is the case by simulation, varying T in models with the lagged outcome as covariate. In principle the two approaches (instrumental variables-type estimators, such as Anderson-Hsiao and GMM, and simultaneous sistems of equations, such as dynamite) seems more complementary than alternatives and I agree that the first place where to start to check is for normal responses (univariate and multivariate), the implementations of these type of estimators I have found focus on normal outcomes too, so it could be also a normal-only feature of dynamite or a separate function if it results in a way to reduce the bias in small T panels with lagged outcomes as covariate. |
I'm not sure if the following is what you had in mind, but I did a small simulation experiment to try this out, and it does not seem that there is much bias (although this should be repeated multiple times with different data, but this would take time).
|
A better simulation below. With small N and small T there is indeed bias, which diminishes with increasing N.
|
I agree repeated simulations are needed to draw conclusions, but these results are promising. I want to specify that I am not sure if the I think a comparison of the estimates of dynamite without instrumental variable correction, dynamite with instrumental variable correction and pgmm from the plm package could give more insights on the issue. I have tried to test this data generating process on pgmm too and it seems actually more biased than dynamite even in this scenario with very small T, which was designed to show the merits of GMM, which could be seen as a sign that there are no issues even for small T, or most likely we are missing something important in the
|
I've tried to repeat the experiment with a fixed effect for each individual They are closer to the estimates of an ols than a within estimator. I report also an ols estimator and within estimator because tipically a consistent estimator such as GMM lies between the two (ols and within estimators have opposite biases, Panel Data Econometrics with R, Yves Croissant, Giovanni Millo, chapter 7):
Something interesting is that the bias in this context doesn't seem to propagate to the other parameter x, just the autoregressive parameter y_lag seems to be biased with this data generating process. |
If there is a correlation between
I think the bias for ols and dynamite estimates reduces because in my naive simulation process |
Thanks for testing things out. In both of these cases, the issue is that now the dynamite model is no longer correctly specified. We need to account for the group-specific effect by including a group-specific random intercept in the model by adding the
|
In other packages about dynamic panels, based on GMM or similar tools, instrumental variables play often a central role, I wonder if instrumental variables could play a role also in this framework. Simultaneous systems of equations approaches like the one of dynamite seems a more transparent way of dealing with endogeneity than GMM, since they allow to express explicitly an equation for each endogenous variable, but I am not sure these methods deal with the same sources of endogeneity and if the inclusion of instrumental variables can improve dynamite.
Also I think that understanding better the similarities and differences of these different approaches could allow to show light on the different assumptions implied by these different approaches and make better comparison between these models and allow to choose between them, based on the context and the set of assumptions that seems more realistic.
Some example of other packages for dynamical panel that rely on GMM or GMM-inspired models:
pgmm: https://cran.r-project.org/web/packages/plm/
pvargmm: https://cran.r-project.org/web/packages/panelvar/
pdynmc: https://cran.r-project.org/web/packages/pdynmc/
I have to mention that in my experience GMM (I am referring to pgmm) tends to be difficult to use in practice, since it depends on a lot of hyperparameters and the set of instruments chosen, which can alter the results substantially: it seems very difficult to choose between different specifications of the model. Also it is not clear to me if the existing implementation of pgmm support categorical variables. But most importantly GMM seems to me a less transparent way of dealing with endogenity than explicitly modelling each endogenous variables like in dynamite.
The text was updated successfully, but these errors were encountered: