Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

ENH: oulier-influence for misspecified models ? #7955

Open
josef-pkt opened this issue Dec 15, 2021 · 1 comment
Open

ENH: oulier-influence for misspecified models ? #7955

josef-pkt opened this issue Dec 15, 2021 · 1 comment

Comments

@josef-pkt
Copy link
Member

For now just a question: How does model misspecification affect outlier-influence measures?

possible case, scale misspecification
based on #7951 (comment)

Under full MLE, scale=1, e.g. Poisson or HetModels. Any dispersion/variance parameters are part of the estimated parameters.
However, the model could be misspecified, either in scale, e.g. with over dispersion, or in any of the robust cov_types, HC, ....

GLM, OLS have scale as auxiliary parameter in continuous endog case, but it's not part of params and needs to be treated separately.

GLM allows excess dispersion, scale != 1, also in the full MLE case like Poisson, Binomial.
This affects the studentized residuals in the Influence classes.

more general:
How should we treat misspecified models, models that use robust cov_type or excess scale for wald inference?

Currently. there is no systematic treatment of sandwich cov_params or other misspecification in the Influence classes.

Related: Should we add a cov_type="excess-scale" generically. It applies only to models that does not estimate scale directly.

@josef-pkt
Copy link
Member Author

possible case for looking into this: overdispersed Poisson

Compare GLM or MLE influence under two scale assumptions

  • MLE: scale fixed at 1, resid_pearson are not scaled (are scaled by 1)
  • Pseudo-MLE: with pearson_chi2 scale,
    • resid_pearson will be fully scaled.
    • parts that rely on bse/cov_params will be scaled.
    • parts that rely on hessian, or score will not be scaled

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

1 participant