SUMM: score tests and similar #2041

josef-pkt · 2014-10-11T21:11:46Z

I need a summary issue for the score_test, lagrange multiplier test, artificial regression, moment restriction tests

They are all similar or asymptotically the same with different assumption on robustness to misspecification. There are many open issues for this
#1163 basic issue
#1756 conditional moment restriction
#2054 score_tests with robust covariance assumptions

special cases
overdispersion test in Poisson (and related count models)

implementation*

what do we have available for a test?

two (or three) cases:

if we have a fully specified model: we can use score_test directly
we have score_obs, score, hessian w.r.t. all parameters that we want to test
no distinction in handling score from parameters used in estimation from those under test
e.g. variable addition tests
we only have a partial model without fully specified alternative in the model: we can use artificial regression, "scores" defined outside of the model or conditional moment tests, or embed in GMM
- the only generic setup I can see is conditional moment tests cm_test
- testing covariance assumption in mean models: a large battery of LM test for OLS, but in a similar way should be extendable to entire LEF, I have it for GLM reusing the methods from the link and family.
- special case tests: e.g. score tests for poisson versus zero inflated or hurdle model.
general tests on properties: Hausman, Information Matrix equality, ...
can be formulated as LM test in linear case, but I don't know how they fit in "generically"

issues on infrastructure
#2072 get_robustcov to be reused for score tests

artificial regression: Gauss-Newton regression of Davidson-MacKinnon
generic "one-step" estimators (see for example Newey McFadden 1994 Handbook of Econometrics
-> reuse Newton optimizer, iterative WLS results ?

multiple hypothesis, multiple testing
just an idea: using normal based score test for single restriction
simplest case: we have many possibly omitted variables that we want to test
we can test all of them at the same time - standard score test, like results.wald_test
we could also test the individual restrictions/additions - like results.t_test
questions: correlation of the omitted variables, jointly or in loop?
for example Pagan and Vella 1989 report separate t-statistics for a list of moment restrictions (omitted variables or heteroscedasticity)
Stata also reports separate tests with optional p-value correction in some of the estat specification tests after regress

Where are the score_test examples that I already did?

https://github.com/statsmodels/statsmodels/pull/1781/files#diff-724e15f4019b2a24906f6b271d67e370R433
Poisson overdispersion are not in repo

edit
variable addition score test: reduced rank #5364
We need to handle the case when some added variables are already implicitly included in the restricted model. GLM.score_test uses the full number of variables and not the rank of the addition.
(found while running an example for GAM in #5296 )

The text was updated successfully, but these errors were encountered:

josef-pkt · 2014-10-23T03:21:57Z

good read for link between conditional moment tests and score tests, although a bit dated:
Pagan, Adrian, and Frank Vella. 1989. “Diagnostic Tests for Models Based on Individual Data: A Survey.” Journal of Applied Econometrics 4 (S1): S29–59. doi:10.1002/jae.3950040504.

josef-pkt · 2014-11-01T18:12:25Z

a reference as reminder:
Chapter 5 in Cameron Trivedi has a good discussion of MLE, QMLE based LM tests and related conditional moment tests (I read it a year ago and had forgotten about it.)

In general it's not easy to see what the maintained assumptions in a test are. Wooldridge tries to be robust to higher moment misspecification, Cameron Trivedi mix them up a lot more.
My impression is that Wooldridge's multistep artificial regression based tests (1990, 1991) use a lot of case specific proofs instead of basing it on a "generic" principle (like GMM for example). But I didn't read the main 1990 paper again.
Maybe generic GMM based test would be easier to implement, than the Wooldridge procedures where one of the wiping step replicates the inner part of the sandwich calculation. see #1790

correction: GMM based tests (cm-tests) might be easy, but "too robust". It might be difficult to add additional maintained assumptions to get less than fully robust tests. (tradeoff small sample performance and robustness)

josef-pkt · 2016-04-22T16:05:43Z

R package statmod has glm.scoretest as a omitted/added variable test with dispersion as optional parameter.

josef-pkt · 2016-09-23T04:10:02Z

another model: goodness-of-fit in logistic matched case-control (i.e. Conditional Logit)
It looks like it essentially reduces to a conditional moment test or to a variable addition test where we include the squares or the regressors (dropping singular or duplicate, especially categorical variables)
Chen, Li-Ching, and Jiun-Yi Wang. 2013. “Testing the Fit of the Logistic Model for Matched Case-Control Studies.” Computational Statistics & Data Analysis 57 (1): 309–19. doi:10.1016/j.csda.2012.07.001.

looks a bit like a complement to Hosmer-Lemeshow because it only considers continuous variables.
(e.g. curvature of link function doesn't matter in categorical/saturated models. I haven't checked the details.) see also #1061 for related issue in current diagnostic tests.

josef-pkt · 2016-10-16T16:49:55Z

(random thought for categorical explanatory variables)

FAQ: How do we do a score test for an interaction effect?
side issue: construct the exog columns for the interaction effect that is orthogonal (for the given data) to the main effects. (not the balance case where we can construct ex ante orthogonal constrasts)

score test conditions on included variables, but we need both z/t test for single parameters and chi2/F test for joint hypothesis.

this is mainly a use case and example for multi-column joint tests, everything is generic, when we have the equivalent of t_test and wald_test for score/LM tests. One possible special issue: how do we create the raw design matrix for the interaction effect before conditioning, e.g. using patsy. Also if we don't remove collinearities, then we need to handle them in the score_test (e.g. adjust degrees of freedom based on rank)

josef-pkt · 2017-01-30T19:20:33Z

another reference,
This has a collection of several tests for binary endog models all implemented with auxiliary regression (Davidson and MacKinnon type), i.e. LM or CM test using plain residuals (not weighted/scaled)
The only exception is a P-test for non-nested models for testing between two specified link functions.
For RESET test they only compare across number of polynomial terms. Minimum sample size in Monte Carlo is 500.

Ramalho, Esmeralda A., and Joaquim J. S. Ramalho. 2012. “Alternative Versions of the RESET Test for Binary Response Index Models: A Comparative Study*.” Oxford Bulletin of Economics and Statistics 74 (1): 107–30. doi:10.1111/j.1468-0084.2011.00654.x.

josef-pkt · 2017-06-02T19:37:57Z

SAS seems to have generalized/robust score_test for GEE/GLM #1739 (comment)

josef-pkt · 2020-11-18T21:02:42Z

to separate tests of CMT, normal instead of joint chisquare

CMTNewey and CMTTauchen have ztest methods for the individual constraints.
Those are designed for GMM
The individual lm or cm tests don't have an option for individual hypothesis, only the joint chisquare based results

Can I use them for the a specification test for MLE, e.g. excess probability tests like zero-inflation or binned prob test in count models.?

josef-pkt · 2022-04-04T15:55:28Z

score_test robust to misspecification cov_type

newer article I found (I didn't read it yet)

Bera, Anil K., Yannis Bilias, Mann J. Yoon, Süleyman Taşpınar, and Osman Doğan. 2020. “Adjustments of Rao’s Score Test for Distributional and Local Parametric Misspecifications.” Journal of Econometric Methods 9 (1). https://doi.org/10.1515/jem-2017-0022.

already referenced in #1163 (comment)

They do not cite Boos which was my main reference

I have problems finding the Boos article in issues and doesn't seem to be in docs
Boos reference is

Boos, Dennis D. "On generalized score tests." The American Statistician 46, no. 4 (1992): 327-333.

josef-pkt added type-enh comp-base design type-refactor labels Oct 11, 2014

This was referenced Oct 20, 2014

score_tests cluster robust, sandwiches #2054

Open

everything GMM, GLMGMM #1790

Open

josef-pkt mentioned this issue Oct 31, 2014

two step estimation, robustness, and nuisance parameters like scale #2073

Open

josef-pkt mentioned this issue Nov 4, 2014

cov_params reduced rank - consequences #1717

Open

This was referenced Nov 13, 2014

LM / score tests: omitted test missing in diagnostic #2091

Open

Score/LM conditional moment tests #2096

Merged

josef-pkt mentioned this issue Aug 30, 2015

ENH: Design score confidence interval #2606

Open

josef-pkt mentioned this issue Oct 3, 2015

ENH: score/Lagrange Multiplier test for GMM #2637

Open

josef-pkt mentioned this issue Nov 23, 2015

SUMM: QMLE, Logit, Poisson with continuous data #2712

Open

josef-pkt mentioned this issue Mar 21, 2017

ENH: score confidence interval for GLM, discrete #2722

Open

josef-pkt mentioned this issue May 23, 2017

ENH/DOC: reset_test #3690

Open

josef-pkt mentioned this issue Sep 4, 2017

ENH: chisquare diagnostic test for models #3904

Open

josef-pkt added this to the 0.10 milestone May 26, 2018

josef-pkt added the topic-diagnostic label Jun 3, 2018

josef-pkt mentioned this issue Nov 2, 2018

BUG/ENH: LM, score_test variable addition with redundant variables (rank) #5364

Open

josef-pkt mentioned this issue Dec 3, 2018

ENH: helper function or options for splines, various polynomial basis in nonlinearity tests #5403

Open

josef-pkt mentioned this issue Feb 20, 2019

Set degrees of freemdom in reset_ramsey #5508

Closed

josef-pkt removed this from the 0.10 milestone Oct 28, 2021

josef-pkt added this to the 0.14 milestone Oct 28, 2021

josef-pkt mentioned this issue May 22, 2022

ENH: score- LM-test under misspecified alternative #8288

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

SUMM: score tests and similar #2041

SUMM: score tests and similar #2041

josef-pkt commented Oct 11, 2014 •

edited

josef-pkt commented Oct 23, 2014

josef-pkt commented Nov 1, 2014

josef-pkt commented Apr 22, 2016 •

edited

josef-pkt commented Sep 23, 2016 •

edited

josef-pkt commented Oct 16, 2016

josef-pkt commented Jan 30, 2017

josef-pkt commented Jun 2, 2017 •

edited

josef-pkt commented Nov 18, 2020

josef-pkt commented Apr 4, 2022

SUMM: score tests and similar #2041

SUMM: score tests and similar #2041

Comments

josef-pkt commented Oct 11, 2014 • edited

josef-pkt commented Oct 23, 2014

josef-pkt commented Nov 1, 2014

josef-pkt commented Apr 22, 2016 • edited

josef-pkt commented Sep 23, 2016 • edited

josef-pkt commented Oct 16, 2016

josef-pkt commented Jan 30, 2017

josef-pkt commented Jun 2, 2017 • edited

josef-pkt commented Nov 18, 2020

josef-pkt commented Apr 4, 2022

josef-pkt commented Oct 11, 2014 •

edited

josef-pkt commented Apr 22, 2016 •

edited

josef-pkt commented Sep 23, 2016 •

edited

josef-pkt commented Jun 2, 2017 •

edited