## GMM Estimation

### What is GMM?

The generalized method of moments (GMM) is a general framework for deriving estimators

GMM estimators use assumptions about the moments of the random variables to derive an objective function

- The assumed moments of the random variables are known as the population moments
- The data provide the sample moments
- We minimize the objective function to select the parameters that yield the smallest differences between the population moments and the sample moments

### MM and GMM

In the method of moments (MM), we have the same number of sample moment conditions as we have parameters

In the generalized method of moments (GMM), we have more sample moment conditions than we have parameters
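
As a minimal sketch of the exactly identified (MM) case, consider estimating a mean $\mu$ and a variance $\sigma^2$ from the two moment conditions $E[x - \mu] = 0$ and $E[(x - \mu)^2 - \sigma^2] = 0$. The function name and the data below are made up for illustration and are not part of the original notes.

```python
def mm_mean_variance(data):
    """Method-of-moments estimates of (mu, sigma2).

    Solves the two sample moment conditions
        (1/N) * sum(x_i - mu)                 = 0
        (1/N) * sum((x_i - mu)**2 - sigma2)   = 0
    Two conditions and two parameters: exactly identified.
    """
    n = len(data)
    mu = sum(data) / n
    sigma2 = sum((x - mu) ** 2 for x in data) / n
    return mu, sigma2


# Hypothetical sample, chosen only so the arithmetic is easy to follow.
data = [2.0, 4.0, 4.0, 4.0, 5.0, 5.0, 7.0, 9.0]
mu_hat, sigma2_hat = mm_mean_variance(data)
print(mu_hat, sigma2_hat)
```

With as many conditions as parameters, the sample moment equations can be solved exactly, so no weighting of the conditions is needed.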

### OLS is an MM estimator

We know that OLS estimates the parameters of the conditional expectation of $y$ given $x$ in the model

$$y_{i} = x_{i} \beta + \mu_{i}$$ 

under the assumption that $E[\mu | x] = 0$

The population moment conditions for OLS are
$$E(x^{\prime} \mu) = E[x^{\prime}(y - x \beta)] = 0$$

The corresponding sample moment conditions are
$$(1/N) \sum_{i=1}^{N} x_{i}^{\prime}(y_{i} - x_{i} \beta) = 0$$

Solving for $\beta$ yields
$$\hat{\beta}_{OLS} = (\sum_{i=1}^{N} x_{i}^{\prime} x_{i})^{-1} \sum_{i=1}^{N} x_{i}^{\prime} y_{i}$$
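
The following sketch solves the sample moment conditions directly for the simplest case, a constant plus one regressor, so that $x_i = (1, x_i)$ and the system is 2 × 2. The data and the function name are hypothetical; the point is only that solving the moment equations gives the OLS coefficients and drives the sample moments to zero.

```python
def ols_by_moments(x, y):
    """Solve sum_i x_i'(y_i - x_i b) = 0 for x_i = (1, x_i).

    This is the 2x2 system (X'X) b = X'y, solved here with Cramer's rule.
    """
    n = len(x)
    sx = sum(x)
    sxx = sum(v * v for v in x)
    sy = sum(y)
    sxy = sum(a * b for a, b in zip(x, y))
    det = n * sxx - sx * sx              # determinant of X'X
    b0 = (sxx * sy - sx * sxy) / det     # intercept
    b1 = (n * sxy - sx * sy) / det       # slope
    return b0, b1


# Hypothetical data for illustration.
x = [1.0, 2.0, 3.0, 4.0, 5.0]
y = [2.1, 3.9, 6.2, 7.8, 10.1]
b0, b1 = ols_by_moments(x, y)

# Evaluate the two sample moment conditions at the solution.
resid = [yi - b0 - b1 * xi for xi, yi in zip(x, y)]
m_const = sum(resid) / len(x)
m_x = sum(xi * ri for xi, ri in zip(x, resid)) / len(x)
print(b0, b1, m_const, m_x)
```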

### Generalized method-of-moments (GMM)

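MM works only when the number of moment conditions equals the number of parameters to estimate; with more conditions than parameters, there is generally no parameter value that solves all the sample moment equations at once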

Generalized method-of-moments (GMM) estimators choose the estimates that minimize a quadratic form of the moment conditions

### Definition of GMM estimator

Our research question implies q population moment conditions

$$E[m(w_{i},\theta)] = 0$$

- $m$ is a q × 1 vector of functions whose expected values are zero in the population
- $w_{i}$ is the data on person i
- $\theta$ is a k × 1 vector of parameters, k ≤ q

The sample moments that correspond to the population moments are

$$\bar{m}(\theta) = (1/N)\sum_{i=1}^{N} m(w_{i},\theta)$$

When k < q, GMM chooses the parameters that come as close as possible to solving the over-identified system of moment equations

$$\hat{\theta}_{GMM} = \arg\min_{\theta} \ \ \bar{m}(\theta)^{\prime} W \bar{m}(\theta)$$

where $W$ is a q × q symmetric, positive-definite weight matrix
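
To make the quadratic form concrete, here is a small sketch with one parameter and two moment conditions (k = 1, q = 2). It assumes a linear model $y_i = x_i \beta + u_i$ with two instruments $z_i$, so that $\bar{m}(\beta) = (1/N)\sum_i z_i (y_i - x_i \beta)$ is linear in $\beta$ and the minimizer has a closed form. The data, the helper names, and the choice $W = I$ are all illustrative assumptions.

```python
def cross_moment(z, v):
    """(1/N) * sum_i z_i * v_i, one entry per instrument."""
    n = len(v)
    return [sum(z[i][j] * v[i] for i in range(n)) / n
            for j in range(len(z[0]))]


def quad_form(u, W, v):
    """u' W v for lists u, v and a nested-list matrix W."""
    q = len(u)
    return sum(u[r] * W[r][c] * v[c] for r in range(q) for c in range(q))


def objective(beta, y, x, z, W):
    """GMM criterion m_bar(beta)' W m_bar(beta)."""
    resid = [yi - xi * beta for yi, xi in zip(y, x)]
    m_bar = cross_moment(z, resid)
    return quad_form(m_bar, W, m_bar)


def gmm_scalar(y, x, z, W):
    """Minimizer of the criterion for scalar beta and symmetric W.

    m_bar(beta) = a - b * beta, so the first-order condition
    b' W (a - b * beta) = 0 gives beta = (b' W a) / (b' W b).
    """
    a = cross_moment(z, y)
    b = cross_moment(z, x)
    return quad_form(b, W, a) / quad_form(b, W, b)


# Hypothetical data: six observations, two instruments per observation.
y = [2.5, 3.7, 6.3, 8.1, 9.9, 12.0]
x = [1.2, 1.9, 3.1, 4.2, 4.8, 6.1]
z = [(1.0, 1.0), (2.0, 0.0), (3.0, 1.0), (4.0, 0.0), (5.0, 1.0), (6.0, 0.0)]
W = [[1.0, 0.0], [0.0, 1.0]]   # identity weight matrix

beta_hat = gmm_scalar(y, x, z, W)
q_hat = objective(beta_hat, y, x, z, W)
print(beta_hat, q_hat)
```

Because there are more moment conditions than parameters, the criterion at the minimum is small but not zero: the two sample moments cannot both be set to zero by a single $\beta$.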

### Some properties of the GMM estimator

$$\hat{\theta}_{GMM} = \arg\min_{\theta} \ \ \bar{m}(\theta)^{\prime} W \bar{m}(\theta)$$

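The weight matrix $W$ affects only the efficiency of the GMM estimator; any symmetric positive-definite $W$ yields consistent estimates

- Setting $W = I$ yields consistent, but inefficient estimates
- Setting $W = Cov[\bar{m}(\theta)]^{-1}$ yields an efficient GMM estimator
- Because $Cov[\bar{m}(\theta)]$ depends on the unknown $\theta$, we can take multiple steps to get an efficient GMM estimator: estimate $\theta$ with an initial $W$, use those estimates to estimate the covariance of the moments, then re-estimate with the updated $W$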
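
The multi-step idea can be sketched as a two-step estimator for a linear model with one parameter and two instruments. Everything here is an illustrative assumption: the data, the helper names, the identity matrix as the first-step weight, and the heteroskedasticity-robust estimate $S = (1/N)\sum_i \hat{u}_i^2 z_i z_i^{\prime}$ of the moment covariance. Since $Cov[\bar{m}]$ is $S/N$ under independence, using $S^{-1}$ instead only rescales the criterion and leaves the minimizer unchanged.

```python
def cross_moment(z, v):
    """(1/N) * sum_i z_i * v_i, one entry per instrument."""
    n = len(v)
    return [sum(z[i][j] * v[i] for i in range(n)) / n
            for j in range(len(z[0]))]


def quad_form(u, W, v):
    """u' W v for lists u, v and a nested-list matrix W."""
    q = len(u)
    return sum(u[r] * W[r][c] * v[c] for r in range(q) for c in range(q))


def inverse_2x2(S):
    """Inverse of a 2x2 matrix given as nested lists."""
    det = S[0][0] * S[1][1] - S[0][1] * S[1][0]
    return [[S[1][1] / det, -S[0][1] / det],
            [-S[1][0] / det, S[0][0] / det]]


def linear_gmm(y, x, z, W):
    """Closed-form GMM estimate for scalar beta: (b' W a) / (b' W b)."""
    a = cross_moment(z, y)
    b = cross_moment(z, x)
    return quad_form(b, W, a) / quad_form(b, W, b)


def two_step_gmm(y, x, z):
    n = len(y)
    # Step 1: consistent but inefficient estimate with W = I.
    beta1 = linear_gmm(y, x, z, [[1.0, 0.0], [0.0, 1.0]])
    resid = [yi - xi * beta1 for yi, xi in zip(y, x)]
    # Estimate the covariance of the moment functions z_i * u_i.
    S = [[sum(resid[i] ** 2 * z[i][r] * z[i][c] for i in range(n)) / n
          for c in range(2)] for r in range(2)]
    # Step 2: re-estimate with the updated weight matrix.
    W2 = inverse_2x2(S)
    beta2 = linear_gmm(y, x, z, W2)
    return beta1, beta2, W2


# Hypothetical data: six observations, two instruments per observation.
y = [2.5, 3.7, 6.3, 8.1, 9.9, 12.0]
x = [1.2, 1.9, 3.1, 4.2, 4.8, 6.1]
z = [(1.0, 1.0), (2.0, 0.0), (3.0, 1.0), (4.0, 0.0), (5.0, 1.0), (6.0, 0.0)]

beta1, beta2, W2 = two_step_gmm(y, x, z)
print(beta1, beta2)
```

Both steps use the same moment conditions; only the weighting changes, which is why the two estimates are close but not identical in an over-identified model.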

In [1]:
import ipystata

In [7]:
%%stata

sysuse auto

* linear regression
regress mpg gear_ratio turn

(1978 Automobile Data)

      Source |       SS           df       MS      Number of obs   =        74
-------------+----------------------------------   F(2, 71)        =     43.09
       Model |  1339.68678         2  669.843392   Prob > F        =    0.0000
    Residual |  1103.77268        71   15.546094   R-squared       =    0.5483
-------------+----------------------------------   Adj R-squared   =    0.5355
       Total |  2443.45946        73  33.4720474   Root MSE        =    3.9429

------------------------------------------------------------------------------
         mpg |      Coef.   Std. Err.      t    P>|t|     [95% Conf. Interval]
-------------+----------------------------------------------------------------
  gear_ratio |   3.032884   1.372978     2.21   0.030     .2952433    5.770524
        turn |  -.7330502   .1424009    -5.15   0.000     -1.01699   -.4491108
       _cons |   41.21801   8.990711     4.58   0.000     23.29104    59.14498
---------------------------

In [4]:
%%stata

* Same linear model estimated by GMM; the regressors serve as their own instruments
gmm (mpg - {b1}*gear_ratio - {b2}*turn - {b0}), instruments(gear_ratio turn)

.  gmm (mpg - {b1}*gear_ratio - {b2}*turn - {b0}), instruments(gear_ratio turn)

Step 1
Iteration 0:   GMM criterion Q(b) =  471.67875  
Iteration 1:   GMM criterion Q(b) =  3.058e-21  
Iteration 2:   GMM criterion Q(b) =  2.545e-31  

Step 2
Iteration 0:   GMM criterion Q(b) =  1.691e-32  
Iteration 1:   GMM criterion Q(b) =  1.691e-32  (backed up)

note: model is exactly identified

GMM estimation 

Number of parameters =   3
Number of moments    =   3
Initial weight matrix: Unadjusted                 Number of obs   =         74
GMM weight matrix:     Robust

------------------------------------------------------------------------------
             |               Robust
             |      Coef.   Std. Err.      z    P>|z|     [95% Conf. Interval]
-------------+----------------------------------------------------------------
         /b1 |   3.032884   1.501664     2.02   0.043     .0896757    5.976092
         /b2 |  -.7330502    .117972    -6.21   0.000    -.9642711   -.5018293


In [8]:
%%stata

* Two-stage least squares: turn is treated as endogenous and instrumented by weight, length, and headroom
ivregress 2sls mpg gear_ratio (turn = weight length headroom)


Instrumental variables (2SLS) regression          Number of obs   =         74
                                                  Wald chi2(2)    =      90.94
                                                  Prob > chi2     =     0.0000
                                                  R-squared       =     0.4656
                                                  Root MSE        =     4.2007

------------------------------------------------------------------------------
         mpg |      Coef.   Std. Err.      z    P>|z|     [95% Conf. Interval]
-------------+----------------------------------------------------------------
        turn |  -1.246426   .2012157    -6.19   0.000    -1.640801   -.8520502
  gear_ratio |  -.3146499   1.697806    -0.19   0.853    -3.642288    3.012988
       _cons |   71.66502    12.3775     5.79   0.000     47.40556    95.92447
------------------------------------------------------------------------------
Instrumented:  turn
Instruments:   gear_ratio weig

In [9]:
%%stata

* One-step GMM with the same instruments (same point estimates as ivregress 2sls)
gmm (mpg - {b1}*turn - {b2}*gear_ratio - {b0}), instruments(gear_ratio weight length headroom) onestep

.  gmm (mpg - {b1}*turn - {b2}*gear_ratio - {b0}), instruments(gear_ratio weight length headr
Step 1
Iteration 0:   GMM criterion Q(b) =  475.42283  
Iteration 1:   GMM criterion Q(b) =  .16100633  
Iteration 2:   GMM criterion Q(b) =  .16100633  

GMM estimation 

Number of parameters =   3
Number of moments    =   5
Initial weight matrix: Unadjusted                 Number of obs   =         74

------------------------------------------------------------------------------
             |               Robust
             |      Coef.   Std. Err.      z    P>|z|     [95% Conf. Interval]
-------------+----------------------------------------------------------------
         /b1 |  -1.246426   .1970566    -6.33   0.000    -1.632649   -.8602019
         /b2 |  -.3146499   1.863079    -0.17   0.866    -3.966217    3.336917
         /b0 |   71.66502   12.68722     5.65   0.000     46.79853    96.53151
------------------------------------------------------------------------------
Instruments
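
One way to see why the one-step GMM point estimates match the 2SLS estimates is the closed form of the GMM minimizer for linear moment conditions. The matrix notation below is introduced here for illustration and is not from the original notes: $X$ stacks the regressors (including the constant) and $Z$ stacks the instruments (including the constant). It also assumes that the "Unadjusted" initial weight matrix is proportional to $(Z^{\prime}Z/N)^{-1}$.

$$\bar{m}(\beta) = (1/N) Z^{\prime}(y - X \beta)$$

$$\hat{\beta}_{GMM} = (X^{\prime} Z W Z^{\prime} X)^{-1} X^{\prime} Z W Z^{\prime} y$$

$$W = (Z^{\prime}Z/N)^{-1} \ \Rightarrow \ \hat{\beta}_{GMM} = [X^{\prime} Z (Z^{\prime}Z)^{-1} Z^{\prime} X]^{-1} X^{\prime} Z (Z^{\prime}Z)^{-1} Z^{\prime} y = \hat{\beta}_{2SLS}$$

Under these assumptions the coefficients coincide, while the standard errors can differ: the gmm table above is labeled Robust, and the ivregress table is not.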