# STATISTICAL PROPERTIES OF OLS

<br>

## Introduction

<br>
In the previous notebook we discussed several properties of the OLS coefficient estimators, in particular those established by the Gauss-Markov theorem :

<br>
<ul style="list-style-type:square">
    <li>
        <b>linearity</b>
    </li>
    <li>
        <b>unbiasedness (finite-sample property)</b>
    </li>  
    <li>
        <b>efficiency (finite-sample property)</b>
    </li>
</ul>

<br>
In this notebook we examine the asymptotic (or large-sample) property of consistency and discuss how it relates to unbiasedness.


## Consistency of the OLS Estimators

<br>
Recall from the notebook on the statistical properties of OLS estimation that :

<br>
<blockquote> 
<i>the asymptotic (or large-sample) properties of an estimator $\hat{\boldsymbol{\beta}}$ are the properties of its sampling distribution as the sample size <b>m</b> becomes indefinitely large, that is, as <b>m</b> approaches infinity.</i>
</blockquote>

<br>
<blockquote>
<i>the estimator $\hat{\boldsymbol{\beta}}$ is a consistent estimator of the population parameter $\boldsymbol{\beta}$ if its sampling distribution converges to (or collapses on) the value of the population parameter as $m \rightarrow \infty$</i>
</blockquote>

Let's start by re-writing $\hat{\boldsymbol{\beta}}_\boldsymbol{OLS-1}$ as : 

<br>
$
    \quad
    \begin{align}
        \hat{\boldsymbol{\beta}}_\boldsymbol{OLS-1}
        &=  
            \qquad \qquad \qquad \qquad \qquad \qquad \qquad 
            \qquad \qquad \qquad \qquad \qquad \qquad \qquad 
            &
        \newline
        &= \frac
            {\sum_{i=1}^{m} 
                (\boldsymbol{\mathbf{X}_i} - \overline{\mathbf{X}})
                (\boldsymbol{\mathbf{Y}_i} - \overline{\mathbf{Y}})
            }
            {\sum_{i=1}^{m} (\boldsymbol{\mathbf{X}_i} - \overline{\mathbf{X}})^2}
            = \frac
                {
                      \sum_{i=1}^{m} (\boldsymbol{\mathbf{X}_i} - \overline{\mathbf{X}}) \boldsymbol{\mathbf{Y}_i}
                    - \sum_{i=1}^{m} (\boldsymbol{\mathbf{X}_i} - \overline{\mathbf{X}}) \overline{\mathbf{Y}}
                }
                {\sum_{i=1}^{m} (\boldsymbol{\mathbf{X}_i} - \overline{\mathbf{X}})^2}
            = \frac
                {
                      \sum_{i=1}^{m} (\boldsymbol{\mathbf{X}_i} - \overline{\mathbf{X}}) \boldsymbol{\mathbf{Y}_i}
                    - \overline{\mathbf{Y}} \sum_{i=1}^{m} (\boldsymbol{\mathbf{X}_i} - \overline{\mathbf{X}})
                }
                {\sum_{i=1}^{m} (\boldsymbol{\mathbf{X}_i} - \overline{\mathbf{X}})^2}
        \newline
        & = \frac
            {
                  \sum_{i=1}^{m} (\boldsymbol{\mathbf{X}_i} - \overline{\mathbf{X}}) \boldsymbol{\mathbf{Y}_i}
                - \overline{\mathbf{Y}} \cdot 0
            }
            {\sum_{i=1}^{m} (\boldsymbol{\mathbf{X}_i} - \overline{\mathbf{X}})^2}
        = \frac
            {\sum_{i=1}^{m} (\boldsymbol{\mathbf{X}_i} - \overline{\mathbf{X}}) \boldsymbol{\mathbf{Y}_i} }
            {\sum_{i=1}^{m} (\boldsymbol{\mathbf{X}_i} - \overline{\mathbf{X}})^2}
            & \text{by } \textbf{PRE}
        \newline \newline
        &= \frac
            {
                \sum_{i=1}^{m} (\boldsymbol{\mathbf{X}_i} - \overline{\mathbf{X}}) 
                (\boldsymbol{\beta_0} + \boldsymbol{\beta_1} \boldsymbol{\mathbf{X}_i} + \boldsymbol{\mathbf{\varepsilon}_i})
            }
            {\sum_{i=1}^{m} (\boldsymbol{\mathbf{X}_i} - \overline{\mathbf{X}})^2}
        \newline
        &= 
              \boldsymbol{\beta_0} \sum_{i=1}^{m} \boldsymbol{\mathbf{c}_i}
            + \boldsymbol{\beta_1} \sum_{i=1}^{m} \boldsymbol{\mathbf{c}_i} \boldsymbol{\mathbf{X}_i}
            + \sum_{i=1}^{m} \boldsymbol{\mathbf{c}_i} \boldsymbol{\mathbf{\varepsilon}_i}
            & \text{by } \textbf{P1} \text{ and } \textbf{P3}
        \newline
        &= 
            \boldsymbol{\beta_1} + \sum_{i=1}^{m} \boldsymbol{\mathbf{c}_i} \boldsymbol{\mathbf{\varepsilon}_i}
            & \text{by } \textbf{A2}
        \newline
        &= 
             \boldsymbol{\beta_1} 
            +\frac
                {\sum_{i=1}^{m} 
                    (\boldsymbol{\mathbf{X}_i} - \overline{\mathbf{X}}) \boldsymbol{\mathbf{\varepsilon}_i} 
                    - \overline{\boldsymbol{\varepsilon}} \sum_{i=1}^{m} (\boldsymbol{\mathbf{X}_i} - \overline{\mathbf{X}})
                }
                { \sum_{i=1}^{m} (\boldsymbol{\mathbf{X}_i} - \overline{\mathbf{X}})^2 }
        = 
             \boldsymbol{\beta_1} 
            +\frac
                {\sum_{i=1}^{m} 
                    (\boldsymbol{\mathbf{X}_i} - \overline{\mathbf{X}}) 
                    (\boldsymbol{\mathbf{\varepsilon}_i}  - \overline{\boldsymbol{\varepsilon}})
                }
                { \sum_{i=1}^{m} (\boldsymbol{\mathbf{X}_i} - \overline{\mathbf{X}})^2 }
                & [\textbf{E1}]
    \end{align}
$

where the $\boldsymbol{\mathbf{c}_i}$ are defined by

<br>
$
    \quad
    \boldsymbol{\mathbf{c}_i} = \dfrac
        {\boldsymbol{\mathbf{X}_i} - \overline{\mathbf{X}}}
        {\sum_{j=1}^{m} (\boldsymbol{\mathbf{X}_j} - \overline{\mathbf{X}})^2}
$

<br>
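The first part of this derivation is the same one we used in the notebook on the Gauss-Markov theorem to prove the linearity of the OLS estimators; properties <b>P1</b> and <b>P3</b> are stated in that notebook. The final step relies on $\sum_{i=1}^{m} (\boldsymbol{\mathbf{X}_i} - \overline{\mathbf{X}}) = 0$ : subtracting $\overline{\boldsymbol{\varepsilon}}$ times this sum leaves the numerator unchanged, which lets us write the error term in deviations from its sample mean.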
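
<br>
The decomposition can be checked numerically. The sketch below is only an illustration: the parameter values, the sample size and the normal distributions are assumptions made for the example, not part of the derivation. It takes the weights $\boldsymbol{\mathbf{c}_i}$ to be the deviation $\boldsymbol{\mathbf{X}_i} - \overline{\mathbf{X}}$ divided by the sum of squared deviations, and compares the OLS slope with the right-hand side of <b>E1</b>.

```python
import numpy as np

rng = np.random.default_rng(0)  # arbitrary seed, for reproducibility only

# Hypothetical population parameters and sample size (illustrative values)
beta_0, beta_1, m = 2.0, 0.5, 200

# Simulated sample from Y_i = beta_0 + beta_1 * X_i + eps_i
X = rng.normal(loc=1.0, scale=2.0, size=m)
eps = rng.normal(loc=0.0, scale=1.0, size=m)
Y = beta_0 + beta_1 * X + eps

x_dev = X - X.mean()             # deviations X_i - X_bar
c = x_dev / np.sum(x_dev ** 2)   # weights c_i

# OLS slope from the usual ratio of sums
beta_hat = np.sum(x_dev * (Y - Y.mean())) / np.sum(x_dev ** 2)

# Properties of the weights: sum(c_i) = 0 and sum(c_i * X_i) = 1
sum_c = np.sum(c)
sum_cX = np.sum(c * X)

# Right-hand side of E1
rhs_E1 = beta_1 + np.sum(x_dev * (eps - eps.mean())) / np.sum(x_dev ** 2)

print("sum c_i       :", sum_c)
print("sum c_i * X_i :", sum_cX)
print("beta_hat      :", beta_hat)
print("RHS of E1     :", rhs_E1)
```

Up to floating-point error, the last two printed values should coincide. In real data the errors are unobserved, so this check is only possible in a simulation where we generate them ourselves.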

<br>
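To prove consistency, we apply the probability-limit operator to <b>E1</b> and use the <b>Law of Large Numbers</b>, which states that, under general conditions, sample moments converge in probability to their corresponding population moments. We also use the fact that the probability limit of a ratio equals the ratio of the probability limits, provided the limit of the denominator is non-zero (here, $\mathrm{Var}(\mathbf{X}) > 0$).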
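
<br>
As a rough illustration of the Law of Large Numbers in this setting, the sketch below computes the sample covariance between $\mathbf{X}$ and $\boldsymbol{\varepsilon}$ and the sample variance of $\mathbf{X}$ for increasing sample sizes. The distributions and the value $\mathrm{Var}(\mathbf{X}) = 4$ are assumptions chosen for the example; the errors are drawn independently of $\mathbf{X}$, so the population covariance is zero by construction.

```python
import numpy as np

rng = np.random.default_rng(1)  # arbitrary seed, for reproducibility only

sigma_X = 2.0  # hypothetical standard deviation of X, so Var(X) = 4


def sample_moments(m):
    """Return (sample Cov(X, eps), sample Var(X)) for one sample of size m."""
    X = rng.normal(0.0, sigma_X, size=m)
    eps = rng.normal(0.0, 1.0, size=m)  # independent of X: Cov(X, eps) = 0
    x_dev = X - X.mean()
    cov = np.mean(x_dev * (eps - eps.mean()))  # (1/m) * sum of cross-products
    var = np.mean(x_dev ** 2)                  # (1/m) * sum of squares
    return cov, var


sizes = (100, 10_000, 1_000_000)
moments = {m: sample_moments(m) for m in sizes}

for m, (cov, var) in moments.items():
    print(f"m = {m:>9,d}   sample Cov = {cov: .5f}   sample Var = {var:.5f}")
```

As the sample size grows, the sample covariance should approach 0 and the sample variance should approach 4, the population values assumed in this example.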

<br>
$
    \quad
    \begin{align}
        p\lim_{m \rightarrow \infty} \ \hat{\boldsymbol{\beta}}_\boldsymbol{OLS-1}
        &=
            \qquad \qquad \qquad \qquad \qquad \qquad \qquad  \qquad \qquad \quad
            & \text{by } \textbf{E1}
        \newline
        &= 
            \boldsymbol{\beta_1} 
            + p\lim_{m \rightarrow \infty} \ 
            \dfrac
                {\sum_{i=1}^{m} 
                    (\boldsymbol{\mathbf{X}_i} - \overline{\mathbf{X}}) 
                    (\boldsymbol{\mathbf{\varepsilon}_i}  - \overline{\boldsymbol{\varepsilon}})
                }
                { \sum_{i=1}^{m} (\boldsymbol{\mathbf{X}_i} - \overline{\mathbf{X}})^2 }
            & \text{by dividing both numerator and denominator by } m
        \newline
        &
            & \text{we prevent the two } \sum \text{ from going to infinity when } m \rightarrow \infty
        \newline
        &= 
            \boldsymbol{\beta_1} 
            + \dfrac
                {
                    p\lim_{m \rightarrow \infty} \ 
                    \frac{1}{m}
                    \sum_{i=1}^{m} 
                        (\boldsymbol{\mathbf{X}_i} - \overline{\mathbf{X}}) 
                        (\boldsymbol{\mathbf{\varepsilon}_i}  - \overline{\boldsymbol{\varepsilon}})
                }
                {
                    p\lim_{m \rightarrow \infty} \ 
                    \frac{1}{m}
                    \sum_{i=1}^{m} (\boldsymbol{\mathbf{X}_i} - \overline{\mathbf{X}})^2
                }
            & \text{Law of Large Numbers}
        \newline
        &= 
            \boldsymbol{\beta_1} 
            + \dfrac
                { \mathrm{Cov} (\mathbf{X}, \boldsymbol{\varepsilon}) }
                { \mathrm{Var} (\mathbf{X}) }
            & \text{by } \textbf{A2}
        \newline
        &= \boldsymbol{\beta_1}        
    \end{align}
$

<br>
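Therefore, provided that $\mathrm{Cov} (\mathbf{X}, \boldsymbol{\varepsilon}) = 0$ and $\mathrm{Var} (\mathbf{X}) > 0$, $\hat{\boldsymbol{\beta}}_\boldsymbol{OLS-1}$ is a consistent estimator of the corresponding population parameter $\boldsymbol{\beta_1}$.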
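
<br>
A small Monte Carlo experiment can make this result more tangible. The sketch below is a minimal illustration under assumed values: the parameters, the normal distributions, the sample sizes and the number of replications are all chosen for the example. For each sample size it draws many samples, computes the OLS slope in each, and reports the mean and standard deviation of the estimates.

```python
import numpy as np

rng = np.random.default_rng(42)  # arbitrary seed, for reproducibility only

beta_0, beta_1 = 2.0, 0.5  # hypothetical population parameters


def ols_slope(X, Y):
    """OLS slope: sum of cross-deviations over sum of squared deviations."""
    x_dev = X - X.mean()
    return np.sum(x_dev * (Y - Y.mean())) / np.sum(x_dev ** 2)


def simulate_slopes(m, n_rep=500):
    """Draw n_rep samples of size m and return the OLS slope of each."""
    slopes = np.empty(n_rep)
    for r in range(n_rep):
        X = rng.normal(1.0, 2.0, size=m)
        eps = rng.normal(0.0, 1.0, size=m)  # independent of X
        slopes[r] = ols_slope(X, beta_0 + beta_1 * X + eps)
    return slopes


means, spread = {}, {}
for m in (20, 200, 2000):
    slopes = simulate_slopes(m)
    means[m] = slopes.mean()
    spread[m] = slopes.std()
    print(f"m = {m:5d}   mean = {means[m]:.4f}   std = {spread[m]:.4f}")
```

The mean of the estimates should stay close to the assumed $\boldsymbol{\beta_1}$ for every sample size, while their standard deviation should shrink as <b>m</b> grows: the sampling distribution collapses on the population parameter.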


## References

<br>
<ul style="list-style-type:square">
    <li>
         University of Valencia - Ezequiel Uriel - 
         <a href="https://bit.ly/2x9cSh6">
         The simple regression model : estimation and properties</a>        
    </li>
</ul>