### Our GARCH(1,1) model

Mean Model
$$ r_t = e_t $$ 
$$e_t | \mathcal{F}_{t-1} \sim N(0, \sigma_t^2)$$

Volatility Model 
$$ \sigma_t^2 = \omega +  \alpha e_{t-1}^2 + \beta \sigma_{t-1}^2 + 
    \gamma x_t^2$$

### Log likelihood
 $$ l(\omega, \alpha, \beta, \gamma) =  \sum_{t=1}^T {\frac{1}{2} ({-\log{2\pi} -\log{\sigma_t^2} - \frac{e_t^2}{\sigma_t^2}})} $$
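The variance recursion and the log likelihood above can be sketched in a few lines of NumPy. This is a minimal illustration, not a fitted model: the function names `garch_variance` and `log_likelihood` are hypothetical, and starting the recursion at the sample variance of `e` is an assumption, since the notes do not specify an initial value for $\sigma_1^2$.

```python
import numpy as np


def garch_variance(e, x, omega, alpha, beta, gamma, sigma2_init=None):
    """Run sigma_t^2 = omega + alpha*e_{t-1}^2 + beta*sigma_{t-1}^2 + gamma*x_t^2."""
    e = np.asarray(e, dtype=float)
    x = np.asarray(x, dtype=float)
    T = len(e)
    sigma2 = np.empty(T)
    # Assumption: the first variance is the sample variance of e unless given.
    sigma2[0] = np.var(e) if sigma2_init is None else sigma2_init
    for t in range(1, T):
        sigma2[t] = (omega + alpha * e[t - 1] ** 2
                     + beta * sigma2[t - 1] + gamma * x[t] ** 2)
    return sigma2


def log_likelihood(e, sigma2):
    """Gaussian log likelihood l = sum_t 0.5 * (-log(2 pi) - log(sigma_t^2) - e_t^2 / sigma_t^2)."""
    e = np.asarray(e, dtype=float)
    sigma2 = np.asarray(sigma2, dtype=float)
    return 0.5 * np.sum(-np.log(2 * np.pi) - np.log(sigma2) - e ** 2 / sigma2)
```

Passing the output of `garch_variance` to `log_likelihood` gives $l$ as a function of $(\omega, \alpha, \beta, \gamma)$, which is the quantity a numerical optimiser would maximise.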

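### Asymptotic distribution of parameters

1. Find the first partial derivative of $l$ with respect to each parameter. For example, for $\gamma$ (treating $\sigma_{t-1}^2$ as fixed, so that $\partial \sigma_t^2 / \partial \gamma = x_t^2$),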

$$ \begin{split}
    \frac{\partial l}{\partial \gamma} &= \frac{\partial l}{\partial \sigma^2} \frac{\partial \sigma^2}{\partial \gamma} \\
    &= \sum_{t=1}^T \frac{x_t^2}{2\sigma_t^2} (\frac{e_{t}^2}{\sigma_t^2} -1)
\end{split} $$



2. Find the second partial derivative with respect to each pair of parameters. With four parameters, there are 16 second partial derivatives (10 of them distinct, by symmetry). For instance,

$$ \begin{split}
    \frac{\partial^2 l}{\partial \gamma^2} &= \frac{\partial}{\partial \sigma^2} (\frac{\partial l }{\partial \gamma}) \frac{\partial \sigma^2}{\partial \gamma} \\
    &= \sum_{t=1}^T \frac{x_t^4}{\sigma_t^4} (\frac{1}{2} - \frac{e_t^2}{\sigma_t^2})
\end{split} $$

3. Find the expectation of the negative of each second partial derivative. Question: do we need to find the expectation, or is the raw form of the second derivative (such as shown in step 2) sufficient?

4. This $4 \times 4$ matrix then forms our Fisher information matrix, $I_E(\theta)$, where $\theta$ is the vector of our parameters.

5. The MLE is asymptotically multivariate normal with mean $(\omega_0, \alpha_0, \beta_0, \gamma_0)$ and covariance equal to the inverse of the Fisher information matrix from step 4.

6. Using the variance, we can find the p value associated with each parameter.
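
On the question in step 3, a short worked sketch for the $\gamma\gamma$ entry may help. It assumes that $e_t \mid \mathcal{F}_{t-1}$ has variance $\sigma_t^2$ (as the log likelihood implies) and that $x_t$ is known given $\mathcal{F}_{t-1}$; the second assumption is not stated in these notes. Conditioning on $\mathcal{F}_{t-1}$ first:

$$ \begin{split}
    E\left[\frac{e_t^2}{\sigma_t^2} \,\Big|\, \mathcal{F}_{t-1}\right] &= 1 \\
    -E\left[\frac{\partial^2 l}{\partial \gamma^2}\right] &= -\sum_{t=1}^T E\left[\frac{x_t^4}{\sigma_t^4} \left(\frac{1}{2} - 1\right)\right] = \sum_{t=1}^T E\left[\frac{x_t^4}{2\sigma_t^4}\right]
\end{split} $$

The remaining outer expectation has no closed form in general and is usually replaced by its sample analogue evaluated at $\hat\theta$. The raw negative second derivative from step 2 (the observed information) is the other common estimate.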

---
Questions:
1. Do we need the regularity conditions for the log likelihood function to hold in order for asymptotic normality and consistency to hold? If so, is this where our assumptions of parameter boundaries and distributions come into place?  


### First Derivatives/Score



#### a. 
$$ \begin{split}
    \frac{\partial l}{\partial \gamma} &= \frac{\partial l}{\partial \sigma^2} \frac{\partial \sigma^2}{\partial \gamma} \\
    &= \sum_{t=1}^T \frac{x_t^2}{2\sigma_t^2} (\frac{e_{t}^2}{\sigma_t^2} -1)
\end{split} $$

#### b.
$$ \begin{split}
    \frac{\partial l}{\partial \omega} &= \frac{\partial l}{\partial \sigma^2} \frac{\partial \sigma^2}{\partial \omega} \\
    &= \sum_{t=1}^T \frac{1}{2\sigma_t^2} (\frac{e_{t}^2}{\sigma_t^2} -1)
\end{split} $$

#### c.
$$ \begin{split}
    \frac{\partial l}{\partial \alpha} &= \frac{\partial l}{\partial \sigma^2} \frac{\partial \sigma^2}{\partial \alpha} \\
    &= \sum_{t=1}^T \frac{e_{t-1}^2}{2\sigma_t^2} (\frac{e_{t}^2}{\sigma_t^2} -1)
\end{split} $$

#### d. 
$$ \begin{split}
    \frac{\partial l}{\partial \beta} &= \frac{\partial l}{\partial \sigma^2} \frac{\partial \sigma^2}{\partial \beta} \\
    &= \sum_{t=1}^T \frac{{\sigma_{t-1}}^2}{2\sigma_t^2} (\frac{e_{t}^2}{\sigma_t^2} -1)
\end{split} $$

where $\hat\theta$ is the MLE


## Functions for first derivative

In [2]:
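import numpy as np
import pandas as pd

# Score functions: partial derivatives of the log likelihood l with respect
# to each parameter, following formulas a. to d. above.
#
# Common parameters (np.array or pd.Series, all of the same length T):
#   x             : the input time series x_t
#   sigma_squared : the variance values sigma_t^2 for each time step
#   e             : the error terms e_t for each time step
# Each function returns the computed derivative value as a float.


def _as_arrays(*series):
    """Convert inputs to float arrays so that slicing is purely positional."""
    return tuple(np.asarray(s, dtype=float) for s in series)


### a.
def partial_l_gamma(x, sigma_squared, e):
    x, sigma_squared, e = _as_arrays(x, sigma_squared, e)
    return np.sum((x**2 / (2 * sigma_squared)) * ((e**2 / sigma_squared) - 1))

### b.
def partial_l_omega(sigma_squared, e):
    sigma_squared, e = _as_arrays(sigma_squared, e)
    return np.sum((1 / (2 * sigma_squared)) * ((e**2 / sigma_squared) - 1))

### c.
def partial_l_alpha(x, sigma_squared, e):
    # x is unused: the alpha score depends on the lagged error e_{t-1}.
    sigma_squared, e = _as_arrays(sigma_squared, e)
    # Pair e_{t-1} with sigma_t^2 and e_t; t = 1 has no lag and is dropped.
    e_lag, s2, e_cur = e[:-1], sigma_squared[1:], e[1:]
    return np.sum((e_lag**2 / (2 * s2)) * ((e_cur**2 / s2) - 1))

### d.
def partial_l_beta(x, sigma_squared, e):
    # x is unused: the beta score depends on the lagged variance sigma_{t-1}^2.
    sigma_squared, e = _as_arrays(sigma_squared, e)
    # Pair sigma_{t-1}^2 with sigma_t^2 and e_t; t = 1 has no lag and is dropped.
    s2_lag, s2, e_cur = sigma_squared[:-1], sigma_squared[1:], e[1:]
    return np.sum((s2_lag / (2 * s2)) * ((e_cur**2 / s2) - 1))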


### Second Derivatives
#### (1) 
$$ \begin{split}
    \frac{\partial^2 l}{\partial \gamma^2} &= \sum_{t=1}^T \frac{x_t^4}{\sigma_t^4} (\frac {1}{2} - \frac {e_{t}^2}{\sigma_t^2})\\
    
\end{split} $$

#### (2)
$$ \begin{split}
    \frac{\partial^2 l}{\partial \omega^2} &= \sum_{t=1}^T \frac{1}{\sigma_t^4} (\frac {1}{2} - \frac {e_{t}^2}{\sigma_t^2})\\
    
\end{split} $$

#### (3) 
$$ \begin{split}
    \frac{\partial^2 l}{\partial \alpha^2} &= \sum_{t=1}^T \frac{e_{t-1}^4}{\sigma_t^4} (\frac{1}{2} -\frac {e_{t}^2}{\sigma_t^2})\\
    
\end{split} $$

#### (4)
$$ \begin{split}
    \frac{\partial^2 l}{\partial \beta^2} &= \sum_{t=1}^T \frac{\sigma_{t-1}^4}{\sigma_t^4} (\frac{1}{2} - \frac {e_{t}^2}{\sigma_t^2})\\
    
\end{split} $$

#### (5)
$$ \begin{split}
    \frac{\partial^2 l}{\partial \omega \partial \gamma} &= \sum_{t=1}^T \frac{x_t^2}{\sigma_t^4} (\frac{1}{2} - \frac {e_{t}^2}{\sigma_t^2})\\
    
\end{split} $$

#### (6)
$$ \begin{split}
    \frac{\partial^2 l}{\partial \alpha \partial \gamma} &= \sum_{t=1}^T \frac{e_{t-1}^2 x_t^2}{\sigma_t^4} (\frac{1}{2} - \frac {e_{t}^2}{\sigma_t^2})\\
    
\end{split} $$

#### (7)
$$ \begin{split}
    \frac{\partial^2 l}{\partial \beta \partial \gamma} &= \sum_{t=1}^T \frac{\sigma_{t-1}^2 x_t^2}{\sigma_t^4} (\frac{1}{2} - \frac {e_{t}^2}{\sigma_t^2})\\
    
\end{split} $$

#### (8)
$$ \begin{split}
    \frac{\partial^2 l}{\partial \gamma \partial \omega} &= \sum_{t=1}^T \frac{x_t^2}{\sigma_t^4} (\frac{1}{2} - \frac {e_{t}^2}{\sigma_t^2})\\
    
\end{split} $$

#### (9)
$$ \begin{split}
    \frac{\partial^2 l}{\partial \alpha \partial \omega} &= \sum_{t=1}^T \frac{e_{t-1}^2}{\sigma_t^4} (\frac{1}{2} - \frac {e_{t}^2}{\sigma_t^2})\\
    
\end{split} $$

#### (10)
$$ \begin{split}
    \frac{\partial^2 l}{\partial \beta \partial \omega} &= \sum_{t=1}^T \frac{\sigma_{t-1}^2}{\sigma_t^4} (\frac{1}{2} - \frac {e_{t}^2}{\sigma_t^2})\\
    
\end{split} $$

#### (11)
$$ \begin{split}
    \frac{\partial^2 l}{\partial \gamma \partial \alpha} &= \sum_{t=1}^T \frac{e_{t-1}^2 x_t^2}{\sigma_t^4} (\frac{1}{2} - \frac {e_{t}^2}{\sigma_t^2})\\
    
\end{split} $$

#### (12)
$$ \begin{split}
    \frac{\partial^2 l}{\partial \omega \partial \alpha} &= \sum_{t=1}^T \frac{e_{t-1}^2}{\sigma_t^4} (\frac{1}{2} - \frac {e_{t}^2}{\sigma_t^2})\\
    
\end{split} $$

#### (13)
$$ \begin{split}
    \frac{\partial^2 l}{\partial \beta \partial \alpha} &= \sum_{t=1}^T \frac{e_{t-1}^2 \sigma_{t-1}^2}{\sigma_t^4} (\frac{1}{2} - \frac {e_{t}^2}{\sigma_t^2})\\
    
\end{split} $$

#### (14)
$$ \begin{split}
    \frac{\partial^2 l}{\partial \gamma \partial \beta} &= \sum_{t=1}^T \frac{x_t^2\sigma_{t-1}^2}{\sigma_t^4} (\frac{1}{2} - \frac {e_{t}^2}{\sigma_t^2})\\
    
\end{split} $$

#### (15)
$$ \begin{split}
    \frac{\partial^2 l}{\partial \omega \partial \beta} &= \sum_{t=1}^T \frac{\sigma_{t-1}^2}{\sigma_t^4} (\frac{1}{2} - \frac {e_{t}^2}{\sigma_t^2})\\
    
\end{split} $$

#### (16) 
$$ \begin{split}
    \frac{\partial^2 l}{\partial \alpha \partial \beta} &= \sum_{t=1}^T \frac{\sigma_{t-1}^2 e_{t-1}^2}{\sigma_t^4} (\frac{1}{2} - \frac {e_{t}^2}{\sigma_t^2})\\
    
\end{split} $$
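
All sixteen formulas share one pattern: entry $(i, j)$ is $\sum_t \frac{a_{t,i} a_{t,j}}{\sigma_t^4} (\frac{1}{2} - \frac{e_t^2}{\sigma_t^2})$ with $a_t = (1, e_{t-1}^2, \sigma_{t-1}^2, x_t^2)$, the derivatives of $\sigma_t^2$ with respect to $(\omega, \alpha, \beta, \gamma)$. The sketch below builds the full matrix in one pass. The function name `hessian` is hypothetical, and dropping $t = 1$ for every entry (because the lagged terms are undefined there) is an assumption made for simplicity.

```python
import numpy as np


def hessian(x, sigma_squared, e):
    """4x4 matrix of second derivatives of l, ordered (omega, alpha, beta, gamma)."""
    x = np.asarray(x, dtype=float)
    s2 = np.asarray(sigma_squared, dtype=float)
    e = np.asarray(e, dtype=float)
    # Assumption: t = 1 is dropped everywhere since e_0 and sigma_0^2 are unknown.
    a = np.column_stack([
        np.ones(len(e) - 1),  # d sigma_t^2 / d omega
        e[:-1] ** 2,          # d sigma_t^2 / d alpha = e_{t-1}^2
        s2[:-1],              # d sigma_t^2 / d beta  = sigma_{t-1}^2
        x[1:] ** 2,           # d sigma_t^2 / d gamma = x_t^2
    ])
    # Common weight (1 / sigma_t^4) * (1/2 - e_t^2 / sigma_t^2).
    w = (0.5 - e[1:] ** 2 / s2[1:]) / s2[1:] ** 2
    # Sum over t of w_t * a_t a_t^T.
    return a.T @ (a * w[:, None])
```

The result is symmetric by construction, so the pairs (5)/(8), (6)/(11), (7)/(14), (9)/(12), (10)/(15) and (13)/(16) never need to be coded separately.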

## Asymptotic distribution of the MLE

Let $\theta = (\omega, \alpha, \beta, \gamma)^T$ be our parameter vector. 

The Fisher information matrix is given by (in its observed form, i.e. without taking the expectation from step 3)

$$ I(\theta) = - \begin{pmatrix}
\frac{\partial^2 l}{\partial \omega^2} & \frac{\partial^2 l}{\partial \omega \partial \alpha} & \frac{\partial^2 l}{\partial \omega \partial \beta} & \frac{\partial^2 l}{\partial \omega \partial \gamma} \\
\frac{\partial^2 l}{\partial \alpha \partial \omega} & \frac{\partial^2 l}{\partial \alpha^2} & \frac{\partial^2 l}{\partial \alpha \partial \beta} & \frac{\partial^2 l}{\partial \alpha \partial \gamma} \\
\frac{\partial^2 l}{\partial \beta \partial \omega} & \frac{\partial^2 l}{\partial \beta \partial \alpha} & \frac{\partial^2 l}{\partial \beta^2} & \frac{\partial^2 l}{\partial \beta \partial \gamma} \\
\frac{\partial^2 l}{\partial \gamma \partial \omega} & \frac{\partial^2 l}{\partial \gamma \partial \alpha} & \frac{\partial^2 l}{\partial \gamma \partial \beta} & \frac{\partial^2 l}{\partial \gamma^2} \\
\end{pmatrix}$$

where each partial derivative is given above.




Assuming consistency (we probably prove this before this part?), the MLE $\hat{\theta}$ is asymptotically multivariate normal with mean $\theta_0$, the true parameter vector, and covariance $I(\theta_0)^{-1}$, which we estimate by $I(\hat{\theta})^{-1}$, i.e. $$\hat{\theta} \overset{a}{\sim} MVN_d(\theta_0, I(\hat{\theta})^{-1})$$ with $d = 4$ here.
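
Given an estimate $\hat\theta$ and an information matrix, the standard errors and p-values follow directly from this normal approximation. The sketch below is illustrative only: `wald_summary` is a hypothetical name, the test is the two-sided test of each parameter being zero, and the p-value uses the identity $2(1 - \Phi(|z|)) = \operatorname{erfc}(|z| / \sqrt{2})$.

```python
import math

import numpy as np


def wald_summary(theta_hat, information):
    """Standard errors, z statistics and two-sided p-values for H0: theta_i = 0."""
    theta_hat = np.asarray(theta_hat, dtype=float)
    # Asymptotic covariance is the inverse of the information matrix.
    cov = np.linalg.inv(np.asarray(information, dtype=float))
    se = np.sqrt(np.diag(cov))
    z = theta_hat / se
    # Two-sided normal p-value: 2 * (1 - Phi(|z|)) = erfc(|z| / sqrt(2)).
    p = np.array([math.erfc(abs(zi) / math.sqrt(2.0)) for zi in z])
    return se, z, p
```

For parameters restricted to be non-negative, a null value of zero lies on the boundary of the parameter space, where the normal approximation is not guaranteed to hold, so such p-values should be read with caution.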


Proving consistency: <a href='https://en.wikipedia.org/wiki/Maximum_likelihood_estimation#Consistency'>Consistency</a>

Regularity conditions: <a href='https://en.wikipedia.org/wiki/Fisher_information#Regularity_conditions'>Regularity, Fisher Information</a>

# Proof of consistency
1. Ergodicity,
2. Stationarity,
3. $\theta_0$ is not on the boundary of the parameter space.

If a time series $X_t$ is stationary and ergodic, then 
$$\frac{1}{T} \sum_{t=1}^T X_t \xrightarrow[]{\text{p}} \mu $$ 
where  $\mu = E[X_t] < \infty$.
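
A small simulation can illustrate this convergence. The example uses a stationary Gaussian AR(1) process, chosen only because it is a simple stationary and ergodic series; it is not the GARCH model from these notes, and the name `simulate_ar1` and all parameter values are made up for the illustration.

```python
import numpy as np


def simulate_ar1(T, mu=2.0, phi=0.5, seed=0):
    """Simulate X_t = mu + phi * (X_{t-1} - mu) + eps_t with eps_t ~ N(0, 1)."""
    rng = np.random.default_rng(seed)
    eps = rng.standard_normal(T)
    X = np.empty(T)
    # Start from the stationary distribution, which has variance 1 / (1 - phi^2).
    X[0] = mu + eps[0] / np.sqrt(1 - phi ** 2)
    for t in range(1, T):
        X[t] = mu + phi * (X[t - 1] - mu) + eps[t]
    return X


X = simulate_ar1(200_000)
# Sample mean over the first T observations, for every T.
running_mean = np.cumsum(X) / np.arange(1, len(X) + 1)
for T in (100, 10_000, 200_000):
    print(T, running_mean[T - 1])
```

The printed sample means should settle near $\mu = 2$ as $T$ grows, even though consecutive observations are dependent.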

### Ergodic Theorem

$$
