# GARCH Models

## Theoretical Background of ARMA-GARCH


### Defining ARMA and GARCH

The *autoregressive moving average* (ARMA) representation of a time series combines a stationary autoregressive (AR) model with a stationary moving-average (MA) error process. The *generalised autoregressive conditional heteroscedasticity* (GARCH) model is made up of two equations: the conditional mean equation and the conditional variance equation. By representing the conditional mean equation as an ARMA process, one obtains the ARMA-GARCH model.


### Unconditional vs. Conditional Values

The unconditional mean and variance are the mean and variance of the overall distribution and are assumed to be constant. The conditional mean and variance, on the other hand, can change at every point in time because they depend on historical values (i.e. they are conditioned on past information). Volatility often forms in 'clusters', meaning that high volatility tends to be sustained over a certain time period. This time-varying, persistent conditional variance is the foundation of GARCH models.
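
The difference between the two kinds of variance can be illustrated with a small simulation. This is a minimal sketch and not the model fitted in this document: it assumes the standard GARCH(1,1) recursion $\sigma_t^2 = \omega + \alpha \epsilon_{t-1}^2 + \beta \sigma_{t-1}^2$ with Gaussian shocks, and the parameter values (`omega=0.1`, `alpha=0.1`, `beta=0.8`) are chosen purely for illustration.

```python
import random
import statistics


def unconditional_variance(omega, alpha, beta):
    """Long-run variance of a GARCH(1,1) process; requires alpha + beta < 1."""
    if alpha + beta >= 1:
        raise ValueError("alpha + beta must be below 1 for a finite variance")
    return omega / (1.0 - alpha - beta)


def simulate_garch11(n, omega, alpha, beta, seed=0):
    """Simulate zero-mean returns and their conditional variances.

    Assumption: Gaussian shocks and illustrative parameters, not estimates.
    """
    rng = random.Random(seed)
    sigma2 = unconditional_variance(omega, alpha, beta)  # start at the long-run level
    returns, variances = [], []
    for _ in range(n):
        shock = rng.gauss(0.0, 1.0) * sigma2 ** 0.5
        returns.append(shock)
        variances.append(sigma2)
        # Next period's variance reacts to today's squared shock (clustering).
        sigma2 = omega + alpha * shock ** 2 + beta * sigma2
    return returns, variances


if __name__ == "__main__":
    r, s2 = simulate_garch11(20_000, omega=0.1, alpha=0.1, beta=0.8)
    print("unconditional variance:    ", unconditional_variance(0.1, 0.1, 0.8))
    print("sample variance of returns:", statistics.pvariance(r))
    print("conditional variance range:", min(s2), "to", max(s2))
```

Under these assumptions the conditional variance moves from period to period, while the sample variance over a long path should settle near the constant unconditional value $\omega / (1 - \alpha - \beta)$.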

### Return Distributions

(*from this point on, 'returns' refers to log-returns*)

Let $\mathcal{F}_{t-1}$ be the *filtration* of past returns, which is simply the *information set* of all observed returns up to time $t-1$. Let $r_t$ denote the log return at time $t$, for $t = 1,\dots,T$. If the distribution of returns is assumed to be *Normal*, one can write

\begin{equation}
    r_t \vert \mathcal{F}_{t-1} \sim N(\bar{r_t}, \sigma_t^2),
\end{equation}

where $\bar{r_t}$ is the conditional mean and $\sigma_t^2$ is the conditional variance of returns. However, while returns are often assumed to follow a Normal distribution, this is not the case with our data, which exhibit clear leptokurtosis ('fat tails') and skewness. Therefore, multiple distributions are considered:

- Normal Distribution (NORM)
- Generalised Error Distribution (GED)
- Student t Distribution (STD)
- Skewed Normal Distribution (SNORM)
- Skewed Generalised Error Distribution (SGED)
- Skewed Student t Distribution (SSTD)
- Generalised Hyperbolic Distribution (GHYP)
- Generalised Hyperbolic Skewed Student t Distribution
- Normal Inverse Gaussian Distribution (NIG)
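
The departure from Normality that motivates this list can be quantified with the sample skewness and excess kurtosis of the log-returns, both of which are zero for a Normal distribution. The sketch below uses only the standard library; the `prices` series is made up for illustration and is not the data analysed here.

```python
import math


def log_returns(prices):
    """Log-returns r_t = ln(P_t / P_{t-1}) from a list of positive prices."""
    return [math.log(p1 / p0) for p0, p1 in zip(prices, prices[1:])]


def central_moment(xs, order):
    """Sample central moment of the given order (divides by n)."""
    mean = sum(xs) / len(xs)
    return sum((x - mean) ** order for x in xs) / len(xs)


def skewness(xs):
    """Third standardised moment; 0 for a symmetric distribution."""
    return central_moment(xs, 3) / central_moment(xs, 2) ** 1.5


def excess_kurtosis(xs):
    """Fourth standardised moment minus 3; positive values indicate fat tails."""
    return central_moment(xs, 4) / central_moment(xs, 2) ** 2 - 3.0


if __name__ == "__main__":
    # Hypothetical price path, for illustration only.
    prices = [100.0, 101.2, 100.7, 96.0, 96.4, 97.1, 96.9, 99.8, 99.5, 99.9]
    r = log_returns(prices)
    print("skewness:       ", skewness(r))
    print("excess kurtosis:", excess_kurtosis(r))
```

A clearly positive excess kurtosis or a non-zero skewness in the real return series would support considering the fat-tailed and skewed candidates above.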

Subsequently, the Akaike information criterion (AIC) will be used to assess the quality of each model. The AIC penalises a high number of estimated parameters and is hence a good criterion for obtaining a parsimonious model, balancing goodness of fit against the number of parameters:

\begin{equation}
    AIC = 2k - 2\ln(\hat{L}),
\end{equation}

where $k$ is the number of estimated parameters and $\hat{L}$ is the maximum value of the likelihood function for the model. Among the candidate models, the one with the lowest AIC is preferred.
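
As a rough illustration of how the AIC formula is applied, the sketch below compares two distributions whose maximum-likelihood estimates have closed forms: the Normal and the Laplace (the latter being the special case of the GED with shape parameter 1). This is an assumption made to keep the example self-contained; it is not the ARMA-GARCH estimation itself, and the `returns` values are invented.

```python
import math
import statistics


def aic(k, loglik):
    """AIC = 2k - 2 ln(L_hat), with loglik = ln(L_hat)."""
    return 2 * k - 2 * loglik


def normal_loglik(xs):
    """Maximised Normal log-likelihood (MLE mean and variance, k = 2)."""
    n = len(xs)
    mu = sum(xs) / n
    var = sum((x - mu) ** 2 for x in xs) / n
    return -0.5 * n * (math.log(2 * math.pi * var) + 1.0)


def laplace_loglik(xs):
    """Maximised Laplace log-likelihood (MLE location = median, k = 2)."""
    n = len(xs)
    loc = statistics.median(xs)
    scale = sum(abs(x - loc) for x in xs) / n
    return -n * (math.log(2 * scale) + 1.0)


if __name__ == "__main__":
    # Hypothetical log-returns with two large moves, for illustration only.
    returns = [0.001, -0.002, 0.0005, 0.05, -0.001, 0.0, 0.002, -0.06, 0.001, -0.0015]
    print("AIC (Normal): ", aic(2, normal_loglik(returns)))
    print("AIC (Laplace):", aic(2, laplace_loglik(returns)))
```

Both candidates here have $k = 2$, so the comparison reduces to the log-likelihoods; with distributions that add shape or skew parameters, the $2k$ term is what offsets the extra flexibility.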