### Week 10 - Volatility Models GARCH

### 1. Motivation

We switch focus from modelling the mean to modelling the variance.

Volatility models are useful in risk management, Value at Risk (VaR), option pricing, long-run marginal expected shortfall, predictive regressions, and studying the interaction between volatility and the real economy.

Models of volatility:
*   Generalised Autoregressive Conditional Heteroskedasticity (GARCH)
*   Realised Volatility/Variance (RV)
*   Stochastic Volatility (SV)
*   Implied Volatility (IV)
*   Range Volatility

Return series tend to show bursts of volatility with positive autocorrelation in the variance: there are quiet periods and more volatile periods, almost like two regimes. This means the variance is non-constant over time, hence heteroskedastic.

Descriptive statistics are computed under the assumption of a constant variance, so volatility clustering suggests that this assumption is misspecified.

The autocorrelation of returns shows no significant AR relationship, based on the ACF and PACF, i.e. no predictability in the mean.

However, we see autocorrelation in squared returns, i.e. predictability in the variance.
Note: squared returns are a measure of the variance of returns provided the mean return is zero. If the mean is very small, we can treat squared returns as a reasonable approximation.
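
The contrast between the two autocorrelations can be illustrated with a minimal sketch. It uses simulated ARCH(1) data rather than real returns, and the helper names (`sample_acf`, `simulate_arch1`) and parameter values are assumptions made for illustration only.

```python
import random

def sample_acf(x, lag):
    """Sample autocorrelation of x at the given lag."""
    n = len(x)
    mean = sum(x) / n
    var = sum((v - mean) ** 2 for v in x)
    cov = sum((x[t] - mean) * (x[t - lag] - mean) for t in range(lag, n))
    return cov / var

def simulate_arch1(n, alpha0, alpha1, seed=0):
    """Simulate zero-mean ARCH(1) returns (illustrative data only)."""
    rng = random.Random(seed)
    returns, prev_u = [], 0.0
    for _ in range(n):
        h = alpha0 + alpha1 * prev_u ** 2          # conditional variance
        prev_u = rng.gauss(0.0, 1.0) * h ** 0.5    # shock with variance h
        returns.append(prev_u)
    return returns

returns = simulate_arch1(5000, alpha0=1.0, alpha1=0.3)
squared = [r ** 2 for r in returns]

acf_returns = sample_acf(returns, 1)
acf_squared = sample_acf(squared, 1)
print(f"lag-1 ACF of returns:         {acf_returns:.3f}")
print(f"lag-1 ACF of squared returns: {acf_squared:.3f}")
```

On data like this, the returns themselves should look close to uncorrelated while the squared returns should show clear positive autocorrelation.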


### 2. Testing for Constant or Time-Varying Variance

Obtain the squared demeaned return series, which acts as a proxy for the variance.

Estimate an AR equation for this series and check its ACF/PACF.

Test the following hypotheses:
*   Null: all coefficients on the AR terms are jointly 0 (constant variance)
*   Alternative: at least one restriction is violated (time-varying variance)

This is a preliminary test for ARCH effects and is denoted ARCH(q), where q is the number of lags.

Returns are represented as:
$r_t = \phi_0 + u_t$

Squared demeaned returns act as a proxy for the variance of the series:
$u_t^2 = (r_t - \phi_0)^2$

In practice, we would estimate the above return model and extract the model residuals $\hat u_t$.

Then estimate an AR(q) model for the squared residuals, i.e. the ARCH(q) test regression:

$\hat u_t^2 = \delta_0 + \delta_1 \hat u_{t-1}^2 + ... + \delta_q \hat u_{t-q}^2 + v_t$

Then you compute the test statistic: 

$ARCH = TR^2$ where T is the sample size and $R^2$ is the coefficient of determination from the regression.

Under the null hypothesis of constant variance, the test statistic follows a chi-squared distribution with q degrees of freedom. We can extract the appropriate critical values and p-values from this distribution.

If the p-value of the test statistic is greater than 0.05, we fail to reject the null at the 5% level. If it is less than 0.05, we reject the null in favour of the alternative. This procedure is formally known as the Lagrange multiplier (LM) test.
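
The steps of the test can be sketched in Python with NumPy. This is a rough illustration, not the EViews implementation: the function name `arch_lm_test` is hypothetical, T is taken to be the number of usable observations after losing q lags, the data are simulated, and the p-value shortcut shown is only valid for q = 1.

```python
import math
import numpy as np

def arch_lm_test(returns, q):
    """Return (T * R^2, T) from the ARCH(q) test regression."""
    r = np.asarray(returns, dtype=float)
    u2 = (r - r.mean()) ** 2                      # squared demeaned returns
    y = u2[q:]                                    # dependent variable
    # Regressors: a constant plus q lags of the squared residuals
    lags = [u2[q - i:len(u2) - i] for i in range(1, q + 1)]
    X = np.column_stack([np.ones(len(y))] + lags)
    coef, *_ = np.linalg.lstsq(X, y, rcond=None)
    resid = y - X @ coef
    r_squared = 1.0 - (resid @ resid) / ((y - y.mean()) @ (y - y.mean()))
    T = len(y)                                    # usable observations (assumption)
    return T * r_squared, T

# Simulated ARCH(1) shocks with hypothetical parameters (alpha0=1, alpha1=0.3)
rng = np.random.default_rng(0)
z = rng.standard_normal(5000)
u = np.empty(5000)
u[0] = z[0]
for t in range(1, 5000):
    u[t] = z[t] * math.sqrt(1.0 + 0.3 * u[t - 1] ** 2)

stat, T = arch_lm_test(u, q=1)
# For q = 1 only: chi-squared(1) upper tail probability via the normal tail
p_value = math.erfc(math.sqrt(max(stat, 0.0) / 2.0))
print(f"ARCH(1) LM statistic = {stat:.2f}, p-value = {p_value:.4f}")
```

For q greater than 1 the p-value would need a general chi-squared distribution function, for example from a statistics library.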

Eviews: Quick / Estimate Equation / r c / View / Residual Diagnostics / Heteroskedasticity Tests / ARCH / q / Ok.





### 3. Autoregressive Conditional Heteroskedasticity (ARCH) Model Specification

ARCH models capture autocorrelation in the variance. 

The variance is determined by the size of previous shocks.

e.g. the ARCH(1) model is:

$r_t = \phi_0 + u_t$

$h_t = \alpha_0 + \alpha_1 u_{t-1}^2$

$u_t \sim N(0,h_t)$

The key features are that the conditional mean is a constant and that $u_t$ is conditionally normally distributed, so the conditional density of returns is:

$f(r_t|r_{t-1},r_{t-2},..,\theta) = \frac{1}{\sqrt{2\pi h_t}} \exp(-\frac{(r_t - \phi_0)^2}{2 h_t})$

Interpretation is that small shocks $u_{t-1}$ will result in a small variance $h_t$ and vice-versa. Large shocks will result in an increase in variance in the next period if $\alpha_1 > 0$.

The "News Impact Curve" is a plot of $h_t$ against $u_{t-1}$. Positive and negative shocks of the same magnitude have the same effect on $h_t$ (no asymmetry effects).
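
As a small illustration of this symmetry, the sketch below evaluates the ARCH(1) variance equation over a grid of lagged shocks. The function name and the parameter values are hypothetical.

```python
def news_impact_arch1(shock, alpha0, alpha1):
    """Conditional variance h_t implied by the lagged shock u_{t-1}."""
    return alpha0 + alpha1 * shock ** 2

alpha0, alpha1 = 0.2, 0.5                 # hypothetical parameter values
shocks = [-2.0, -1.0, 0.0, 1.0, 2.0]      # grid of lagged shocks
curve = [news_impact_arch1(u, alpha0, alpha1) for u in shocks]

for u, h in zip(shocks, curve):
    print(f"lagged shock = {u:+.1f}  ->  h_t = {h:.2f}")
```

The curve is a parabola centred at zero, so a shock of -2 and a shock of +2 produce the same next-period variance.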

A very special case of the ARCH(1) model is the constant variance model, where $\alpha_1 = 0$.

The model can be generalised to have more lags.

Estimating the ARCH model parameters:

$\theta = [\phi_0, \alpha_0, \alpha_1, .., \alpha_q]$

We estimate by maximum likelihood using an iterative algorithm. Given conditional normality, the log-likelihood function is:

$\log L(\theta) = \frac{1}{T} \sum^T_{t=1} \log f(r_t|r_{t-1},r_{t-2},...;\theta)$

$= -\frac{1}{2} \log (2\pi) - \frac{1}{2T}\sum^T_{t=1} \log h_t - \frac{1}{2T} \sum^T_{t=1} \frac{u^2_t}{h_t}$

where $u_t = r_t - \phi_0$

$h_t = \alpha_0 + \sum^q_{i = 1} \alpha_i u^2_{t-i}$
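
A minimal sketch of evaluating this average log-likelihood for given parameter values is shown here. The function name is hypothetical, the return values are made up, and the treatment of the first q observations as pre-sample values is an assumption, since the notes do not say how the recursion is started. An optimiser would then search over the parameters to maximise this value.

```python
import math

def arch_avg_loglik(returns, phi0, alpha0, alphas):
    """Average Gaussian log-likelihood of an ARCH(q) model.

    alphas = [alpha_1, ..., alpha_q]. The first q observations serve
    only as pre-sample values (a conditioning choice assumed here).
    """
    q = len(alphas)
    u = [r - phi0 for r in returns]               # demeaned returns
    total, count = 0.0, 0
    for t in range(q, len(u)):
        # h_t = alpha_0 + sum_i alpha_i * u_{t-i}^2
        h_t = alpha0 + sum(a * u[t - i] ** 2
                           for i, a in enumerate(alphas, start=1))
        total += (-0.5 * math.log(2 * math.pi)
                  - 0.5 * math.log(h_t)
                  - 0.5 * u[t] ** 2 / h_t)
        count += 1
    return total / count

# Hypothetical returns and parameter values, for illustration only
example_returns = [0.5, -1.0, 2.0, -0.5, 0.3, -1.7]
ll_arch = arch_avg_loglik(example_returns, phi0=0.0, alpha0=1.0, alphas=[0.5])
ll_const = arch_avg_loglik(example_returns, phi0=0.0, alpha0=1.0, alphas=[0.0])
print(f"ARCH(1) average log-likelihood:           {ll_arch:.4f}")
print(f"constant-variance average log-likelihood: {ll_const:.4f}")
```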

Eviews: Quick / Estimate Equation / r c / Estimation settings: ARCH / ARCH order 1 / GARCH order 0 / OK

View / GARCH Graph / Conditional Variance/ Proc / Make GARCH Variance Series / OK






### 4. Is the Model a Good Fit? Diagnostics

We have estimated the ARCH model.

Is the model any good?

If the model is specified correctly, then the standardised residuals

$z_t = \frac{u_t}{\sqrt{h_t}}$

should have a variance of 1, i.e. $var(z_t) = 1$, and there should be no evidence of autocorrelation in their variance.

This is because the model assumes that the error term is normally distributed with mean 0 and variance $h_t$.

Also, we want to check if we have the true process. E.g. What if the true process is an ARCH(2) when we have an ARCH(1)?

$Var(z_t) = \frac{var(u_t)}{h_t}$

If the model is correctly specified, the true variance $var(u_t)$ and the model variance $h_t$ are the same, so the ratio is 1.

However, if for example $var(u_t)$ is ARCH(3) and $h_t$ is ARCH(1), then the variance expressions in the numerator and denominator would not cancel out, and the ratio would still contain ARCH dynamics (loosely, of order 2).

Here $var(u_t)$ is the true variance and $h_t$ is the variance specified by the model.

Hence, we want to test the following hypothesis:
*   Null: model is correctly specified and the variance of $z_t$ is constant
*   Alternative: model is misspecified and the variance of $z_t$ is time-varying

i.e. We should see white-noise standardised residuals.

Eviews: View / Residual Diagnostics / ARCH LM Test / Number of Lags: 1 (is there another lag in the true model?)

We obtain the ARCH test statistic and its p-value based on the chi-squared distribution under the null. If the p-value is less than 0.05, we reject the null of correct specification and conclude, for example, that the variance has higher-order dynamics than ARCH(1).

We keep estimating a new model, reapplying the test until we can no longer reject the null.
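
The idea behind this diagnostic can be sketched on simulated data. In this illustration the true simulation parameters stand in for estimated ones, the helper names are hypothetical, and a simple lag-1 autocorrelation of $z_t^2$ is used in place of the full ARCH LM test.

```python
import random

def lag1_autocorr(x):
    """Lag-1 sample autocorrelation of x."""
    n = len(x)
    mean = sum(x) / n
    var = sum((v - mean) ** 2 for v in x)
    return sum((x[t] - mean) * (x[t - 1] - mean) for t in range(1, n)) / var

def standardised_residuals(u, h):
    """z_t = u_t / sqrt(h_t), element by element."""
    return [u_t / h_t ** 0.5 for u_t, h_t in zip(u, h)]

# Simulated ARCH(1) shocks with hypothetical parameters
rng = random.Random(1)
alpha0, alpha1 = 1.0, 0.3
u = [rng.gauss(0.0, 1.0) * alpha0 ** 0.5]
for _ in range(4999):
    h_t = alpha0 + alpha1 * u[-1] ** 2
    u.append(rng.gauss(0.0, 1.0) * h_t ** 0.5)

# Candidate 1: ARCH(1) variance (true parameters stand in for estimates)
h_arch = [alpha0 + alpha1 * u[t - 1] ** 2 for t in range(1, len(u))]
z_arch = standardised_residuals(u[1:], h_arch)

# Candidate 2: constant variance, which is misspecified for this data
sample_var = sum(v ** 2 for v in u) / len(u)
z_const = standardised_residuals(u[1:], [sample_var] * (len(u) - 1))

# Variance of z_t treating its mean as zero, and persistence left in z_t^2
var_z_arch = sum(z ** 2 for z in z_arch) / len(z_arch)
acf_arch = lag1_autocorr([z ** 2 for z in z_arch])
acf_const = lag1_autocorr([z ** 2 for z in z_const])
print(f"var(z) under ARCH(1):               {var_z_arch:.3f}")
print(f"ACF(1) of z^2 under ARCH(1):        {acf_arch:.3f}")
print(f"ACF(1) of z^2 under constant var.:  {acf_const:.3f}")
```

Under the correctly specified model the squared standardised residuals should show little remaining autocorrelation, whereas the constant-variance model leaves the ARCH effect in place.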


### 5. Generalised ARCH (GARCH)

When estimating ARCH models, you find that you generally need a fairly long q. i.e. Volatility seems to have long-memory and is persistent. 

This can mean using a lot of degrees of freedom and lags.

As with adding an MA term to an AR model, we can capture this persistence, where variance tends to cluster in periods, while controlling the number of parameters: we add the lagged conditional variance as an additional explanatory variable.

The GARCH term is responsible for capturing the persistence and memory features of volatility, meaning that many lagged volatilities help predict current and future volatility. The GARCH terms allow for an infinite ARCH structure. This is the analogue of an ARMA model for volatility.

$h_t = \alpha_0 + \alpha_1 u^2_{t-1} + \beta_1 h_{t-1}$

We can show that GARCH(1,1) is an infinite-order ARCH model with just this one additional parameter, $\beta_1$.

We have the GARCH(1,1) model:
$(1-\beta_1 L)h_t = \alpha_0 + \alpha_1 u^2_{t-1}$

Assuming $|\beta_1|<1$, we can invert the lag polynomial:
$h_t = (1-\beta_1L)^{-1} \alpha_0 + (1-\beta_1L)^{-1} \alpha_1 u^2_{t-1}$

$= \frac{\alpha_0}{1-\beta_1}+\alpha_1(1+\beta_1 L + \beta_1^2 L^2 + ...)u^2_{t-1}$

$= \frac{\alpha_0}{1-\beta_1}+(\alpha_1 u^2_{t-1} + \alpha_1 \beta_1 u^2_{t-2} + \alpha_1 \beta^2_1 u^2_{t-3} + ...)$

Hence, we have an infinite order ARCH model.
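
The equivalence can be checked numerically with a short sketch. The function names, the squared shocks, and the parameter values are hypothetical. Pre-sample shocks are assumed to be zero, which corresponds to starting the recursion at $\alpha_0/(1-\beta_1)$, so that the truncated sum and the recursion agree exactly.

```python
def garch11_recursive(u_sq, alpha0, alpha1, beta1):
    """h_t from h_t = alpha0 + alpha1 * u_{t-1}^2 + beta1 * h_{t-1}."""
    h_prev = alpha0 / (1.0 - beta1)     # start value: zero pre-sample shocks
    out = []
    for shock_sq in u_sq:               # u_sq[k] = u_k^2 produces h_{k+1}
        h_prev = alpha0 + alpha1 * shock_sq + beta1 * h_prev
        out.append(h_prev)
    return out

def garch11_as_arch_sum(u_sq, alpha0, alpha1, beta1):
    """Same h_t from alpha0/(1-beta1) + alpha1 * sum_j beta1^j * u_{t-1-j}^2."""
    out = []
    for t in range(1, len(u_sq) + 1):
        weighted = sum(beta1 ** j * u_sq[t - 1 - j] for j in range(t))
        out.append(alpha0 / (1.0 - beta1) + alpha1 * weighted)
    return out

u_sq = [1.0, 4.0, 0.25, 0.0, 9.0]       # hypothetical squared shocks
params = dict(alpha0=0.1, alpha1=0.05, beta1=0.9)

h_rec = garch11_recursive(u_sq, **params)
h_sum = garch11_as_arch_sum(u_sq, **params)
for a, b in zip(h_rec, h_sum):
    print(f"recursion: {a:.6f}   ARCH sum: {b:.6f}")
```

The weights $\alpha_1 \beta_1^j$ on older shocks decline geometrically, which is why a single $\beta_1$ can stand in for many ARCH lags.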

The parameter $\beta_1$ effectively tells us how slowly the effect of past shocks decays. The larger it is, the more weight previous periods carry and thus the longer the memory.

We can allow for q lags of the ARCH term and p lags of the GARCH term, resulting in a GARCH(p,q) model.

Again, the model assumes a normally distributed error term and a constant conditional mean.

The GARCH(1,1) model is generally the preferred model in practice. Estimates of $\beta_1$ are typically around 0.9 and estimates of $\alpha_1$ around 0.05.

Special cases are the ARCH(1) model ($\beta_1 = 0$) and the constant variance model ($\alpha_1 = \beta_1 = 0$).

Given the assumption of normality, we can estimate the GARCH model parameters:

$\theta = [\phi_0, \alpha_0, \alpha_1, ..., \alpha_q,\beta_1,\beta_2,...,\beta_p]$

These are estimated by maximum likelihood (MLE). The log-likelihood is:

$\log L(\theta) = \frac{1}{T}\sum^T_{t=1} \log f(r_t|r_{t-1},r_{t-2},...;\theta)$

$= -\frac{1}{2}\log 2\pi - \frac{1}{2T}\sum^T_{t=1} \log h_t - \frac{1}{2T} \sum^T_{t=1} \frac{u^2_t}{h_t}$

$u_t = r_t-\phi_0$

$h_t = \alpha_0 + \sum^q_{i=1}\alpha_i u^2_{t-i} + \sum^p_{i=1} \beta_i h_{t-i}$
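
To make the estimation step concrete, here is a rough sketch for the GARCH(1,1) case. It is not how EViews estimates the model: a crude grid search replaces the iterative optimiser, $\phi_0$ is fixed at the sample mean, $\alpha_0$ is pinned down by variance targeting, and the recursion starts at the sample variance. All of these choices, along with the function names and the return values, are assumptions for illustration.

```python
import math

def garch11_avg_loglik(returns, phi0, alpha0, alpha1, beta1, h0):
    """Average Gaussian log-likelihood of a GARCH(1,1) model.

    h0 is the variance used for the first observation (an assumption).
    """
    u = [r - phi0 for r in returns]
    h_t, total = h0, 0.0
    for t, u_t in enumerate(u):
        if t > 0:
            h_t = alpha0 + alpha1 * u[t - 1] ** 2 + beta1 * h_t
        total += (-0.5 * math.log(2 * math.pi)
                  - 0.5 * math.log(h_t)
                  - 0.5 * u_t ** 2 / h_t)
    return total / len(u)

def grid_search_garch11(returns, step=0.05):
    """Crude grid search over (alpha1, beta1) with alpha1 + beta1 < 1."""
    T = len(returns)
    phi0 = sum(returns) / T                        # mean fixed at sample mean
    s2 = sum((r - phi0) ** 2 for r in returns) / T
    n = int(round(1 / step))
    best = None
    for i in range(n):
        for j in range(n - i):                     # keeps alpha1 + beta1 < 1
            alpha1, beta1 = i * step, j * step
            alpha0 = s2 * (1 - alpha1 - beta1)     # variance targeting
            ll = garch11_avg_loglik(returns, phi0, alpha0, alpha1, beta1, h0=s2)
            if best is None or ll > best[0]:
                best = (ll, alpha0, alpha1, beta1)
    return phi0, s2, best

# Hypothetical returns with a calm spell and a volatile spell
returns = [0.1, -0.2, 0.15, 1.5, -1.8, 1.2, -0.9, 0.2,
           -0.1, 0.05, -0.3, 2.0, -1.5, 0.4, 0.1, -0.05]
phi0_hat, s2, (best_ll, a0_hat, a1_hat, b1_hat) = grid_search_garch11(returns)
print(f"phi0={phi0_hat:.3f} alpha0={a0_hat:.3f} "
      f"alpha1={a1_hat:.2f} beta1={b1_hat:.2f} avg loglik={best_ll:.4f}")
```

With so few observations the grid maximum is not a meaningful estimate; the point is only to show how the variance recursion feeds the likelihood that the optimiser maximises.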

