# Load Packages

In [None]:
using Printf, Statistics, StatsBase, Random, Distributions
include("jlFiles/printmat.jl")
Random.seed!(678)          #set the random number generator to this starting point

In [None]:
using Plots

gr(size=(480,320))
default(fmt = :svg)

# Introduction

This exam explores how autocorrelation ought to change how we test statistical hypotheses.

# Task 1

Code a function for simulating $T$ observations from an AR(1) series

$$
y_t = (1-\rho)\mu + \rho y_{t-1} + \varepsilon_t \sigma
$$
where $\varepsilon_t$ is iid $N(0,1)$.

That is, generate $y_1,y_2,...,y_T$ from this formula.

To make the starting value ($y_0$) random as well, simulate $T+100$ data points and then discard the first 100 values of $y_t$.

Generate a single "sample" using `(T,ρ,σ,μ) = (500,0,3,2)`. Calculate and report the average (mean) and the first 5 autocorrelations (hint: `autocor()`) of this sample. Then repeat with `ρ=0.75`.
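One possible way to write the simulator is sketched here. It is an illustration under stated assumptions, not a reference solution: the `burnin` keyword and the choice to start the recursion at $\mu$ before the burn-in are assumptions of this sketch, not part of the task statement.

```julia
using Random, Statistics, StatsBase

# Simulate T observations from y_t = (1-ρ)μ + ρ*y_{t-1} + σ*ε_t, ε_t ~ N(0,1).
# Assumption: the recursion starts at μ and the first `burnin` values are discarded.
function SimAR1(T, ρ, σ, μ; burnin = 100)
    y = zeros(T + burnin)
    y_lag = μ                               # pre-sample value (assumption)
    for t in eachindex(y)
        y[t] = (1 - ρ) * μ + ρ * y_lag + σ * randn()
        y_lag = y[t]
    end
    return y[(burnin + 1):end]              # keep the last T observations
end

Random.seed!(678)
(T, ρ, σ, μ) = (500, 0, 3, 2)

y   = SimAR1(T, ρ, σ, μ)
acf = autocor(y, 1:5)                       # autocorrelations at lags 1 to 5

println("average: ", round(mean(y), digits = 3))
println("autocorrelations 1-5: ", round.(acf, digits = 3))
```

Setting `ρ = 0.75` and rerunning the last lines gives the second sample.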

In [None]:
(T,ρ,σ,μ) = (500,0,3,2)

y = SimAR1(T,ρ,σ,μ)

...

# Task 2

Do a Monte Carlo simulation. Use the parameters `(T,ρ,σ,μ) = (500,0,3,2)`.

1. Generate a sample with $T$ observations and calculate the average. Repeat $M=10,000$ times and store the estimated averages in a vector of length $M$. (The rest of the question uses the symbol $\mu_i$ to denote the average from sample $i$.)

2. What is the average of $\mu_i$ across the $M$ estimates? (That is, what is $\frac{1}{M}\sum\nolimits_{i=1}^{M}\mu_i$?) _Report_ the result.

3. What is the standard deviation of $\mu_i$ across the $M$ estimates? Compare with the theoretical standard deviation (see below). _Report_ the result.

4. Does the distribution of $\mu_i$ look normal? _Plot_ a histogram and compare with the theoretical pdf (see below).


## Basic stats (the theoretical results)

Basic statistics says that the sample average of an iid ("independently and identically distributed") data series is (at least approximately, in large samples) normally distributed with a mean equal to the true (population) mean $\mu$ and a standard deviation equal to $s=\sigma_y/\sqrt{T}$, where $\sigma_y$ is the standard deviation of $y$.

To compare with the simulation results, you could estimate $\sigma_y$ from a single simulation with very many observations (say, 10,000).
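The comparison between the simulated and the theoretical standard deviation could be set up roughly as follows. This is a sketch only: the helper `sim_ar1` and its `burnin` keyword are hypothetical names introduced for illustration, and the histogram of step 4 is left out.

```julia
using Random, Statistics

# Hypothetical helper: AR(1) simulator that discards a 100-observation burn-in
function sim_ar1(T, ρ, σ, μ; burnin = 100)
    y = zeros(T + burnin)
    y_lag = μ                               # pre-sample value (assumption)
    for t in eachindex(y)
        y[t] = (1 - ρ) * μ + ρ * y_lag + σ * randn()
        y_lag = y[t]
    end
    return y[(burnin + 1):end]
end

Random.seed!(678)
(T, ρ, σ, μ) = (500, 0, 3, 2)
M = 10_000

# Sample average from each of the M simulated samples
μ_i = [mean(sim_ar1(T, ρ, σ, μ)) for i in 1:M]

# Theoretical standard deviation, with σ_y estimated from one long simulation
σ_y = std(sim_ar1(10_000, ρ, σ, μ))
s   = σ_y / sqrt(T)

println("average of μ_i: ", round(mean(μ_i), digits = 3))
println("std of μ_i:     ", round(std(μ_i), digits = 3))
println("σ_y/√T:         ", round(s, digits = 3))
```

Changing `ρ` in the parameter tuple reruns the same experiment for an autocorrelated series.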

# Task 3

Redo Task 2, but now use `ρ=0.75` (the other parameters are unchanged).

# Task 4

You decide to test the hypothesis that $\mu=2$ with a two-sided test at the 10% significance level. Your decision rule is

- reject the hypothesis if $|(\mu_i-2)/s|>1.645$ with $s=\sigma_y/\sqrt{T}$

With this decision rule, you are clearly assuming that the theoretical result (definition of $s$) is correct.

Estimate both $\mu_i$ and $\sigma_y$ from each sample.

In what fraction of the $M$ simulations do you reject the hypothesis when $\rho=0$ and when $\rho=0.75$? For the other parameters, use `(T,σ,μ) = (500,3,2)` (same as before).
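The rejection frequency can be computed by applying the decision rule to each simulated sample and counting the rejections. A minimal sketch follows, assuming the hypothetical helpers `sim_ar1` and `reject_rate` (neither name comes from the exam). If the theoretical result holds, the rejection rate should be close to the nominal 10%.

```julia
using Random, Statistics

# Hypothetical helper: AR(1) simulator that discards a 100-observation burn-in
function sim_ar1(T, ρ, σ, μ; burnin = 100)
    y = zeros(T + burnin)
    y_lag = μ                               # pre-sample value (assumption)
    for t in eachindex(y)
        y[t] = (1 - ρ) * μ + ρ * y_lag + σ * randn()
        y_lag = y[t]
    end
    return y[(burnin + 1):end]
end

# Fraction of M samples where |(μ_i - μ0)/s| > crit, with s = σ_y/√T
# and both μ_i and σ_y estimated from the same sample.
function reject_rate(T, ρ, σ, μ; M = 10_000, μ0 = 2, crit = 1.645)
    n_reject = 0
    for i in 1:M
        y = sim_ar1(T, ρ, σ, μ)
        s = std(y) / sqrt(T)
        n_reject += abs((mean(y) - μ0) / s) > crit
    end
    return n_reject / M
end

Random.seed!(678)
rate_iid = reject_rate(500, 0, 3, 2)        # ρ = 0
rate_ar  = reject_rate(500, 0.75, 3, 2)     # ρ = 0.75

println("rejection rate, ρ = 0:    ", rate_iid)
println("rejection rate, ρ = 0.75: ", rate_ar)
```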