# Solutions to Mock Exam 1

*This exam is representative of the actual exam in terms of question types. It is longer than the actual exam so that you have more practice questions; its length would suit a three-hour exam rather than a two-hour one.*

*Visit [Canvas](https://canvas.uva.nl/courses/20506/assignments/179028) for more information on the exam.*

## Question 1
1. Write a docstring for the function `f(z)` defined in the cell below, which takes a list `z` as input. In other words, what does the function do?

```python
def f(z):
    '''
    Returns the index of the first positive element in the list z.
    If all elements in z are non-positive, then it throws an error.
    '''

    j = 0

    while z[j] <= 0:
        j = j + 1

    return j
```

2. For each of the following expressions, state whether the two arrays can be broadcast together, and if so, what the expression will evaluate to:
    * `np.array([1, 2]) + np.array([7])`
    * `np.array([[1, 2], [3, 4]]) + np.array([7, 8])`
    * `np.array([[1, 2], [3, 4]]) + np.array([7, 8, 9])`
    * `np.array([[1, 2], [3, 4], [5, 6]]) + np.array([[7], [8], [9]])`

#### Answers

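1. The docstring:
    ```
    '''
    Returns the index of the first positive element in the list z.
    If all elements in z are non-positive, then it throws an error.
    '''
    ```
    The error is an `IndexError`: the loop keeps incrementing `j` until `z[j]` runs past the end of the list.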

2. Broadcasting works as follows: the shapes are compared dimension by dimension, starting from the last one. Two dimensions are compatible if they are equal, if one of them is 1, or if one of them is absent. In particular:
    * The dimensions are `(2,)` and `(1,)`. The second array gets expanded to `np.array([7, 7])`, and the result is `np.array([8, 9])`.
    * The dimensions are `(2, 2)` and `(2,)`. The second array gets expanded to `np.array([[7, 8], [7, 8]])`, and the result is `np.array([[ 8, 10], [10, 12]])`.
    * The dimensions are `(2, 2)` and `(3,)`. The trailing dimensions (2 and 3) differ and neither equals 1, so the arrays cannot be broadcast and NumPy raises a `ValueError`.
    * The dimensions are `(3, 2)` and `(3, 1)`. The second array gets expanded to `np.array([[ 7,  7], [8, 8], [9, 9]])`, and the result is `np.array([[ 8,  9], [11, 12], [14, 15]])`.
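
The four cases above can be checked directly in NumPy. This is a minimal sketch; the variable names are only for illustration, and `np.broadcast_to` is used to show the expanded second operand explicitly.

```python
import numpy as np

# Case 1: shapes (2,) and (1,)
case1 = np.array([1, 2]) + np.array([7])

# Case 2: shapes (2, 2) and (2,); show the expanded second operand as well
expanded2 = np.broadcast_to(np.array([7, 8]), (2, 2))
case2 = np.array([[1, 2], [3, 4]]) + np.array([7, 8])

# Case 3: shapes (2, 2) and (3,) are incompatible
try:
    np.array([[1, 2], [3, 4]]) + np.array([7, 8, 9])
    case3_broadcasts = True
except ValueError as err:
    case3_broadcasts = False
    print("Case 3 fails:", err)

# Case 4: shapes (3, 2) and (3, 1)
case4 = np.array([[1, 2], [3, 4], [5, 6]]) + np.array([[7], [8], [9]])

print(case1, expanded2, case2, case4, sep="\n")
```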

## Question 2

You are given a set of $N$ returns $\{R_t\}_{t=T-N+1}^T$.
1. Suppose that $N=10$ and $\{R_t\}=\{-0.07,  0.87,  1.13, -0.48,  0.22,  0.94, -0.62, -0.01,  2.06, -0.08\}$. What is the historical 20% VaR?
2. Name two problems associated with the historical VaR and offer a solution for each of them.
3. You gather some more returns and produce the graph in the figure below. Interpret it.
<img src="figq2.png" alt="qqplot" style="display:block; margin-left: auto; margin-right: auto; width: 90%;">

4. You decide to use a parametric model for $VaR_{t+1}^p$, in the form of a Student's $t$ distribution. You fit the distribution to the data and obtain the following estimates: $(m, h, \nu)=(0.01, 0.3, 4)$, where $m$, $h$, and $\nu$ are respectively the location parameter, the scale parameter, and the degrees of freedom. Obtain the 1% Student's $t$ VaR.
5. Explain how, after constructing VaR forecasts for all $N$ observations, you could backtest your model.

#### Answers

1. Sort the array and pick the smallest value such that at least 20% (2 out of 10) of the values are smaller than or equal to it, i.e., $-0.48$ (because $-0.48$ and $-0.62$ are smaller than or equal to it). The VaR is minus that number, so $0.48$ (recall that VaR should be a positive number).

2. 
    * Historical VaR assumes that the last $N$ returns are representative of tomorrow's return distribution. Solution: use a filtered method or a rolling window method.
    * Only 20% of the data are really used to compute the VaR (note how the historical VaR doesn't change at all if we multiply all the positive returns by, e.g., 10,000). We could use a larger $N$, but that exacerbates the first problem. Solution: use a parametric distribution, such as the Student's $t$. To solve both problems, the methods can be combined.
        
3. This is a QQ plot; it plots the empirical/historical quantiles against those of a fitted parametric distribution, in this case the Normal. If the fit is good, then the points should lie on the 45 degree line. Here, the data are clearly more heavy-tailed than the normal distribution would imply. E.g., focussing on the point at the bottom left, for some $p$, the $p$-quantile of the normal is about $-4$, whereas the empirical one is almost $-6$.

4. With $p=0.01$,
$$
VaR_{t+1}^{p} =-F^{-1}_\nu(p; m, h) =-m-hF^{-1}_\nu(p; 0, 1) =-0.01-0.3F^{-1}_4(0.01; 0, 1) =-0.01-0.3(-3.747)=1.114.
$$

5. This can be tested by comparing the number of realized exceedances (observations for which the loss exceeded the VaR forecast, i.e., $R_t<-VaR_t^p$) to the number of expected exceedances, which is equal to $N\cdot p$. If the number of realized exceedances is much larger (or smaller) than the number of expected exceedances, the VaR model is misspecified.
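
The numerical answers to parts 1, 4, and 5 can be reproduced with the standard library alone. The sketch below is illustrative only: the helper names are made up, the Student's $t$ quantile is obtained by numerically integrating the density and bisecting (with SciPy installed, `scipy.stats.t.ppf` would be the usual route), and the backtest reuses the ten in-sample returns with a constant VaR forecast purely to show the mechanics.

```python
import math

returns = [-0.07, 0.87, 1.13, -0.48, 0.22, 0.94, -0.62, -0.01, 2.06, -0.08]

def historical_var(returns, p):
    """Minus the smallest return r such that at least a fraction p of returns are <= r."""
    ordered = sorted(returns)
    k = max(1, math.ceil(round(p * len(ordered), 10)))  # size of the lower tail
    return -ordered[k - 1]

def t_pdf(x, nu):
    """Density of a standard Student's t with nu degrees of freedom."""
    c = math.gamma((nu + 1) / 2) / (math.sqrt(nu * math.pi) * math.gamma(nu / 2))
    return c * (1 + x * x / nu) ** (-(nu + 1) / 2)

def t_cdf(x, nu, steps=2000):
    """CDF via Simpson's rule on [0, |x|], using symmetry around zero."""
    a = abs(x)
    h = a / steps
    total = t_pdf(0.0, nu) + t_pdf(a, nu)
    for k in range(1, steps):
        total += (4 if k % 2 else 2) * t_pdf(k * h, nu)
    area = total * h / 3
    return 0.5 + area if x >= 0 else 0.5 - area

def t_quantile(p, nu, lo=-100.0, hi=100.0):
    """Invert the CDF by bisection on [lo, hi]."""
    for _ in range(100):
        mid = (lo + hi) / 2
        if t_cdf(mid, nu) < p:
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2

def binom_upper_tail(k, n, p):
    """P(K >= k) for K ~ Binomial(n, p)."""
    return sum(math.comb(n, j) * p**j * (1 - p) ** (n - j) for j in range(k, n + 1))

# Part 1: historical 20% VaR
hist_var = historical_var(returns, 0.20)

# Part 4: 1% Student's t VaR with (m, h, nu) = (0.01, 0.3, 4)
m, h, nu = 0.01, 0.3, 4
t_var = -(m + h * t_quantile(0.01, nu))

# Part 5: count exceedances of a constant forecast (illustration only, in-sample)
forecasts = [hist_var] * len(returns)
n_exceed = sum(r < -v for r, v in zip(returns, forecasts))
expected = 0.20 * len(returns)
tail_prob = binom_upper_tail(n_exceed, len(returns), 0.20)

print(hist_var, round(t_var, 3), n_exceed, expected, round(tail_prob, 4))
```

A very small (or very large) binomial tail probability would indicate that the realized number of exceedances is hard to reconcile with the nominal level $p$.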

## Question 3

Consider a researcher who sorts a set of assets based on characteristic $X$, and finds the following average equal weighted returns.

1. Explain the impact of choosing value-weighted portfolio returns versus equal-weighted portfolio returns on the average returns.
2. Based on the average returns, explain whether the researcher has an anomaly or not.

#### Answers

1. Using equal-weighted returns implies that small-cap stocks have a large impact on the portfolio returns. When using value-weighted returns, a small set of large-cap firms has a large impact.
2. There is no way to conclude this based on the average returns alone. An anomaly is a pattern of abnormal returns that cannot be explained by the asset pricing model we consider. We should first estimate the alphas and betas, and then compare the alphas across portfolios, to be able to conclude this.
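
The weighting effect described in the first answer can be illustrated with a toy portfolio. The returns and market capitalizations below are hypothetical numbers chosen for illustration, not data from the exam.

```python
# Hypothetical one-period returns and market caps: small, mid, and large cap stock
stock_returns = [0.10, 0.02, -0.01]
market_caps = [1.0, 9.0, 90.0]

def equal_weighted(rets):
    """Every stock gets weight 1/n, so the small cap counts as much as the large cap."""
    return sum(rets) / len(rets)

def value_weighted(rets, caps):
    """Weights are proportional to market cap, so the large cap dominates."""
    total = sum(caps)
    return sum(r * c / total for r, c in zip(rets, caps))

ew = equal_weighted(stock_returns)
vw = value_weighted(stock_returns, market_caps)
print(f"equal-weighted: {ew:.4f}, value-weighted: {vw:.4f}")
```

In this made-up example the equal-weighted return is driven by the small-cap stock, while the value-weighted return is close to that of the large-cap stock.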

## Question 4
Consider a  two-period binomial stock price tree ($T=2$, $N=2$, $\delta t\equiv T/N=1$), where $S_{t}=S_{t-1}u$ or $S_t=S_{t-1}d$,
with $S_0=4$, $u=2$ and $d=1/u=\frac{1}{2}$. The risk-free bond satisfies $B_t=e^{rt}$, where $e^{r \delta t}=5/4$.

1. Derive the risk-neutral probability of an up move from the condition that the stock must earn the risk-free rate
under $\mathbb{Q}$.

2. Let $\displaystyle S^\ast_t\equiv \max_{0\leq j \leq t} S_j$. Obtain $C_0$, the no-arbitrage price at time zero of a European *lookback option* whose payoff at time $T$ is $C_T=S^\ast_T-S_T$.

    **Hint**: the payoff of the option is path-dependent, implying a non-recombining option tree.

#### Answers

1. The condition is
\begin{align*}
\mathbb{E}^{\mathbb{Q}}\left[ S_{t} |\mathcal{F}_{t-1}\right] &=e^{r\delta t}S_{t-1} \Leftrightarrow\\
pS_{t-1}u+(1-p)S_{t-1}d &=e^{r \delta t}S_{t-1} \Leftrightarrow\\
p(u-d)&=e^{r \delta t}-d,
\end{align*}
so that $p=(e^{r \delta t}-d)/(u-d)=(\frac{5}{4}-\frac{2}{4})/(2-\frac{1}{2})=\frac{1}{2}$.

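2. The payoffs at maturity are $C_{uu}=0$, $C_{ud}=4$, $C_{du}=0$, and $C_{dd}=3$. For example, on the path $ud$ the stock price goes $4\to 8\to 4$, so $S^\ast_T=8$ and $C_{ud}=8-4=4$. Backwards induction with $p=\tfrac{1}{2}$ and discount factor $e^{-r\delta t}=\tfrac{4}{5}$ yields $C_u=\tfrac{4}{5}\cdot\tfrac{1}{2}(0+4)=1.6$, $C_d=\tfrac{4}{5}\cdot\tfrac{1}{2}(0+3)=1.2$, and $C_0=\tfrac{4}{5}\cdot\tfrac{1}{2}(1.6+1.2)=1.12$.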
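
Because the tree is non-recombining, the price can also be checked by enumerating all $2^N$ paths and discounting the risk-neutral expected payoff. This is a minimal sketch under the parameters of the question; the variable names are illustrative.

```python
from itertools import product

S0, u, d = 4.0, 2.0, 0.5
growth = 1.25          # e^{r * dt}, given in the question
N = 2                  # number of periods

# Risk-neutral probability of an up move
p = (growth - d) / (u - d)

price = 0.0
for path in product((True, False), repeat=N):   # True = up move, False = down move
    s = S0
    running_max = S0                            # S*_t includes S_0
    prob = 1.0
    for is_up in path:
        s *= u if is_up else d
        running_max = max(running_max, s)
        prob *= p if is_up else 1 - p
    price += prob * (running_max - s)           # payoff S*_T - S_T on this path

price /= growth**N                              # discount back to time zero
print(p, price)
```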

## Question 5

1. State the definition of a continuous-time martingale, and explain what it means.
2. Let $X_t$ satisfy
    $$dX_t=\nu dt +\sigma dW_t, \quad \nu=-\frac{1}{2}\sigma^2,$$ 
    i.e., $X_t$ is a Brownian motion with drift. For each of the following functions, derive the SDE satisfied by $Y_t\equiv f(t, X_t)$, and state whether $Y_t$ is a martingale.
    1. $f(t, X_t)=\exp(X_t)$;
    2. $f(t, X_t)=tX_t$.

#### Answers

1. A process $X_t$, adapted to the filtration $\{\mathcal F_t\}$, is a martingale if
    * $\mathbb{E}[|X_t|]<\infty$ for all $t$, and
    * $\mathbb{E}[X_t\mid \mathcal F_{s}]=X_{s}$ for all $s<t$.

  This means that a martingale is a process without a drift; on average, we expect tomorrow's value to be equal to today's.
  
2. 
    1. $f(t, X_t)=\exp(X_t)$, so $\dot f(t, X_t)\equiv \tfrac{\partial}{\partial t}f(t, X_t)=0$, $f'(t, X_t)\equiv \tfrac{\partial}{\partial X_t}f(t, X_t)=\exp(X_t)$, and $f''(t, X_t)\equiv \tfrac{\partial^2}{\partial X_t^2}f(t, X_t)=\exp(X_t)$, so by Ito's lemma,
\begin{align*}
dY_t\equiv df(t,X_{t})&=\dot{f}(t,X_{t})dt+f^{\prime }(t,X_{t})dX_{t}+\frac{1}{2}f^{\prime \prime }(t,X_{t})\sigma^2dt\\
&=0+\exp(X_t)dX_{t}+\frac{1}{2}\exp(X_t)\sigma^2dt\\
&=\exp(X_t)(\nu dt +\sigma dW_t)+\frac{1}{2}\exp(X_t)\sigma^2dt\\
&=\exp(X_t)(-\frac{1}{2}\sigma^2 dt +\sigma dW_t)+\frac{1}{2}\exp(X_t)\sigma^2dt\\
&= \exp(X_t)\sigma dW_t\\
&= Y_t\sigma dW_t,
\end{align*}
an Ito process without a drift and hence a martingale.

    2. $f(t, X_t)=tX_t$, so $\dot f(t, X_t)=X_t$, $f'(t, X_t)=t$, $f''(t, X_t)=0$,  and
\begin{align*}
dY_t\equiv df(t,X_{t})&=\dot{f}(t,X_{t})dt+f^{\prime }(t,X_{t})dX_{t}+\frac{1}{2}f^{\prime \prime }(t,X_{t})\sigma^2dt\\
&=X_tdt+tdX_t+0\\
&=X_tdt+t(\nu dt +\sigma dW_t)\\
&=(X_t+\nu t)dt+\sigma t dW_t\\
&=(X_t-\frac{1}{2}\sigma^2 t)dt+\sigma t dW_t,
\end{align*}
which has a non-zero drift and thus isn't a martingale.
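
Both conclusions can be sanity-checked by simulation. The sketch below assumes $X_0=0$, $\sigma=0.2$ and $t=1$ (all illustrative choices, not given in the question), so that $X_t\sim N(\nu t, \sigma^2 t)$ can be drawn exactly. A martingale must satisfy $\mathbb{E}[Y_t]=Y_0$, which holds for $\exp(X_t)$ (with $Y_0=1$) but not for $tX_t$ (with $Y_0=0$ and $\mathbb{E}[tX_t]=\nu t^2$).

```python
import math
import random

sigma, t, n = 0.2, 1.0, 100_000      # assumed values for illustration
nu = -0.5 * sigma**2                 # drift from the question
rng = random.Random(0)               # fixed seed for reproducibility

# With X_0 = 0 (assumption), X_t = nu * t + sigma * W_t is N(nu * t, sigma^2 * t)
draws = [nu * t + sigma * math.sqrt(t) * rng.gauss(0.0, 1.0) for _ in range(n)]

mean_exp = sum(math.exp(x) for x in draws) / n   # estimate of E[exp(X_t)]; Y_0 = 1
mean_tx = sum(t * x for x in draws) / n          # estimate of E[t * X_t]; Y_0 = 0

print(f"E[exp(X_t)] ~ {mean_exp:.4f} (martingale value: 1)")
print(f"E[t X_t]    ~ {mean_tx:.4f} (nu * t^2 = {nu * t**2:.4f}, not 0)")
```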

## Question 6

A researcher wants to obtain a Monte Carlo estimate of the price of a European-style option with payoff $C_T(S_T)$. The risk-neutral dynamics of $S_t$ are specified  by the SDE

$$dS_t=\mu_tdt+\sigma_t dW_t.$$

The market is free from arbitrage, so that $C_0= \mathbb{E}^\mathbb{Q}[e^{-rT}C_T(S_T)]$.
1. Explain how the researcher could simulate a draw from the risk-neutral distribution of $C_T(S_T)$.
2. Let $X\equiv e^{-rT}C_T(S_T)$ and denote by $\{X_i\}_{i=1}^n$ independent draws of $X$, obtained as in the previous part. The researcher uses $\bar X_n\equiv \tfrac{1}{n}\sum_{i=1}^nX_i$ as an estimate of $C_0$. Derive an (asymptotic) confidence interval for $C_0$.

#### Answers

1. The process could be discretized by the Euler approximation on the grid $t_i=i\,\delta t$, $i=0,\ldots,N$, with $\delta t=T/N$:
$$
S_{t_i}=S_{t_{i-1}}+\mu_{t_{i-1}} \delta t + \sigma_{t_{i-1}}  \sqrt{\delta t} Z_i,
$$
where $Z_i\stackrel{i.i.d.}{\sim} N(0,1)$. Given pseudo random numbers for $Z_i$, this can be iterated from $S_{t_0}=S_0$ to give a path for $S_t$, including $S_T$ and hence $C_T(S_T)$. Repeating this $n$ times with independent random numbers produces $n$ paths, and thus $n$ independent draws of $C_T(S_T)$.
2. By the C.L.T., with $\theta\equiv\mathbb{E}^\mathbb{Q}[X]=C_0$,
$$
\sqrt{n}(\bar{X}_n-\theta)\stackrel d\rightarrow N\left(0,\sigma^2\right), \quad \sigma^2=var(X).
$$
Hence,
\begin{align*}
\mathbb{P}[-1.96 \sigma\leq \sqrt{n}(\bar{X}_n-\theta)\leq 1.96 \sigma ]&=0.95\Leftrightarrow\\
\mathbb{P}[\bar{X}_n-1.96 \frac{\sigma}{\sqrt{n}}\leq \theta\leq \bar{X}_n+1.96 \frac{\sigma}{\sqrt{n}} ]&=0.95.
\end{align*}
Hence $[c_l, c_u]$, with $c_l=\bar{X}_n-1.96 \frac{\sigma}{\sqrt{n}}$ and $c_u=\bar{X}_n+1.96 \frac{\sigma}{\sqrt{n}}$, is an asymptotically valid 95% CI for $\theta=C_0$. $\sigma^2$ is unknown, but we can estimate it consistently as
$$
\hat{\sigma}^2=\frac{1}{n}\sum_{i=1}^n \left(X_i-\bar{X}_n\right)^2.
$$
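
Both parts can be combined in a short simulation. The sketch below is one possible implementation, not the only one: the drift and diffusion functions ($\mu_t=rS_t$, $\sigma_t=0.2\,S_t$), the call payoff, and all parameter values are hypothetical choices made for illustration, since the question leaves them unspecified.

```python
import math
import random

def simulate_terminal(s0, mu, sigma, T, steps, rng):
    """One Euler path of dS = mu(t, S) dt + sigma(t, S) dW; returns S_T."""
    dt = T / steps
    s = s0
    for i in range(steps):
        t = i * dt
        z = rng.gauss(0.0, 1.0)
        s = s + mu(t, s) * dt + sigma(t, s) * math.sqrt(dt) * z
    return s

def mc_confidence_interval(draws, z=1.96):
    """Sample mean and asymptotic 95% CI, using the 1/n variance estimator."""
    n = len(draws)
    mean = sum(draws) / n
    var_hat = sum((x - mean) ** 2 for x in draws) / n
    half_width = z * math.sqrt(var_hat / n)
    return mean, mean - half_width, mean + half_width

# Hypothetical model and payoff (assumptions, not part of the question)
r, vol, s0, strike, T = 0.05, 0.2, 100.0, 100.0, 1.0
mu = lambda t, s: r * s            # risk-neutral drift
sigma = lambda t, s: vol * s       # diffusion coefficient
payoff = lambda s_T: max(s_T - strike, 0.0)

rng = random.Random(42)            # fixed seed for reproducibility
draws = [
    math.exp(-r * T) * payoff(simulate_terminal(s0, mu, sigma, T, 50, rng))
    for _ in range(5000)
]
estimate, lower, upper = mc_confidence_interval(draws)
print(f"C_0 estimate: {estimate:.3f}, 95% CI: [{lower:.3f}, {upper:.3f}]")
```

The interval only accounts for Monte Carlo sampling error; the discretization bias of the Euler scheme shrinks separately as the number of time steps grows.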