## Problem 1: Brownian motion
*In these problems we look at Brownian motion, which is the source of randomness. Asset prices, interest rates and payoffs are functions of Brownian motion. These expectations we are looking at tell us how randomness at different times is related.*

Let $W_t$ be a Brownian motion. Assume  
$$
s < t < u < v,
$$
and solve the problems below. In doing so, you will need to use that $W_t$ is Markov and has stationary independent increments. That is, for $0 < s < t$ we know that
$$
W_t - W_s \mid \mathcal{F}_s
= W_t - W_s \mid W_s = w_s
\sim \mathcal{N}(0,\, t - s).
$$

---



### 1.a) Find the conditional distribution of $W_t$ given $\mathcal{F}_s$.

Brownian motion can be written as:
$$ W_t =W_s +(W_t-W_s)$$
Given that $W_s=w_s$, we can write:
$$ W_t\mid W_s = w_s = w_s + (W_t-W_s)\sim \mathcal{N}(w_s,t-s)$$

---

### 1.b) Find  $\mathbb{E}[W_s W_t], \quad \mathrm{Cov}[W_s, W_t], \quad \text{and} \quad \mathrm{Cor}[W_s, W_t].$



We know that:
- $E[W_t]=0$
- $Var(W_t)=t$
- $W_t-W_s \sim \mathcal{N}(0,t-s)$
- $W_t-W_s$ is independent of $\mathcal{F}_s$

We have: $W_t = W_s + (W_t - W_s)$

---

**Find $\mathbb{E}[W_s W_t]$:**
We start by substituting the decomposition of $W_t$:
$$\mathbb{E}[W_s W_t]=\mathbb{E}[W_s (W_s +(W_t-W_s))]$$
$$=\mathbb{E}[W_s^2]+ \mathbb{E}[W_s(W_t-W_s)]$$

We know that $\mathbb{E}[W_s^2]=Var(W_s)=s$.

Since $W_t-W_s$ is independent of $\mathcal{F}_s$, we have $\mathbb{E}[W_t-W_s\mid\mathcal{F}_s]=\mathbb{E}[W_t-W_s]=0$.

By the tower property, and because $W_s$ is $\mathcal{F}_s$-measurable, this gives:
$$\mathbb{E}[W_s^2]+ \mathbb{E}[W_s(W_t-W_s)] = s+ \mathbb{E}[W_s\,\mathbb{E}[W_t-W_s\mid\mathcal{F}_s]]$$

$$=s+\mathbb{E}[W_s\cdot 0]=s$$

Hence:
$$\mathbb{E}[W_s W_t]=s$$

---

**Find $\mathrm{Cov}[W_s, W_t]$**
We know that:
$$Cov(X,Y)=E[XY]-E[X]E[Y]$$
So for us:
$$Cov(W_s, W_t)=E[W_s·W_t]-E[W_s]E[W_t]$$

But we know that $E[W_s]=0$ and $E[W_t]=0$ so:
$$Cov(W_s, W_t)=s$$

**Find $\mathrm{Cor}[W_s, W_t]$**
Recall that:
$$Cor(X,Y)=\frac{Cov(X, Y)}{\sqrt{Var(X)Var(Y)}}$$
Here $Var(W_s)=s$ and $Var(W_t)=t$, so:
$$\mathrm{Cor}[W_s, W_t]=\frac{s}{\sqrt{st}}=\sqrt{\frac{s}{t}}$$
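
The results of part (b) can be sanity-checked by simulation. The following is a minimal sketch and not part of the original solution; the times `s = 1.0` and `t = 2.0`, the sample size and the seed are arbitrary assumptions.

```python
import numpy as np

# Illustrative values (assumptions, not from the problem set)
s, t = 1.0, 2.0
n_paths = 200_000
rng = np.random.default_rng(0)

# Build W_s and W_t from independent increments: W_t = W_s + (W_t - W_s)
w_s = rng.normal(0.0, np.sqrt(s), n_paths)
w_t = w_s + rng.normal(0.0, np.sqrt(t - s), n_paths)

mean_product = np.mean(w_s * w_t)          # estimates E[W_s W_t] = s
sample_cov = np.cov(w_s, w_t)[0, 1]        # estimates Cov[W_s, W_t] = s
sample_cor = np.corrcoef(w_s, w_t)[0, 1]   # estimates sqrt(s / t)

print(mean_product, sample_cov, sample_cor, np.sqrt(s / t))
```

The estimates should agree with $s$ and $\sqrt{s/t}$ up to Monte Carlo noise, which shrinks as `n_paths` grows.
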

### 1.c) Show that $W_t^2 - t$ is a martingale.
*Theorem 3.8 (First Fundamental Theorem) Given a fixed numeraire, the
market is free of arbitrage possibilities if and only if there exists a martingale measure Q.*

Intuition: A martingale is a stochastic process for which the expected value of a future observation, given all observations up to now, equals the most recent value.

To show that $W_t^2 - t$ is a martingale we need to show that $\mathbb{E}[W_t^2 - t\mid\mathcal{F}_s]=W_s^2-s$ for $s < t$.

Write the Brownian motion as
$$
W_t = W_s + (W_t - W_s).
$$

Squaring both sides gives
$$
W_t^2
= W_s^2 + 2W_s(W_t - W_s) + (W_t - W_s)^2.
$$

Taking conditional expectation with respect to $\mathcal F_s$ yields
$$
\mathbb E[W_t^2 \mid \mathcal F_s]
= \mathbb E[W_s^2 \mid \mathcal F_s]
+ 2W_s \mathbb E[W_t - W_s \mid \mathcal F_s]
+ \mathbb E[(W_t - W_s)^2 \mid \mathcal F_s].
$$

Since $W_s$ is $\mathcal F_s$-measurable,
$$
\mathbb E[W_s^2 \mid \mathcal F_s] = W_s^2.
$$

Because Brownian increments are independent of $\mathcal F_s$ and have mean zero,
$$
\mathbb E[W_t - W_s \mid \mathcal F_s] = 0.
$$

Moreover, since $W_t - W_s \sim \mathcal N(0, t-s)$,
$$
\mathbb E[(W_t - W_s)^2 \mid \mathcal F_s] = t - s.
$$

Hence,
$$
\mathbb E[W_t^2 \mid \mathcal F_s] = W_s^2 + (t - s).
$$

Subtracting $t$ on both sides gives
$$
\mathbb E[W_t^2 - t \mid \mathcal F_s]
= W_s^2 - s.
$$

Therefore,
$$
W_t^2 - t \text{ is a martingale}.
$$

---


### 1.d) Find $\mathbb{E}[W_s W_t W_u].$

Using the tower property,
$$
\mathbb E[W_s W_t W_u]
= \mathbb E\!\left[ \mathbb E[W_s W_t W_u \mid \mathcal F_t] \right].
$$

Since $W_s$ and $W_t$ are $\mathcal F_t$-measurable, we can pull them out of the conditional expectation:
$$
\mathbb E[W_s W_t W_u \mid \mathcal F_t]
= W_s W_t \mathbb E[W_u \mid \mathcal F_t].
$$

By the martingale property of Brownian motion,
$$
\mathbb E[W_u \mid \mathcal F_t] = W_t.
$$

Hence,
$$
\mathbb E[W_s W_t W_u] = \mathbb E[W_s W_t^2].
$$

Apply the tower property again:
$$
\mathbb E[W_s W_t^2]
= \mathbb E\!\left[ W_s \mathbb E[W_t^2 \mid \mathcal F_s] \right].
$$

From part (c),
$$
\mathbb E[W_t^2 \mid \mathcal F_s] = W_s^2 + (t-s).
$$

Therefore,
$$
\mathbb E[W_s W_t^2]
= \mathbb E[W_s^3] + (t-s)\mathbb E[W_s].
$$

Since $W_s \sim \mathcal N(0,s)$,
$$
\mathbb E[W_s] = 0, \qquad \mathbb E[W_s^3] = 0.
$$

Thus,
$$
\mathbb E[W_s W_t W_u] = 0.
$$

*Intuition: The result shows us that products involving an odd number of Brownian motion terms have zero expectation, reflecting the symmetry and absence of drift in Brownian motion.*

---

### 1.e) Find $\mathbb{E}[W_s W_t W_u W_v].$


Using the tower property,
$$
\mathbb E[W_s W_t W_u W_v]
= \mathbb E\!\left[ \mathbb E[W_s W_t W_u W_v \mid \mathcal F_u] \right].
$$

Since $W_s, W_t, W_u$ are $\mathcal F_u$-measurable,
$$
\mathbb E[W_s W_t W_u W_v \mid \mathcal F_u]
= W_s W_t W_u \mathbb E[W_v \mid \mathcal F_u].
$$

By the martingale property,
$$
\mathbb E[W_v \mid \mathcal F_u] = W_u.
$$

Hence,
$$
\mathbb E[W_s W_t W_u W_v]
= \mathbb E[W_s W_t W_u^2].
$$

Condition on $\mathcal F_t$:
$$
\mathbb E[W_u^2 \mid \mathcal F_t] = W_t^2 + (u-t).
$$

Therefore,
$$
\mathbb E[W_s W_t W_u^2]
= \mathbb E[W_s W_t (W_t^2 + (u-t))]
= \mathbb E[W_s W_t^3] + (u-t)\mathbb E[W_s W_t].
$$

From part (b),
$$
\mathbb E[W_s W_t] = s.
$$

Moreover,
$$
\mathbb E[W_s W_t^3]
= \mathbb E\!\left[ W_s \mathbb E[W_t^3 \mid \mathcal F_s] \right].
$$

Since $W_t = W_s + (W_t - W_s)$ and $W_t - W_s \sim \mathcal N(0, t-s)$,
$$
\mathbb E[W_t^3 \mid \mathcal F_s]
= W_s^3 + 3W_s(t-s).
$$

Thus,
$$
\mathbb E[W_s W_t^3]
= \mathbb E[W_s^4] + 3(t-s)\mathbb E[W_s^2].
$$

For $W_s \sim \mathcal N(0,s)$,
$$
\mathbb E[W_s^2] = s, \qquad \mathbb E[W_s^4] = 3s^2.
$$

Hence,
$$
\mathbb E[W_s W_t^3] = 3s^2 + 3s(t-s) = 3st.
$$

Combining terms,
$$
\mathbb E[W_s W_t W_u W_v]
= 3st + (u-t)s
= s(u + 2t).
$$

*Intuition: Part (e) shows that this even-order moment is positive and grows with the time points, reflecting how variance accumulates over time. These results formalize how randomness spreads through time.*
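
As a cross-check on parts (d) and (e), one can use that $(W_s, W_t, W_u, W_v)$ is jointly Gaussian with mean zero and $\mathrm{Cov}[W_a, W_b] = \min(a, b)$. For such vectors, Isserlis' (Wick's) theorem writes the expectation of a product as a sum over all pairings of covariances. The theorem is not used in the solution above; the sketch below, with arbitrary illustrative times, only confirms the two results.

```python
def pairings(items):
    """Yield every way of splitting `items` into unordered pairs."""
    if not items:
        yield []
        return
    first, rest = items[0], items[1:]
    for i, partner in enumerate(rest):
        remaining = rest[:i] + rest[i + 1:]
        for tail in pairings(remaining):
            yield [(first, partner)] + tail


def brownian_product_moment(times):
    """E[W_{t1} ... W_{tn}] via Isserlis' theorem with Cov = min(a, b)."""
    if len(times) % 2 == 1:
        return 0.0  # odd products of centred Gaussians have zero mean
    total = 0.0
    for pairing in pairings(list(times)):
        term = 1.0
        for a, b in pairing:
            term *= min(a, b)
        total += term
    return total


# Illustrative times with s < t < u < v (assumed values)
s, t, u, v = 0.5, 1.0, 1.5, 2.0
third_moment = brownian_product_moment([s, t, u])
fourth_moment = brownian_product_moment([s, t, u, v])
print(third_moment, fourth_moment, s * (u + 2 * t))
```

For four time points there are three pairings, contributing $su + st + st = s(u + 2t)$, which matches the tower-property derivation.
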

---

## Problem 2

### 2.a) Show that $Z_t$ is a Brownian motion

To show that $Z_t$ is a Brownian motion, we need to verify that it satisfies the defining properties of Brownian motion. We define: 

$$Z_t = \rho X_t + \sqrt{1-\rho^2}Y_t$$

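Since $X_t$ and $Y_t$ are independent Brownian motions, they both start at $0$, have continuous paths and have stationary independent increments. As a linear combination of the two, $Z_t$ also starts at $0$, has continuous paths and has stationary independent increments.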

For $s < t$, the increment $Z_t - Z_s$ is the sum of two independent Gaussian random variables:
$$
Z_t - Z_s = \rho(X_t - X_s) + \sqrt{1-\rho^2}(Y_t - Y_s)
$$
and is therefore also Gaussian.

Taking the conditional expectation given $\mathcal{F}_s$, and using that the increments of $X_t$ and $Y_t$ are independent of $\mathcal{F}_s$ with mean zero:
$$
\mathbb{E}[Z_t - Z_s \mid \mathcal{F}_s] = \rho \mathbb{E}[X_t - X_s \mid \mathcal{F}_s] + \sqrt{1-\rho^2}\mathbb{E}[Y_t - Y_s \mid \mathcal{F}_s] $$

$$= \rho \cdot 0 + \sqrt{1-\rho^2} \cdot 0 = 0
$$

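Computing the variance, where the cross term vanishes because the increments $X_t - X_s$ and $Y_t - Y_s$ are independent with mean zero: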
$$
\text{Var}[Z_t - Z_s \mid \mathcal{F}_s] = \mathbb{E}[(Z_t - Z_s)^2 \mid \mathcal{F}_s]
$$
$$
= \rho^2 \mathbb{E}[(X_t - X_s)^2 \mid \mathcal{F}_s] + (1-\rho^2)\mathbb{E}[(Y_t - Y_s)^2 \mid \mathcal{F}_s]
$$
$$
= \rho^2(t-s) + (1-\rho^2)(t-s) = (t-s)
$$

In conclusion, $Z_t$ is a continuous stochastic process that has stationary independent increments, each of which is Gaussian with mean $0$ and variance $t-s$. Thus, $Z_t$ is a Brownian motion.

*Intuition: The construction $Z_t = \rho X_t + \sqrt{1-\rho^2}Y_t$ is a clever way to create a new Brownian motion that is correlated with $X_t$. The weights $\rho$ and $\sqrt{1-\rho^2}$ ensure that the variance stays equal to $t$. The important takeaway is that the weighted sum of independent Gaussian processes with the right weights produces another Gaussian process with the desired properties.*

---



### 2.b) Find $\text{Cor}[X_t, Z_t]$

We note that $\mathbb{E}[X_t] = 0$, $\mathbb{E}[Z_t] = 0$, $\text{Var}[X_t] = t$ and $\text{Var}[Z_t] = t$.

Computing the covariance:
$$
\text{Cov}[Z_t, X_t] = \mathbb{E}[Z_t X_t] = \mathbb{E}\left[\left(\rho X_t + \sqrt{1-\rho^2}Y_t\right)X_t\right]
$$
$$
= \rho \mathbb{E}[X_t^2] + \sqrt{1-\rho^2}\mathbb{E}[X_t Y_t]
$$

Since $X_t$ and $Y_t$ are independent, $\mathbb{E}[X_t Y_t] = 0$:
$$
= \rho \mathbb{E}[X_t^2] = \rho t
$$

Therefore, the correlation is:
$$
\text{Cor}[Z_t, X_t] = \frac{\text{Cov}[Z_t, X_t]}{\sqrt{\text{Var}[X_t]}\sqrt{\text{Var}[Z_t]}} = \frac{\rho t}{\sqrt{t}\sqrt{t}} = \rho
$$

*Intuition: $\rho$ is exactly the correlation coefficient between $X_t$ and $Z_t$. This can be used when modeling correlated financial instruments and thus ensuring they have a specific correlation structure.*
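
The construction can be checked numerically. The sketch below is an illustration under assumed values (`rho = 0.6`, `t = 2.0`, a fixed seed); it samples $X_t$ and $Y_t$ at a single time and confirms that $Z_t$ has variance close to $t$ and correlation close to $\rho$ with $X_t$.

```python
import numpy as np

# Illustrative parameters (assumptions)
rho, t = 0.6, 2.0
n_paths = 200_000
rng = np.random.default_rng(1)

# Independent terminal values X_t, Y_t ~ N(0, t)
x_t = rng.normal(0.0, np.sqrt(t), n_paths)
y_t = rng.normal(0.0, np.sqrt(t), n_paths)

# Correlated Brownian motion evaluated at time t
z_t = rho * x_t + np.sqrt(1.0 - rho**2) * y_t

var_z = np.var(z_t)                      # should be close to t
cor_xz = np.corrcoef(x_t, z_t)[0, 1]     # should be close to rho
print(var_z, cor_xz)
```
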

---

### 2.c) Find $\mathbb{E}[Z_t \mid X_t = x]$ and $\text{Var}[Z_t \mid X_t = x]$

The conditional mean and variance of $Z_t$ given $X_t = x$ can be found from the definition $Z_t = \rho X_t + \sqrt{1-\rho^2}Y_t$.

**Conditional Mean:**
$$
\mathbb{E}[Z_t \mid X_t = x] = \mathbb{E}[\rho X_t \mid X_t = x] + \sqrt{1-\rho^2}\mathbb{E}[Y_t \mid X_t = x]
$$

Since $X_t = x$ is given and $Y_t$ is independent of $X_t$:
$$
= \rho x + \sqrt{1-\rho^2} \cdot 0 = \rho x
$$

**Conditional Variance:**
$$
\text{Var}[Z_t \mid X_t = x] = \mathbb{E}\left[\left(\rho X_t + \sqrt{1-\rho^2}Y_t - \rho x\right)^2 \mid X_t = x\right]
$$

Expanding:
$$
\begin{aligned}
&= \mathbb{E}[(\rho X_t - \rho x)^2 \mid X_t = x] + 2\,\mathbb{E}[(\rho X_t - \rho x) \mid X_t = x]\cdot\mathbb{E}[\sqrt{1-\rho^2}Y_t \mid X_t = x] \\
&\quad + (1-\rho^2)\mathbb{E}[Y_t^2 \mid X_t = x]
\end{aligned}
$$

Since $X_t = x$ is given, $(\rho X_t - \rho x)^2 = 0$, and the middle term is zero. Also, $Y_t$ is independent of $X_t$:
$$
= 0 + 0 + (1-\rho^2)\mathbb{E}[Y_t^2] = (1-\rho^2)t
$$

*Intuition: When we condition on $X_t = x$, the expected value of $Z_t$ shifts proportionally to $\rho x$ (the correlated part), but the variance only comes from the independent component $Y_t$, which contributes $(1-\rho^2)t$. The stronger the correlation (larger $|\rho|$), the less uncertainty remains after conditioning.*
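
Because $(X_t, Z_t)$ is jointly Gaussian, the same answer follows from the standard bivariate normal conditioning formulas: the conditional mean is $\frac{\mathrm{Cov}[Z_t, X_t]}{\mathrm{Var}[X_t]}x$ and the conditional variance is $\mathrm{Var}[Z_t] - \frac{\mathrm{Cov}[Z_t, X_t]^2}{\mathrm{Var}[X_t]}$. The short sketch below evaluates these for assumed values of $\rho$, $t$ and $x$ as a cross-check.

```python
import numpy as np

# Illustrative parameters (assumptions)
rho, t, x = 0.6, 2.0, 1.3

# Covariance matrix of (X_t, Z_t) from parts (a) and (b)
cov = np.array([[t, rho * t],
                [rho * t, t]])

# Gaussian conditioning: mean = Cov / Var * x, variance = Schur complement
cond_mean = cov[1, 0] / cov[0, 0] * x
cond_var = cov[1, 1] - cov[1, 0] ** 2 / cov[0, 0]

print(cond_mean, rho * x)
print(cond_var, (1.0 - rho**2) * t)
```
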

---



### 2.d) Find the covariance matrix of the random vector $Y_t$. Show that the covariance matrix is positive definite.

Let us find the moments of the vector $Y_t = \Sigma W_t$ where $W_t = (W_t^{(1)}, W_t^{(2)}, \ldots, W_t^{(N)})^T$.

We denote the $i$-th entry of $Y_t$ by $Y_i$, the entry in the $i$-th row and $j$-th column of $\Sigma$ by $\Sigma_{ij}$, and the $i$-th row of $\Sigma$ by $\Sigma_{i\cdot}$.

**Expected Value:**
$$
\mathbb{E}[Y_i] = \mathbb{E}\left[\sum_{n=1}^{N} \Sigma_{in}W_t^{(n)}\right] = \sum_{n=1}^{N} \Sigma_{in}\mathbb{E}[W_t^{(n)}] = 0
$$

**Variance:**
$$
\text{Var}[Y_i] = \mathbb{E}[Y_i^2] = \mathbb{E}\left[\sum_{m=1}^{N}\sum_{n=1}^{N} \Sigma_{im}\Sigma_{in}W_t^{(m)}W_t^{(n)}\right]
$$

Since the Brownian motions are independent, $\mathbb{E}[W_t^{(m)}W_t^{(n)}] = 0$ for $m \neq n$ and $\mathbb{E}[(W_t^{(n)})^2] = t$. Using that each row of $\Sigma$ has unit norm:
$$
= t\sum_{n=1}^{N} \Sigma_{in}^2 = t\Sigma_{i\cdot}\Sigma_{i\cdot}^T = t\|\Sigma_{i\cdot}\|^2 = t
$$

**Covariance:**
$$
\text{Cov}[Y_i, Y_j] = \mathbb{E}[Y_iY_j] = \mathbb{E}\left[\sum_{m=1}^{N}\sum_{n=1}^{N} \Sigma_{im}\Sigma_{jn}W_t^{(m)}W_t^{(n)}\right]
$$
$$
= t\sum_{n=1}^{N} \Sigma_{in}\Sigma_{jn} = t\Sigma_{i\cdot}\Sigma_{j\cdot}^T
$$

The covariance matrix of $Y_t$ can therefore be written as:
$$
\text{Cov}[Y_t] = \text{Cov}[\Sigma W_t] = \Sigma\Omega\Sigma^T, \quad \Omega = \begin{pmatrix}
t & 0 & \cdots & 0 \\
0 & t & \cdots & 0 \\
\vdots & \vdots & \ddots & \vdots \\
0 & 0 & \cdots & t
\end{pmatrix}
$$

**Positive Definiteness:**

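Since $\Omega = tI$, we can factor the covariance matrix as
$$
\text{Cov}[Y_t] = \Sigma\Omega\Sigma^T = AA^T
$$

where $A = \Sigma\sqrt{\Omega} = \sqrt{t}\,\Sigma$. For any vector $b$ we then have $b^T AA^T b = \|A^T b\|^2 \geq 0$, so the matrix is positive semidefinite. It is positive definite when $A^T b = 0$ only for $b = 0$, that is, when $t > 0$ and the rows of $\Sigma$ are linearly independent, which we assume here.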

*Intuition: The construction $Y_t = \Sigma W_t$ transforms independent Brownian motions into correlated ones through the matrix $\Sigma$. The constraint $\|\Sigma_{i\cdot}\| = 1$ ensures each component $Y_i$ has variance $t$, making it a proper Brownian motion. The covariance matrix captures how the components co-move, and positive definiteness guarantees it's a valid covariance structure.*

---



### 2.e) What is the correlation matrix of $Y_t$?

The correlation between $Y_i$ and $Y_j$ is:
$$
\text{Cor}[Y_i, Y_j] = \frac{\text{Cov}[Y_i, Y_j]}{\sqrt{\text{Var}[Y_i]}\sqrt{\text{Var}[Y_j]}} = \frac{t\Sigma_{i\cdot}\Sigma_{j\cdot}^T}{\sqrt{t}\sqrt{t}} = \Sigma_{i\cdot}\Sigma_{j\cdot}^T
$$

Therefore, the correlation matrix of $Y_t$ is:
$$
\text{Cor}[Y_t] = \text{Cor}[\Sigma W_t] = \Sigma\Sigma^T
$$

*Intuition: The correlation matrix $\Sigma\Sigma^T$ is determined entirely by how the matrix $\Sigma$ mixes the independent Brownian motions. This gives us a systematic way to construct a multivariate Brownian motion with any desired correlation structure: simply find a matrix $\Sigma$ such that $\Sigma\Sigma^T$ equals the target correlation matrix (this is Cholesky decomposition). This is crucial for modeling multiple correlated interest rates or asset prices.*
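
The Cholesky route mentioned above can be sketched as follows. The target correlation matrix is a hypothetical example chosen for illustration, and the simulation at a single time `t` uses an arbitrary seed; `np.linalg.cholesky` returns a lower-triangular factor whose rows automatically have unit norm when the target has a unit diagonal.

```python
import numpy as np

# Hypothetical target correlation matrix (symmetric, unit diagonal, positive definite)
target = np.array([[1.0, 0.5, 0.2],
                   [0.5, 1.0, -0.3],
                   [0.2, -0.3, 1.0]])

# Lower-triangular factor with sigma @ sigma.T == target
sigma = np.linalg.cholesky(target)

row_norms = np.linalg.norm(sigma, axis=1)   # each row has unit norm
reconstructed = sigma @ sigma.T

# Simulate Y_t = sigma @ W_t at a single time t and inspect the correlations
t, n_paths = 1.0, 200_000
rng = np.random.default_rng(2)
w_t = rng.normal(0.0, np.sqrt(t), size=(3, n_paths))
y_t = sigma @ w_t
sample_cor = np.corrcoef(y_t)
print(np.round(sample_cor, 2))
```

The sample correlation matrix should reproduce `target` up to Monte Carlo noise.
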

---

### 2.f) What is the distribution of $Y_t^{(i)}$ and what is the joint distribution of $Y_t$?

**Distribution of $Y_t^{(i)}$:**

From part (d), we know that $Y_i$ can be written as:
$$
Y_i = \sum_{n=1}^{N} \Sigma_{in}W_t^{(n)}
$$

Since $Y_t^{(i)}$ is a linear combination of independent Gaussian random variables (each $W_t^{(n)} \sim \mathcal{N}(0,t)$), it is itself Gaussian. From part (d), we found:
$$
\mathbb{E}[Y_i] = 0, \quad \text{Var}[Y_i] = t
$$

Therefore:
$$
Y_t^{(i)} \sim \mathcal{N}(0, t) \text{ for all } t
$$

Furthermore, $Y_t^{(i)}$ starts at $0$, has continuous trajectories (as a linear combination of continuous processes) and inherits stationary independent Gaussian increments from $W_t$, and hence $Y_t^{(i)}$ is a Brownian motion.

**Joint Distribution of $Y_t$:**

Since $Y_t = \Sigma W_t$, any linear combination $b^T Y_t = (b^T\Sigma)W_t$, where $b \in \mathbb{R}^M$ and $M$ is the number of components of $Y_t$, is a linear combination of the independent normal random variables $W_t^{(n)}$ and is therefore also a Gaussian random variable.

By definition, if all linear combinations of the components of a random vector are Gaussian, then the vector follows a multivariate normal distribution.

Hence, the joint distribution of the processes collected in $Y_t$ is multivariate normal:
$$
Y_t \sim \mathcal{N}(0, \Sigma\Omega\Sigma^T)
$$

*Intuition: Each component $Y_i$ inherits the Brownian motion property from the independent Brownian motions it's built from. The joint distribution is multivariate normal because we're taking linear combinations of independent normal variables. The covariance matrix $\Sigma\Omega\Sigma^T$ fully characterizes how the components move together over time.*

---



### 2.g) Is $Y_t$ a multivariate Brownian motion?

Yes, we can define $Y_t$ as a multivariate Brownian motion with correlation matrix $\Sigma\Sigma^T$.

**Verification:**

To be a multivariate Brownian motion, $Y_t$ must satisfy:

1. **Each component is a Brownian motion:** From part (f), each $Y_t^{(i)} \sim \mathcal{N}(0,t)$ with continuous paths and independent increments.

2. **Joint normality:** From part (f), $Y_t$ follows a multivariate normal distribution.

3. **Proper covariance structure:** The covariance matrix $\text{Cov}[Y_t] = t\Sigma\Sigma^T$ grows linearly with time, and the correlation matrix $\Sigma\Sigma^T$ is constant over time.

4. **Independent increments:** For $s < t$, the increment $Y_t - Y_s = \Sigma(W_t - W_s)$ is independent of $\mathcal{F}_s$ because the Brownian increments $W_t - W_s$ are independent of $\mathcal{F}_s$.

Therefore, $Y_t$ satisfies all the properties of a multivariate Brownian motion.

*Intuition: This problem demonstrates a fundamental construction method for multivariate Brownian motion. We've just seen a very straightforward method to construct a multivariate Brownian motion with a specific correlation matrix: start with independent Brownian motions and mix them using a matrix $\Sigma$ where each row has unit norm. This technique is essential in practice for modeling multiple correlated interest rates, exchange rates, or asset prices. For example, if you want to model term structure dynamics with correlated factors, or price multi-asset derivatives, you would use exactly this construction.*

---

## Problem 3

**Problem Statement:** Consider a stochastic process $r_t$ for $t \geq 0$ with dynamics
$$
dr_t = (b - ar_t)dt + \sigma dW_t, \quad b > 0
$$

---



### 3.a) Show that the solution $r_T$ corresponding to these dynamics is:
$$
r_T = e^{-aT}r_0 + \frac{b}{a}\left[1 - e^{-aT}\right] + \sigma \int_0^T e^{-a(T-t)}dW_t
$$

**i) Apply Ito's formula to $f(t,r) = e^{at}r$:**

We have:
$$
\frac{\partial f}{\partial t} = ae^{at}r, \quad \frac{\partial f}{\partial r} = e^{at}, \quad \frac{\partial^2 f}{\partial r^2} = 0
$$

Applying Ito's formula:
$$
df(t,r_t) = ae^{at}r_t dt + e^{at}dr_t + \frac{1}{2} \cdot 0 \cdot (dr_t)^2
$$

Substituting $dr_t = (b - ar_t)dt + \sigma dW_t$:
$$
df(t,r_t) = ae^{at}r_t dt + e^{at}[(b - ar_t)dt + \sigma dW_t]
$$

**ii) Simplify to get an expression for $d(e^{at}r_t)$ that does not depend on $r_t$:**

Expanding:
$$
df(t,r_t) = ae^{at}r_t dt + be^{at}dt - ae^{at}r_t dt + \sigma e^{at}dW_t
$$

The terms with $r_t$ cancel:
$$
d(e^{at}r_t) = be^{at}dt + \sigma e^{at}dW_t
$$

**iii) Integrate from $0$ to $T$ and solve the time-integral:**

Integrating both sides from $0$ to $T$:
$$
\int_0^T d(e^{au}r_u) = e^{aT}r_T - e^{a \cdot 0}r_0 = \int_0^T be^{au}du + \sigma \int_0^T e^{au}dW_u
$$

Computing the time integral:
$$
\int_0^T be^{au}du = b\left[\frac{e^{au}}{a}\right]_0^T = \frac{b}{a}(e^{aT} - 1)
$$

Therefore:
$$
e^{aT}r_T - r_0 = \frac{b}{a}(e^{aT} - 1) + \sigma \int_0^T e^{au}dW_u
$$

Multiplying through by $e^{-aT}$:
$$
r_T = e^{-aT}r_0 + \frac{b}{a}(1 - e^{-aT}) + \sigma \int_0^T e^{-a(T-u)}dW_u
$$

*Intuition: This is the Vasicek/Ornstein-Uhlenbeck process, a fundamental mean-reverting model for interest rates. The solution shows that $r_T$ is pulled toward the long-run mean $\frac{b}{a}$ at rate $a$. The exponential decay $e^{-aT}$ shows that the influence of the initial value $r_0$ diminishes over time, while the process is constantly buffeted by cumulative random shocks represented by the stochastic integral.*
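
One way to build confidence in the closed-form solution is to compare it with an Euler-Maruyama discretisation driven by the same Brownian increments. The sketch below is illustrative only: the parameter values, step count and seed are assumptions, and the Ito integral in the closed form is approximated by a left-point sum.

```python
import numpy as np

# Illustrative parameters (assumptions, not from the problem set)
a, b, sigma, r0 = 0.5, 0.02, 0.02, 0.03
T, n_steps = 1.0, 1000
dt = T / n_steps
rng = np.random.default_rng(3)
dW = rng.normal(0.0, np.sqrt(dt), n_steps)   # Brownian increments
grid = np.arange(n_steps) * dt               # left endpoints t_k

# Euler-Maruyama discretisation of dr = (b - a r) dt + sigma dW
r_euler = r0
for k in range(n_steps):
    r_euler += (b - a * r_euler) * dt + sigma * dW[k]

# Closed-form solution, with the Ito integral approximated by a left-point sum
stochastic_integral = np.sum(np.exp(-a * (T - grid)) * dW)
r_exact = (np.exp(-a * T) * r0
           + b / a * (1.0 - np.exp(-a * T))
           + sigma * stochastic_integral)

print(r_euler, r_exact)
```

The two values should differ only by a discretisation error that shrinks as `n_steps` increases.
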

---



### 3.b) Use Ito isometry to show that:
$$
r_T|r_0 \sim \mathcal{N}\left(e^{-aT}r_0 + \frac{b}{a}(1-e^{-aT}), \frac{\sigma^2}{2a}(1-e^{-2aT})\right)
$$

From Problem 5, we know that an Ito integral with a deterministic integrand follows a Gaussian distribution. We also know that the expected value of an Ito integral is $0$.

**Expected Value:**

Taking expectations:
$$
\mathbb{E}[r_T|r_0] = e^{-aT}r_0 + \frac{b}{a}(1-e^{-aT}) + \sigma\mathbb{E}\left[\int_0^T e^{-a(T-u)}dW_u\right]
$$
$$
= e^{-aT}r_0 + \frac{b}{a}(1-e^{-aT})
$$

**Variance:**

Computing the variance of $r_T|r_0$:
$$
\text{Var}[r_T|r_0] = \mathbb{E}\left[\left(\sigma \int_0^T e^{-a(T-u)}dW_u\right)^2 \mid r_0\right]
$$

Using Ito isometry (for a deterministic integrand $g(u)$, we have $\mathbb{E}[(\int_0^T g(u)dW_u)^2] = \int_0^T g(u)^2 du$):
$$
= \sigma^2 \int_0^T e^{-2a(T-u)}du
$$

Evaluating the integral (substituting $v = T-u$, so $dv = -du$):
$$
= \sigma^2 \int_0^T e^{-2av}dv = \sigma^2 \left[-\frac{e^{-2av}}{2a}\right]_0^T = \sigma^2 \left(-\frac{e^{-2aT}}{2a} + \frac{1}{2a}\right) = \frac{\sigma^2}{2a}(1-e^{-2aT})
$$

Therefore:
$$
r_T|r_0 \sim \mathcal{N}\left(e^{-aT}r_0 + \frac{b}{a}(1-e^{-aT}), \frac{\sigma^2}{2a}(1-e^{-2aT})\right)
$$

*Intuition: The conditional distribution is Gaussian because it's driven by Brownian motion. Both the mean and variance approach their limiting values as $T$ increases, with the variance converging faster (at rate $2a$) than the mean (at rate $a$).*

---



### 3.c) Find the limiting distribution of $r_T$ as $T \to \infty$

Sending $T \to \infty$ (assuming $a > 0$):
$$
\lim_{T \to \infty} e^{-aT} = 0
$$
$$
\lim_{T \to \infty} (1 - e^{-aT}) = 1
$$
$$
\lim_{T \to \infty} (1 - e^{-2aT}) = 1
$$

Therefore, the limiting distribution is:
$$
r_\infty \sim \mathcal{N}\left(\frac{b}{a}, \frac{\sigma^2}{2a}\right)
$$

The fact that the limiting distribution exists and is well-defined allows us to conclude that the short rate in this model settles to a stationary distribution.

*Intuition: As time goes to infinity, the process "forgets" its initial condition and converges to a stationary distribution. This is the key property of mean reversion: no matter where you start, you eventually end up with the same long-run distribution centered at $\frac{b}{a}$ with variance $\frac{\sigma^2}{2a}$.*
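
The convergence can be made concrete by tabulating the conditional mean and variance from part (b) for several starting values and horizons. This is a small sketch with assumed parameter values; `vasicek_moments` is a helper name introduced here for illustration.

```python
import math

def vasicek_moments(r0, a, b, sigma, T):
    """Mean and variance of r_T given r_0 under dr = (b - a r) dt + sigma dW."""
    decay = math.exp(-a * T)
    mean = decay * r0 + b / a * (1.0 - decay)
    variance = sigma**2 / (2.0 * a) * (1.0 - decay**2)
    return mean, variance

# Illustrative parameters (assumptions); a > 0 is needed for convergence
a, b, sigma = 0.5, 0.02, 0.02
stationary_mean = b / a
stationary_var = sigma**2 / (2.0 * a)

for r0 in (0.0, 0.03, 0.10):
    for T in (1.0, 10.0, 50.0):
        mean, var = vasicek_moments(r0, a, b, sigma, T)
        print(f"r0={r0:.2f} T={T:5.1f} mean={mean:.6f} var={var:.6f}")
```

For large $T$ every row approaches the same pair $\left(\frac{b}{a}, \frac{\sigma^2}{2a}\right)$ regardless of `r0`.
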

---



### 3.d) If you had to guess, what is your best guess of $r$ in the long run? How does the limiting distribution of $r_T$ depend on $r_0$ and what is the implication?

The long-run mean under the stationary distribution is $\frac{b}{a}$, and that would be our best long-run guess for the short rate.

**Key observations:**

1. **Independence from initial conditions:** The limiting distribution does not depend on $r_0$, implying that this process forgets its origin.

2. **Mean reversion dynamics:** The mean of $r_T|r_0$ for finite $T$ is a weighted average of the initial value $r_0$ and the long-run mean $\frac{b}{a}$:
   $$
   \mathbb{E}[r_T|r_0] = e^{-aT}r_0 + (1-e^{-aT})\frac{b}{a}
   $$
   where the weight on $r_0$ decays exponentially fast at rate $a$.

3. **Variance convergence:** Similarly, $\text{Var}[r_T|r_0]$ converges exponentially fast, at rate $2a$, to the long-run variance $\frac{\sigma^2}{2a}$.

4. **Role of parameter $a$:** It is clear that the parameter $a$ governs the rate at which the distribution of the short rate settles to its stationary distribution. Larger $a$ means faster mean reversion.

*Intuition: This model captures the economic reality that interest rates don't wander off to infinity or negative infinity—they are pulled back toward a long-run average. The parameter $a$ measures how strongly this pull operates: high $a$ means strong mean reversion (rates quickly return to normal), while low $a$ means weak mean reversion (rates can stay away from the mean for extended periods). This is crucial for pricing long-dated bonds and understanding interest rate risk.*

---

## Problem 4

**Problem Statement:** Suppose that the stochastic process $S_t$ follows a Geometric Brownian motion and has dynamics
$$
dS_t = \mu S_t dt + \sigma S_t dW_t, \quad S_0 = s_0
$$

---



### 4.a) Show that the solution $S(T)$ corresponding to these dynamics is $S(T) = s_0 e^{(\mu - \frac{1}{2}\sigma^2)T + \sigma W_T}$

To solve this SDE, we apply Ito's formula to the logarithm of $S_t$.

Let $X_t = \ln(S_t)$. We want to find the dynamics of $X_t$ using Ito's formula where $f(t,S) = \ln S$.

**Computing partial derivatives:**
$$
\frac{\partial f}{\partial S} = \frac{1}{S}, \quad \frac{\partial^2 f}{\partial S^2} = -\frac{1}{S^2}
$$

**Applying Ito's formula:**
$$
dX_t = \frac{\partial f}{\partial S} dS_t + \frac{1}{2}\frac{\partial^2 f}{\partial S^2}(dS_t)^2
$$

Substituting $dS_t = \mu S_t dt + \sigma S_t dW_t$:
$$
dX_t = \frac{1}{S_t}(\mu S_t dt + \sigma S_t dW_t) + \frac{1}{2}\left(-\frac{1}{S_t^2}\right)(\sigma S_t dW_t)^2
$$

Using $(dW_t)^2 = dt$:
$$
dX_t = \mu dt + \sigma dW_t - \frac{1}{2}\sigma^2 dt = \left(\mu - \frac{1}{2}\sigma^2\right)dt + \sigma dW_t
$$

**Integrating from $0$ to $T$:**

Notice that $X_t$ no longer appears on the right-hand side, so we can integrate directly:
$$
X_T - X_0 = \int_0^T \left(\mu - \frac{1}{2}\sigma^2\right)dt + \int_0^T \sigma dW_t
$$
$$
X_T = X_0 + \left(\mu - \frac{1}{2}\sigma^2\right)T + \sigma W_T
$$

Since $X_t = \ln(S_t)$, we have $X_0 = \ln(s_0)$ and $X_T = \ln(S_T)$:
$$
\ln(S_T) = \ln(s_0) + \left(\mu - \frac{1}{2}\sigma^2\right)T + \sigma W_T
$$

Taking exponentials:
$$
S_T = s_0 e^{(\mu - \frac{1}{2}\sigma^2)T + \sigma W_T}
$$

*Intuition: The famous $-\frac{1}{2}\sigma^2$ term arises from Ito's lemma and reflects the "drag" due to volatility. This is why the drift of $\ln(S_t)$ is not simply $\mu$ but is reduced by half the variance. This correction ensures that $\mathbb{E}[S_T] = s_0 e^{\mu T}$ despite the stochastic nature of the process.*

---



### 4.b) Find $\mathbb{E}[S(T)]$ in terms of $s_0$, $\mu$ and $\sigma$

From part (a), we have:
$$
S_T = s_0 e^{(\mu - \frac{1}{2}\sigma^2)T + \sigma W_T}
$$

Taking expectations:
$$
\mathbb{E}[S_T] = s_0 e^{(\mu - \frac{1}{2}\sigma^2)T} \mathbb{E}[e^{\sigma W_T}]
$$

**Using the moment generating function:**

We need to use that if $X \sim \mathcal{N}(\alpha, \beta^2)$, then:
$$
\mathbb{E}[e^{\omega X}] = e^{\omega \alpha + \frac{1}{2}\omega^2 \beta^2}
$$

Since $W_T \sim \mathcal{N}(0, T)$, we have:
$$
\mathbb{E}[e^{\sigma W_T}] = e^{\sigma \cdot 0 + \frac{1}{2}\sigma^2 T} = e^{\frac{1}{2}\sigma^2 T}
$$

Therefore:
$$
\mathbb{E}[S_T] = s_0 e^{(\mu - \frac{1}{2}\sigma^2)T} \cdot e^{\frac{1}{2}\sigma^2 T} = s_0 e^{\mu T}
$$

*Intuition: Despite the volatility, the expected value grows at the constant rate $\mu$. The $-\frac{1}{2}\sigma^2$ term in the exponent and the $+\frac{1}{2}\sigma^2 T$ from the MGF cancel perfectly, giving us a clean exponential growth. This is why $\mu$ is often called the "expected return" or "drift" of the asset.*

---



### 4.c) Find the dynamics of $Z_t = S_t^m$ and show that $Z_t$ also follows a geometric Brownian motion

We apply Ito's formula to $f(S) = S^m$ where $S_t$ follows the given dynamics.

**Computing partial derivatives:**
$$
\frac{\partial f}{\partial S} = mS^{m-1}, \quad \frac{\partial^2 f}{\partial S^2} = m(m-1)S^{m-2}
$$

**Applying Ito's formula:**
$$
dZ_t = df(S_t) = \frac{\partial f}{\partial S}dS_t + \frac{1}{2}\frac{\partial^2 f}{\partial S^2}(dS_t)^2
$$

Substituting:
$$
dZ_t = mS_t^{m-1}(\mu S_t dt + \sigma S_t dW_t) + \frac{1}{2}m(m-1)S_t^{m-2}(\sigma S_t)^2 dt
$$

Simplifying:
$$
dZ_t = mS_t^m \mu dt + mS_t^m \sigma dW_t + \frac{1}{2}m(m-1)S_t^m \sigma^2 dt
$$

Collecting terms:
$$
dZ_t = \left(m\mu + \frac{1}{2}m(m-1)\sigma^2\right)S_t^m dt + m\sigma S_t^m dW_t
$$

Since $Z_t = S_t^m$:
$$
dZ_t = \left(m\mu + \frac{1}{2}m(m-1)\sigma^2\right)Z_t dt + m\sigma Z_t dW_t
$$

This is in the form $dZ_t = \tilde{\mu}Z_t dt + \tilde{\sigma}Z_t dW_t$ where:
$$
\tilde{\mu} = m\mu + \frac{1}{2}m(m-1)\sigma^2, \quad \tilde{\sigma} = m\sigma
$$

Therefore, $Z_t$ follows a geometric Brownian motion.

*Intuition: Any power of a GBM is itself a GBM, but with modified drift and volatility. The new drift includes an additional term $\frac{1}{2}m(m-1)\sigma^2$ that captures the convexity effect (Jensen's inequality). The volatility scales linearly with $m$, so higher powers are more volatile.*

---



### 4.d) Use these results to find $\mathbb{E}[S_t^m]$

From part (c), $Z_t = S_t^m$ follows a geometric Brownian motion with drift $\tilde{\mu} = m\mu + \frac{1}{2}m(m-1)\sigma^2$ and volatility $\tilde{\sigma} = m\sigma$.

Using the result from part (b), the expected value of a GBM with drift $\tilde{\mu}$ is:
$$
\mathbb{E}[Z_t] = Z_0 e^{\tilde{\mu}t}
$$

Since $Z_0 = S_0^m = s_0^m$:
$$
\mathbb{E}[S_t^m] = \mathbb{E}[Z_t] = s_0^m e^{\left(m\mu + \frac{1}{2}m(m-1)\sigma^2\right)t}
$$

Simplifying:
$$
\mathbb{E}[S_t^m] = s_0^m e^{m\mu t + \frac{1}{2}m(m-1)\sigma^2 t}
$$

*Intuition: For $m > 1$ (or $m < 0$), the expected value of $S_t^m$ exceeds $(s_0 e^{\mu t})^m$ by Jensen's inequality: the power function is convex, so the expectation of the power exceeds the power of the expectation. For $0 < m < 1$ the power function is concave and the inequality reverses. The extra term $\frac{1}{2}m(m-1)\sigma^2 t$ quantifies this effect. This is crucial for pricing options and computing risk measures that depend on moments of asset prices.*
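
The moment formula can be verified without simulation by integrating the closed-form solution from part (a) against the normal density. The sketch below uses Gauss-Hermite quadrature from NumPy (`hermegauss`, whose weight function is $e^{-x^2/2}$); the parameter values and the helper names `moment_numeric` and `moment_formula` are assumptions made for illustration.

```python
import numpy as np
from numpy.polynomial.hermite_e import hermegauss

# Illustrative parameters (assumptions, not from the problem set)
s0, mu, sigma, t = 100.0, 0.05, 0.2, 1.5

# Gauss-Hermite nodes/weights for the weight exp(-x^2 / 2)
nodes, weights = hermegauss(60)
weights = weights / np.sqrt(2.0 * np.pi)   # normalise to the N(0, 1) density

def moment_numeric(m):
    """E[S_t^m] by integrating the closed-form solution over W_t = sqrt(t) * Z."""
    s_t = s0 * np.exp((mu - 0.5 * sigma**2) * t + sigma * np.sqrt(t) * nodes)
    return np.sum(weights * s_t**m)

def moment_formula(m):
    """E[S_t^m] from part (d)."""
    return s0**m * np.exp((m * mu + 0.5 * m * (m - 1) * sigma**2) * t)

for m in (1, 2, 3, 0.5):
    print(m, moment_numeric(m), moment_formula(m))
```

The case `m = 1` reproduces $\mathbb{E}[S_t] = s_0 e^{\mu t}$ from part (b).
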

---

## Problem 5

**Problem Statement:** Let $\sigma(t)$ be a given deterministic function of time and define the process $X_t$ by
$$
X(t) = \int_0^t \sigma(s)dW_s
$$
Also define $Z(t) = e^{i\omega X(t)}$ where $i$ is the imaginary unit and $\omega$ is a real constant.

---



### 5.a) Find the dynamics of $X_t$

Since $X_t$ is defined as an Ito integral with deterministic integrand $\sigma(t)$:
$$
X(t) = \int_0^t \sigma(s)dW_s
$$

Taking differentials:
$$
dX_t = \sigma(t)dW_t
$$

*Intuition: The dynamics of $X_t$ are simply the integrand $\sigma(t)$ times the increment $dW_t$. This is a "pure diffusion" process with no drift term, and the volatility at time $t$ is given by the deterministic function $\sigma(t)$.*

---



### 5.b) Find the dynamics of $Z_t$ and show that $Z_t$ has dynamics:
$$
dZ_t = -\frac{1}{2}\omega^2\sigma^2(t)Z(t)dt + i\omega\sigma(t)Z_tdW_t, \quad Z_0 = 1
$$

We apply Ito's formula to $f(X) = e^{i\omega X}$ where $dX_t = \sigma(t)dW_t$.

**Computing partial derivatives:**
$$
\frac{\partial f}{\partial X} = i\omega e^{i\omega X}, \quad \frac{\partial^2 f}{\partial X^2} = (i\omega)^2 e^{i\omega X} = -\omega^2 e^{i\omega X}
$$

**Applying Ito's formula:**
$$
dZ_t = df(X_t) = \frac{\partial f}{\partial X}dX_t + \frac{1}{2}\frac{\partial^2 f}{\partial X^2}(dX_t)^2
$$

Substituting:
$$
dZ_t = i\omega e^{i\omega X_t}\sigma(t)dW_t + \frac{1}{2}(-\omega^2)e^{i\omega X_t}(\sigma(t)dW_t)^2
$$

Using $(dW_t)^2 = dt$:
$$
dZ_t = i\omega e^{i\omega X_t}\sigma(t)dW_t - \frac{1}{2}\omega^2 e^{i\omega X_t}\sigma^2(t)dt
$$

Since $Z_t = e^{i\omega X_t}$:
$$
dZ_t = -\frac{1}{2}\omega^2\sigma^2(t)Z_tdt + i\omega\sigma(t)Z_tdW_t
$$

The initial condition is $Z_0 = e^{i\omega X_0} = e^{i\omega \cdot 0} = 1$.

*Intuition: The complex exponential $Z_t$ has dynamics involving both a deterministic drift term (proportional to $-\omega^2\sigma^2(t)$) and a complex stochastic term. The drift term is negative and represents the decay of the characteristic function, while the diffusion term provides the oscillatory behavior.*

---



### 5.c) Integrate $dZ_t$ and take expectations to find an expression for $\mathbb{E}[Z(t)]$

Integrating the SDE from $0$ to $t$:
$$
Z(t) - Z(0) = -\frac{1}{2}\omega^2\int_0^t \sigma^2(s)Z(s)ds + i\omega\int_0^t \sigma(s)Z_sdW_s
$$

Since $Z(0) = 1$:
$$
Z(t) = 1 - \frac{1}{2}\omega^2\int_0^t \sigma^2(s)Z(s)ds + i\omega\int_0^t \sigma(s)Z_sdW_s
$$

**Taking expectations:**

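The expected value of the Ito integral is zero (its integrand is square-integrable because $|Z_s| = 1$), and the expectation can be moved inside the time integral, so: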
$$
\mathbb{E}[Z(t)] = 1 - \frac{1}{2}\omega^2\int_0^t \sigma^2(s)\mathbb{E}[Z(s)]ds + i\omega\mathbb{E}\left[\int_0^t \sigma(s)Z_sdW_s\right]
$$
$$
\mathbb{E}[Z(t)] = 1 - \frac{1}{2}\omega^2\int_0^t \sigma^2(s)\mathbb{E}[Z(s)]ds
$$

*Intuition: The stochastic integral vanishes in expectation, leaving us with an integral equation relating $\mathbb{E}[Z(t)]$ to its past values. This is the key step in finding the characteristic function.*

---



### 5.d) Define $m(t) = \mathbb{E}[Z(t)]$ and show that $m(t)$ satisfies the ODE:
$$
m'(t) = -\frac{1}{2}\omega^2\sigma^2(t)m(t), \quad m(0) = 1
$$

Setting $m(t) = \mathbb{E}[Z(t)]$, we have from part (c):
$$
m(t) = 1 - \frac{1}{2}\omega^2\int_0^t \sigma^2(s)m(s)ds
$$

**Differentiating both sides with respect to $t$:**

Using the Fundamental Theorem of Calculus:
$$
\frac{d}{dt}m(t) = -\frac{1}{2}\omega^2\sigma^2(t)m(t)
$$

Therefore:
$$
m'(t) = -\frac{1}{2}\omega^2\sigma^2(t)m(t)
$$

The initial condition is:
$$
m(0) = \mathbb{E}[Z(0)] = \mathbb{E}[1] = 1
$$

*Intuition: We've transformed the integral equation into a simple first-order ODE. This differential equation describes how the characteristic function evolves over time, with the rate of change proportional to both the current value and the squared volatility.*

---



### 5.e) Argue that $\mathbb{E}[e^{i\omega X(t)}] = \exp\left(-\frac{1}{2}\omega^2\int_0^t \sigma^2(s)ds\right)$ and why we can say that $X(t) \sim \mathcal{N}\left(0, \int_0^t \sigma^2(s)ds\right)$

**Solving the ODE:**

The ODE from part (d) is:
$$
m'(t) = -\frac{1}{2}\omega^2\sigma^2(t)m(t), \quad m(0) = 1
$$

This is a separable first-order ODE. Rearranging:
$$
\frac{dm}{m} = -\frac{1}{2}\omega^2\sigma^2(t)dt
$$

Integrating both sides from $0$ to $t$:
$$
\ln m(t) - \ln m(0) = -\frac{1}{2}\omega^2\int_0^t \sigma^2(s)ds
$$

Since $m(0) = 1$ and $\ln(1) = 0$:
$$
\ln m(t) = -\frac{1}{2}\omega^2\int_0^t \sigma^2(s)ds
$$

Taking exponentials:
$$
m(t) = \exp\left(-\frac{1}{2}\omega^2\int_0^t \sigma^2(s)ds\right)
$$

**Characteristic function:**

Since $m(t) = \mathbb{E}[Z(t)] = \mathbb{E}[e^{i\omega X(t)}]$, we have:
$$
\hat{f}_{X(t)}(\omega) = \mathbb{E}[e^{i\omega X(t)}] = \exp\left(-\frac{1}{2}\omega^2\int_0^t \sigma^2(s)ds\right)
$$

**Identifying the distribution:**

Recall that if $Y \sim \mathcal{N}(\mu, \sigma^2)$, then its characteristic function is:
$$
\mathbb{E}[e^{i\omega Y}] = \exp\left(i\omega\mu - \frac{1}{2}\omega^2\sigma^2\right)
$$

Comparing with our result:
$$
\hat{f}_{X(t)}(\omega) = \exp\left(0 - \frac{1}{2}\omega^2\int_0^t \sigma^2(s)ds\right)
$$

This is the characteristic function of a normal random variable with mean $0$ and variance $\int_0^t \sigma^2(s)ds$.

Therefore:
$$
X(t) \sim \mathcal{N}\left(0, \int_0^t \sigma^2(s)ds\right)
$$

*Intuition: This problem demonstrates a powerful technique for finding the distribution of an Ito integral with deterministic integrand. We've shown that such integrals are always normally distributed, with variance equal to the integral of the squared volatility function. This result is fundamental in stochastic calculus and is used extensively in Problems 3 and 4. The key insight is that even though $\sigma(t)$ varies with time, the accumulated randomness follows a Gaussian distribution with variance determined by integrating $\sigma^2(t)$ over time. This generalizes the result that $\int_0^t dW_s = W_t \sim \mathcal{N}(0,t)$ to the case of time-varying volatility.*
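
As a numerical cross-check of parts (d) and (e), the ODE for $m(t)$ can be integrated directly and compared with the closed-form exponential. The sketch below uses a classical Runge-Kutta scheme and a hypothetical volatility function $\sigma(s) = 0.2 + 0.1s$; the function, the value of $\omega$ and the horizon are all assumptions chosen for illustration.

```python
import math

def sigma(s):
    """Hypothetical deterministic volatility function (an assumption)."""
    return 0.2 + 0.1 * s

def rhs(t, m, omega):
    """Right-hand side of m'(t) = -0.5 * omega^2 * sigma(t)^2 * m(t)."""
    return -0.5 * omega**2 * sigma(t)**2 * m

def solve_ode(omega, t_end, n_steps=1000):
    """Classical fourth-order Runge-Kutta integration from m(0) = 1."""
    h = t_end / n_steps
    t, m = 0.0, 1.0
    for _ in range(n_steps):
        k1 = rhs(t, m, omega)
        k2 = rhs(t + h / 2, m + h / 2 * k1, omega)
        k3 = rhs(t + h / 2, m + h / 2 * k2, omega)
        k4 = rhs(t + h, m + h * k3, omega)
        m += h / 6 * (k1 + 2 * k2 + 2 * k3 + k4)
        t += h
    return m

def closed_form(omega, t_end):
    """exp(-0.5 * omega^2 * int_0^t sigma(s)^2 ds) for the sigma above."""
    # int_0^t (0.2 + 0.1 s)^2 ds = 0.04 t + 0.02 t^2 + 0.01 t^3 / 3
    integrated_var = 0.04 * t_end + 0.02 * t_end**2 + 0.01 * t_end**3 / 3
    return math.exp(-0.5 * omega**2 * integrated_var)

omega, t_end = 1.5, 2.0
print(solve_ode(omega, t_end), closed_form(omega, t_end))
```
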

---