# Avellaneda-Stoikov Model - Part 3

```{note}
Key equation!
```

## Hamilton–Jacobi–Bellman equation

Recall that our agent’s objective is given by the value
function

$$u(s, x, q, t) = \max_{\delta^{a},\delta^{b}}\mathbb{E}_{t}[-\exp(-\gamma(X_{T} + q_{T}S_{T}))]$$

where the optimal feedback controls $\delta^{a}$ and $\delta^{b}$ will turn
out to be time and state dependent. We can use the dynamic
programming principle to show that the function $u$ solves
the following Hamilton–Jacobi–Bellman equation

$$
\begin{aligned}
u_{t} &+ \frac{1}{2}\sigma^{2}u_{ss} \\
&+ \underset{\delta^{b}}{\max}\lambda^{b}(\delta^{b})[u(s, x-s+\delta^{b}, q+1, t) - u(s, x, q, t)] \\
&+ \underset{\delta^{a}}{\max}\lambda^{a}(\delta^{a})[u(s, x+s+\delta^{a}, q-1, t) - u(s, x, q, t)] = 0\\
u(&s,x,q,T) = -\exp(-\gamma(x+ qs)).
\end{aligned}
$$

```{note}
$u(s, x, q, t)$ is the maximum expected utility given the current state (time $t$, inventory $q$, cash $x$ and price $s$). Dynamic programming gives a recursion from $t$ to $t+\Delta{t}$:

$$u(s,x,q,t) = \max_{\delta^{a},\delta^{b}}\mathbb{E}[u(s_{t+\Delta{t}}, x_{t+\Delta{t}}, q_{t+\Delta{t}}, t+\Delta{t})]$$

```

````{prf:proof}

When $\Delta{t}$ is sufficiently small,

$$
\begin{aligned}
&u(s_{t+\Delta{t}}, x_{t+\Delta{t}}, q_{t+\Delta{t}}, t+\Delta{t}) - u(s,x,q,t)\approx\\
&\frac{\partial u}{\partial s}\Delta{s} + \frac{\partial u}{\partial x}\Delta{x} + \frac{\partial u}{\partial q}\Delta{q} + \frac{\partial u}{\partial t}\Delta{t}\approx\\
&(u(s_{t+\Delta{t}}, x, q, t+\Delta{t}) - u(s,x,q,t)) + (u(s, x_{t+\Delta{t}}, q_{t+\Delta{t}}, t) - u(s,x,q,t))
\end{aligned}
$$

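Let $\lambda^{a}(\delta^{a})$ and $\lambda^{b}(\delta^{b})$ be the arrival rates of sell and buy events, respectively. Within $\Delta{t}$, the probabilities that a sell or a buy occurs are then $\lambda^{a}(\delta^{a})\Delta{t}$ and $\lambda^{b}(\delta^{b})\Delta{t}$ (ignoring the case of multiple sells or buys).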

* Increment of $u$ when a sell occurs:<br>
 $u(s, x+s+\delta^{a}, q-1, t) - u(s,x,q,t)$
 
* Increment of $u$ when a buy occurs:<br>
 $u(s, x-(s-\delta^{b}), q+1, t) - u(s,x,q,t)$

Therefore:

$$
\begin{aligned}
&\mathbb{E}[u(s, x_{t+\Delta{t}}, q_{t+\Delta{t}}, t)] - u(s,x,q,t) \approx \\
&\lambda^{a}(\delta^{a})\Delta{t}(u(s, x+s+\delta^{a}, q-1, t) - u(s,x,q,t)) + \\
&\lambda^{b}(\delta^{b})\Delta{t}(u(s, x-s+\delta^{b}, q+1, t) - u(s,x,q,t))
\end{aligned}
$$

On the other hand, a Taylor expansion of $u$ in $s$ and $t$ gives:

$$
\begin{aligned}
u(s_{t+\Delta{t}}, x, q, t+\Delta{t}) - u(s,x,q,t) &\approx \frac{\partial u}{\partial t}\Delta{t} + \frac{\partial u}{\partial s}\Delta{s} + \frac{1}{2}\frac{\partial^{2} u}{\partial s^{2}}(\Delta{s})^{2}
\end{aligned}
$$

where $\Delta{s}=\sigma\Delta{W_{t}}$ is a random variable. Since $\Delta{W_{t}}\sim\mathcal{N}(0, \Delta{t})$, we have $\mathbb{E}[(\Delta{W_{t}})^{2}] = \Delta{t}$ and $\mathbb{E}[(\Delta{s})^{2}] = \sigma^{2}\Delta{t}$, while $\mathbb{E}[\Delta{s}]=0$. Taking expectations:

$$
\mathbb{E}[u(s_{t+\Delta{t}}, x, q, t+\Delta{t})] - u(s,x,q,t) \approx \frac{\partial u}{\partial t}\Delta{t} + \frac{1}{2}\sigma^{2}\frac{\partial^{2} u}{\partial s^{2}}\Delta{t}
$$

Combining the two parts (the maximum splits because each control appears in only one term):

$$
\begin{aligned}
0&=\max_{\delta^{a},\delta^{b}}\mathbb{E}[u(s_{t+\Delta{t}}, x_{t+\Delta{t}}, q_{t+\Delta{t}}, t+\Delta{t})] - u(s,x,q,t)\\
&\approx \left(u_{t} + \frac{1}{2}\sigma^{2}u_{ss}\right)\Delta{t} \\
&+ \max_{\delta^{a}}\lambda^{a}(\delta^{a})\Delta{t}(u(s, x+s+\delta^{a}, q-1, t) - u(s,x,q,t)) \\
&+ \max_{\delta^{b}}\lambda^{b}(\delta^{b})\Delta{t}(u(s, x-s+\delta^{b}, q+1, t) - u(s,x,q,t))
\end{aligned}
$$

Dividing by $\Delta{t}$ and letting $\Delta{t}\to 0$ yields the HJB equation.

````
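The proof treats the probability of a fill over a short interval as $\lambda\Delta{t}$ and ignores multiple fills. As a quick sanity check, the sketch below assumes fills arrive as a Poisson process with a constant rate over the interval and compares the exact probabilities with the first-order approximation. The rate `lam` and the step sizes are illustrative values, not taken from the source.

```python
import math


def fill_probabilities(lam, dt):
    """Exact Poisson probabilities of exactly one event and of two or more events in dt."""
    p_none = math.exp(-lam * dt)
    p_one = lam * dt * p_none
    p_multi = 1.0 - p_none - p_one
    return p_one, p_multi


lam = 1.4  # illustrative arrival rate (assumption)
for dt in (0.1, 0.01, 0.001):
    p_one, p_multi = fill_probabilities(lam, dt)
    # P(one fill) approaches lam*dt, while P(multiple fills) shrinks like dt**2
    print(f"dt={dt}: P(one)={p_one:.6f}  lam*dt={lam * dt:.6f}  P(multiple)={p_multi:.2e}")
```

The probability of two or more fills is of order $(\Delta{t})^{2}$, so it vanishes after dividing by $\Delta{t}$ and taking the limit.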

## Transformation of HJB equation

The solution to this nonlinear PDE is continuous in
the variables $s$, $x$ and $t$ and depends on the discrete
values of the inventory $q$. Due to our choice of
exponential utility, we are able to simplify the problem
with the ansatz

$$u(s,x,q,t) = -\exp(-\gamma x)\exp(-\gamma\theta(s,q,t))$$

Direct substitution yields the following equation for $\theta$:

$$
\begin{aligned}
\theta_{t} &+ \frac{1}{2}\sigma^{2}\theta_{ss}-\frac{1}{2}\sigma^{2}\gamma\theta_{s}^{2} \\
&+ \underset{\delta^{b}}{\max}\left[\frac{\lambda^{b}(\delta^{b})}{\gamma}[1 - e^{\gamma(s-\delta^{b} - r^{b})}]\right] \\
&+ \underset{\delta^{a}}{\max}\left[\frac{\lambda^{a}(\delta^{a})}{\gamma}[1 - e^{-\gamma(s+\delta^{a} - r^{a})}]\right] = 0\\
\theta(&s,q,T) = qs.
\end{aligned}
$$
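The substitution can be checked term by term. The following is a sketch of that computation, assuming $\theta$ is smooth in $s$ and $t$. With $u = -e^{-\gamma x}e^{-\gamma\theta}$, the derivatives are

$$
u_{t} = -\gamma\theta_{t}u,\qquad u_{s} = -\gamma\theta_{s}u,\qquad u_{ss} = -\gamma u\left(\theta_{ss} - \gamma\theta_{s}^{2}\right),
$$

and the two jump terms factor as

$$
\begin{aligned}
u(s, x-s+\delta^{b}, q+1, t) - u(s,x,q,t) &= u(s,x,q,t)\left[e^{\gamma(s-\delta^{b}) - \gamma(\theta(s,q+1,t)-\theta(s,q,t))} - 1\right],\\
u(s, x+s+\delta^{a}, q-1, t) - u(s,x,q,t) &= u(s,x,q,t)\left[e^{-\gamma(s+\delta^{a}) + \gamma(\theta(s,q,t)-\theta(s,q-1,t))} - 1\right].
\end{aligned}
$$

Since $u<0$, dividing the HJB equation by $-\gamma u>0$ preserves both maxima. Abbreviating $r^{b} = \theta(s,q+1,t)-\theta(s,q,t)$ and $r^{a} = \theta(s,q,t)-\theta(s,q-1,t)$ then gives the equation for $\theta$. The terminal condition follows from $-\exp(-\gamma(x+qs)) = -e^{-\gamma x}e^{-\gamma qs}$.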

Applying the definition of reservation bid and ask prices to the ansatz, we find that $r^{b}$ and $r^{a}$ depend directly on this function $\theta$. Indeed

$$r^{b}(s,q,t) = \theta(s,q+1,t)-\theta(s,q,t)$$

is the reservation bid price of the stock, when the
inventory is $q$ and

$$r^{a}(s,q,t) = \theta(s,q,t)-\theta(s,q-1,t)$$

is the reservation ask price, when the inventory is $q$. From the first-order optimality condition, we obtain the optimal distances $\delta^{b}$ and $\delta^{a}$. They are given
by the implicit relations

$$s - r^{b}(s,q,t) = \delta^{b} - \frac{1}{\gamma}\ln\left(1 - \gamma\frac{\lambda^{b}(\delta^{b})}{(\partial\lambda^{b}/\partial\delta)(\delta^{b})}\right)$$

and

$$r^{a}(s,q,t) - s = \delta^{a} - \frac{1}{\gamma}\ln\left(1 - \gamma\frac{\lambda^{a}(\delta^{a})}{(\partial\lambda^{a}/\partial\delta)(\delta^{a})}\right)$$

```{tip}
Proof:<br>
![](images/as10.png)
```
```{tip}
![](images/as11.png)
```
```{tip}
![](images/as12.png)
```
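The implicit relations above can be solved numerically once an intensity function is chosen. The sketch below is a minimal illustration, not the source's method. It assumes an exponential intensity $\lambda(\delta) = Ae^{-k\delta}$, for which $\lambda/\lambda' = -1/k$ and the relation reduces to $\delta = \text{gap} + \frac{1}{\gamma}\ln(1+\gamma/k)$. The function name `optimal_distance`, the bisection bracket, and the parameter values are all hypothetical choices made for this example.

```python
import math


def optimal_distance(gap, gamma, lam, dlam, lo=-10.0, hi=10.0, tol=1e-12):
    """Solve gap = delta - (1/gamma) * ln(1 - gamma * lam(delta) / dlam(delta)) by bisection.

    gap is s - r_b for the bid side or r_a - s for the ask side.
    lam and dlam are the intensity function and its derivative.
    """
    def g(delta):
        return delta - math.log(1.0 - gamma * lam(delta) / dlam(delta)) / gamma - gap

    if g(lo) * g(hi) > 0:
        raise ValueError("root is not bracketed by [lo, hi]")
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if g(lo) * g(mid) <= 0:
            hi = mid  # root lies in the lower half
        else:
            lo = mid  # root lies in the upper half
    return 0.5 * (lo + hi)


# Illustrative parameters (assumptions, not from the source)
A, k, gamma = 140.0, 1.5, 0.1
lam = lambda d: A * math.exp(-k * d)
dlam = lambda d: -k * A * math.exp(-k * d)

gap = 0.2  # hypothetical value of s - r_b
delta_b = optimal_distance(gap, gamma, lam, dlam)
closed_form = gap + math.log(1.0 + gamma / k) / gamma
print(delta_b, closed_form)
```

For the exponential intensity the root-finder is redundant, since the closed form is available. It is kept general so that other intensity functions can be passed in through `lam` and `dlam`.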

In summary, the optimal bid and ask quotes are
obtained through an intuitive, two-step procedure.

* First, we solve the PDE in order to obtain the
reservation bid and ask prices $r^{b}(s,q,t)$ and $r^{a}(s,q,t)$.

* Second, we solve the implicit equations and
obtain the optimal distances $\delta^{b}(s,q,t)$ and $\delta^{a}(s,q,t)$ between the mid-price and the optimal bid and ask quotes.