# Problem 1 [25 points] Brachistochrone revisited

In this exercise, we will solve the Brachistochrone problem shown below using optimal control.

![Brachistochrone OCP](https://i.ibb.co/HptwnZh/Brachistochrone-OCP.png)

## (a) [5 points] OCP reformulation
Let the control variable be the angle $\theta$ that the velocity vector $V$ (tangent to the curve) makes with the horizontal axis, as shown above. Reformulate the Brachistochrone problem of going from point $A\equiv(0,0)$ to point $B\equiv(x_{1},y_{1})$ as a **minimum time optimal control problem** with terminal time $T$ free, and terminal state $(x(T),y(T))$ fixed. In other words, write the corresponding problem in standard OCP form following the general template.

## Solution for Prob 1(a):
The optimal control problem is to
$$\underset{\theta(\cdot)}{\text{minimize}}\quad\int_{0}^{T}1\:{\rm{d}}t$$
subject to $\dot{x}=V\cos\theta, \dot{y}=V\sin\theta$, where $V = \sqrt{2gy}$. The initial condition $(x(0),y(0))\equiv(0,0)$, and the terminal condition $(x(T),y(T))\equiv(x_1,y_1)$ are given. There is no terminal cost ($\phi\equiv 0$); the terminal constraint is
$$\psi(x(T),T) := \begin{pmatrix}x(T) - x_{1}\\y(T) - y_{1} \end{pmatrix} = \begin{pmatrix}0\\0\end{pmatrix}.$$

## (b) [10 points] Two identities
**Prove that** the optimal position tuple $(x(t),y(t))$ at any time $t\in[0,T]$, is given by
$$x(t) = x_{1} + \frac{y_{1}}{2\cos^{2}\theta(T)}\bigg\{2\left(\theta(T) - \theta(t)\right) + \sin\left(2\theta(T)\right) - \sin\left(2\theta(t)\right)\bigg\}, \quad y(t) = y_{1}\frac{\cos^{2}\theta(t)}{\cos^{2}\theta(T)}.$$
The above two identities allow (numerically) solving for $(\theta(t),\theta(T))$ as functions of $(x(t),y(t))$. This helps define the optimal state feedback: $\theta(t)$ as a function of $(x(t),y(t))$.

## Solution for Prob 1(b):
The Hamiltonian $H = 1 + \sqrt{2gy}\left(\lambda_{1}\cos\theta + \lambda_{2}\sin\theta\right)$ gives the necessary conditions:
\begin{align}
\dot{\lambda}_{1} = -\frac{\partial H}{\partial x} = 0, \quad \dot{\lambda}_{2} = -\frac{\partial H}{\partial y} = -\frac{g}{V}\left(\lambda_{1}\cos\theta + \lambda_{2}\sin\theta\right),\qquad(1)\\
0 = \frac{\partial H}{\partial\theta} = -\lambda_{1}V\sin\theta + \lambda_{2}V\cos\theta. 
\qquad(2)
\end{align}
From the costate ODEs, $\lambda_{1} = $ constant.
Since ${\rm{d}}x(T)=0$ and ${\rm{d}}T\neq 0$, the transversality condition yields $H(T) = 0$. Moreover, since $H$ has no explicit time dependence, it must be constant along the optimal trajectory (Lec. 7 p. 10-11), which combined with $H(T) = 0$ gives $H(t)=0$ for all $t\in[0,T]$. In other words, 
\begin{align}
H = 1 + \lambda_{1}V\cos\theta + \lambda_{2}V\sin\theta = 0.\qquad(3)
\end{align}
Eliminating $\lambda_{2}$ from (2) and (3), we obtain
\begin{align}
\lambda_{1} = -\frac{\cos\theta}{V}.\qquad(4)
\end{align}
Likewise, eliminating $\lambda_{1}$ from (2) and (3), we obtain
\begin{align}
\lambda_{2} = -\frac{\sin\theta}{V}.\qquad(5)
\end{align}
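The elimination behind (4) and (5) can be spelled out as follows. This is a sketch that assumes $V\neq 0$ (i.e., $t>0$), an assumption not stated explicitly above. Dividing (2) by $V$ gives $\lambda_{2}\cos\theta = \lambda_{1}\sin\theta$. Multiplying (3) by $\cos\theta$ and by $\sin\theta$ in turn, and using this relation, we get
$$\begin{aligned}
0 &= \cos\theta + \lambda_{1}V\cos^{2}\theta + \left(\lambda_{2}\cos\theta\right)V\sin\theta = \cos\theta + \lambda_{1}V\left(\cos^{2}\theta+\sin^{2}\theta\right) = \cos\theta + \lambda_{1}V,\\
0 &= \sin\theta + \left(\lambda_{1}\sin\theta\right)V\cos\theta + \lambda_{2}V\sin^{2}\theta = \sin\theta + \lambda_{2}V\left(\cos^{2}\theta+\sin^{2}\theta\right) = \sin\theta + \lambda_{2}V.
\end{aligned}$$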
Furthermore, combining (4) with $\dot{\lambda}_{1}=0$, we get
\begin{align}
0=\dot{\lambda}_{1} = \frac{\partial\lambda_{1}}{\partial\theta}\dot{\theta} + \frac{\partial\lambda_{1}}{\partial y}\dot{y} = \frac{\sin\theta}{V}\dot{\theta} + \frac{g\sin\theta\cos\theta}{V^{2}}\qquad(6)\\
\Rightarrow\quad 0 = \dot{\theta} + \frac{g}{V}\cos\theta \quad \Rightarrow\quad \dot{\theta} = -\frac{g}{V}\cos\theta.\qquad(7)
\end{align}
Now, evaluating (4) at a generic time $t$ and at the terminal time $T$, and using the fact that $\lambda_{1}$ is constant (so the common factor $-1/\sqrt{2g}$ cancels), we obtain
\begin{align}
\frac{\cos\theta(t)}{\sqrt{y(t)}} = \frac{\cos\theta(T)}{\sqrt{y_{1}}} \quad\Rightarrow\quad y(t) = y_{1}\frac{\cos^{2}\theta(t)}{\cos^{2}\theta(T)}.\qquad(8)
\end{align}
Next, we use $\dot{x}=V\cos\theta$, together with (7), to get
\begin{align}
V\cos\theta = \dot{x} = \frac{{\rm{d}}x}{{\rm{d}}\theta}\dot{\theta} = \frac{{\rm{d}}x}{{\rm{d}}\theta} \times \left(-\frac{g}{V}\cos\theta\right) \quad\Rightarrow\quad \frac{{\rm{d}}x}{{\rm{d}}\theta} = -2y = -\frac{2y_{1}}{\cos^{2}\theta(T)}\cos^{2}\theta,\qquad(9)
\end{align}
wherein the last equality follows from (8). Integrating (9), we have
\begin{align}
&\int_{x(t)}^{x_{1}}{\rm{d}}x = -\frac{2y_{1}}{\cos^{2}\theta(T)}\int_{\theta(t)}^{\theta(T)}\cos^{2}\theta\:{\rm{d}}\theta\nonumber\\
\Rightarrow\quad &x(t) = x_{1} + \frac{y_{1}}{2\cos^{2}\theta(T)}\bigg\{2\left(\theta(T) - \theta(t)\right) + \sin\left(2\theta(T)\right) - \sin\left(2\theta(t)\right)\bigg\}.\qquad(10)
\end{align}
We conclude by noting that (8) and (10) are the desired identities.
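
As a sanity check on the integration step from (9) to (10), the sketch below compares the closed form (10) against a composite Simpson quadrature of ${\rm{d}}x/{\rm{d}}\theta$ from (9). The function names and the numerical values of $\theta$, $\theta(T)$, $x_1$, $y_1$ are hypothetical and chosen only for illustration.

```python
import math

def x_closed_form(theta, theta_T, x1, y1):
    """Identity (10): x as a function of theta along the optimal curve."""
    return x1 + y1 / (2.0 * math.cos(theta_T) ** 2) * (
        2.0 * (theta_T - theta) + math.sin(2.0 * theta_T) - math.sin(2.0 * theta)
    )

def x_by_quadrature(theta, theta_T, x1, y1, n=200):
    """Integrate dx/dtheta from (9) between theta and theta_T with
    composite Simpson's rule (n even), then solve for x(theta)."""
    if n % 2:
        raise ValueError("n must be even for Simpson's rule")
    h = (theta_T - theta) / n  # signed step; theta_T < theta is allowed

    def dx_dtheta(s):
        # Right-hand side of (9)
        return -2.0 * y1 * math.cos(s) ** 2 / math.cos(theta_T) ** 2

    total = dx_dtheta(theta) + dx_dtheta(theta_T)
    for k in range(1, n):
        total += (4.0 if k % 2 else 2.0) * dx_dtheta(theta + k * h)
    integral = total * h / 3.0  # equals x1 - x(theta)
    return x1 - integral

# Hypothetical values: theta(t) = 1.2 rad, theta(T) = 0.3 rad, (x1, y1) = (2, 1)
gap = abs(x_closed_form(1.2, 0.3, 2.0, 1.0) - x_by_quadrature(1.2, 0.3, 2.0, 1.0))
print(f"closed form vs quadrature gap: {gap:.2e}")
```

The two values should agree up to the quadrature error, which for this smooth integrand is far below the closed form's magnitude.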

## (c) [2 + 3 + 5 = 10 points] Properties of optimal solution
(c.1) **Prove that** the optimal control $\theta(t)$ satisfies $\dot{\theta} = $ constant.

(c.2) **Prove that** $\theta(0) = \frac{\pi}{2}$. **Give a one sentence physical interpretation** of this result.

(c.3) Letting $\phi := \pi - 2\theta$, use your answer in part (b) to **deduce that the optimal curve is a cycloid**.

## Solution for Prob 1(c):
(c.1) From equations (4) and (7) above, we get $\dot{\theta} = g\lambda_{1} =$ constant, assuming $g$ is constant as in the original Brachistochrone formulation.
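
A quick numerical consistency check of this claim is sketched below. It builds $\theta(t)$ as a linear function of $t$ with slope $g\lambda_{1}$, where $\lambda_{1}$ comes from (4) evaluated at $t=T$, reconstructs $(x,y)$ from (8) and (10), and tests by central differences that $\dot{x}=V\cos\theta$ and $\dot{y}=V\sin\theta$ hold. The function names, the value $g=9.81$, and the sample values of $T$, $\theta(T)$, $x_1$, $y_1$ are assumptions made for illustration; here $T$ only serves as a time offset.

```python
import math

def optimal_state(t, T, theta_T, x1, y1, g=9.81):
    """Return (theta, x, y) at time t from theta_dot = g*lambda_1 and identities (8), (10)."""
    lam1 = -math.cos(theta_T) / math.sqrt(2.0 * g * y1)  # (4) evaluated at t = T
    theta = theta_T + g * lam1 * (t - T)                  # constant slope, as in (c.1)
    c = y1 / math.cos(theta_T) ** 2
    y = c * math.cos(theta) ** 2                          # identity (8)
    x = x1 + 0.5 * c * (2.0 * (theta_T - theta)
                        + math.sin(2.0 * theta_T) - math.sin(2.0 * theta))  # identity (10)
    return theta, x, y

def dynamics_residual(t, T, theta_T, x1, y1, g=9.81, h=1e-5):
    """Central-difference residuals of x_dot = V cos(theta) and y_dot = V sin(theta)."""
    theta, _, y = optimal_state(t, T, theta_T, x1, y1, g)
    _, x_plus, y_plus = optimal_state(t + h, T, theta_T, x1, y1, g)
    _, x_minus, y_minus = optimal_state(t - h, T, theta_T, x1, y1, g)
    v = math.sqrt(2.0 * g * y)
    return ((x_plus - x_minus) / (2.0 * h) - v * math.cos(theta),
            (y_plus - y_minus) / (2.0 * h) - v * math.sin(theta))

# Hypothetical sample point: both residuals should be close to zero.
print(dynamics_residual(t=0.3, T=0.5, theta_T=0.3, x1=2.0, y1=1.0))
```

The check is only valid where $\cos\theta(t) > 0$, which is the branch used when taking the square root in (8).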

(c.2) Evaluating equation (8) above at $t=0$, where $y(0)=0$, gives $\cos^{2}\theta(0)=0$, i.e., $\theta(0)=\frac{\pi}{2}$. Physically, the particle starts at rest (since $V=\sqrt{2gy}=0$ at $A$) and the terminal point $B$ is below the initial point $A$, so to minimize the time to go it is optimal to head vertically at first and increase the vertical coordinate $y$, and hence the speed, as fast as possible.

(c.3) In equations (8) and (10) above, we substitute $\theta = (\pi-\phi)/2$, and define the constant
$$a := \frac{y_{1}}{2\cos^{2}\theta(T)} = \frac{y_{1}}{1-\cos\phi(T)}.$$
Then (8) gives 
$$y = a\left[1 + \cos(\pi-\phi)\right]=a\left(1-\cos\phi\right).$$ 
Similarly, (10) gives 
$$x-x_{1} + a\left[\phi(T) - \sin\phi(T)\right]=a(\phi-\sin\phi).$$
The above two equations give the parametric form of a cycloid passing through $(x_1,y_1)$. Here, $\theta(T)$ (and thus $\phi(T)$) is such that the cycloid passes through point $A$ with coordinates $(0,0)$.
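
The condition that the cycloid passes through $A$ can be solved numerically. Setting $x=0$, $\phi=0$ (since $\theta(0)=\frac{\pi}{2}$) in the two equations above and dividing gives $\frac{\phi(T)-\sin\phi(T)}{1-\cos\phi(T)} = \frac{x_{1}}{y_{1}}$, whose left side is increasing on $(0,2\pi)$, so bisection applies. The sketch below is one possible implementation under the assumptions $x_1>0$, $y_1>0$, a single cycloid arch, and $g=9.81$; the function name is hypothetical. It also returns the travel time $T=\phi(T)\sqrt{a/g}$, which follows from $\dot{\phi} = -2\dot{\theta} = -2g\lambda_{1} = \sqrt{g/a}$.

```python
import math

def fit_cycloid(x1, y1, g=9.81, iters=200):
    """Return (phi_T, a, T) for the cycloid from (0, 0) to (x1, y1).

    Assumes x1 > 0, y1 > 0 and phi_T in (0, 2*pi); g = 9.81 is an assumed default.
    """
    if x1 <= 0.0 or y1 <= 0.0:
        raise ValueError("expected x1 > 0 and y1 > 0")
    target = x1 / y1

    def ratio(phi):
        # (phi - sin phi) / (1 - cos phi), written with 1 - cos phi = 2 sin^2(phi/2)
        return (phi - math.sin(phi)) / (2.0 * math.sin(0.5 * phi) ** 2)

    lo, hi = 1e-6, 2.0 * math.pi - 1e-6  # ratio is increasing on (0, 2*pi)
    for _ in range(iters):
        mid = 0.5 * (lo + hi)
        if ratio(mid) < target:
            lo = mid
        else:
            hi = mid
    phi_T = 0.5 * (lo + hi)
    a = y1 / (1.0 - math.cos(phi_T))   # constant a defined in (c.3)
    T = phi_T * math.sqrt(a / g)       # minimum travel time
    return phi_T, a, T

# Example: the end point (pi, 2) sits at the bottom of a cycloid with a = 1.
phi_T, a, T = fit_cycloid(math.pi, 2.0)
theta_T = 0.5 * (math.pi - phi_T)      # recover theta(T) from phi = pi - 2*theta
print(phi_T, a, T, theta_T)
```

For end points whose ratio $x_1/y_1$ is extremely small or extremely large, the bracket end points used here would need to be tightened.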

# Problem 2 [25 points] Controlling national economy

Suppose the scalar state $x(t)$ of the national economy is governed by the second order ODE
$$\ddot{x} = -\alpha^{2}x + u, \quad \alpha\in\mathbb{R}\setminus\{0\}, \quad t\geq 0, \quad x(0)=\dot{x}(0)=0,$$
where $u(t)$ is the effort the Government puts into economic reform at time $t$. 

Suppose the Government would like to maximize its chance of getting re-elected at the **fixed** terminal time $T$ by bringing the national economy to a healthy state at the time of re-election, while not spending too much effort on economic reform during its tenure, i.e.,
$$\underset{u(\cdot)}{\text{maximize}}\quad x(T) - \int_{0}^{T}u^{2}\;{\mathrm{d}}t.$$
In practice, the Government may want to maximize an increasing function of the above objective, but we will ignore such details.

## (a) [2 + 3 + 8 = 13 points] Standard form

(i) **Define the state vector** and **write the second order ODE in state space form**, i.e., as a controlled vector first order ODE.

(ii) Use your answer in part (a)(i) to clearly **rewrite the OCP in standard form**. **Identify terminal cost/terminal constraint**, if any.

(iii) Write the **Hamiltonian, the costate ODEs, the PMP**, and **the transversality condition** for the OCP in part (a)(ii).

## Solution for Prob 2(a):
(i) The state vector is $x = \begin{pmatrix}x_{1}\\
x_{2}
\end{pmatrix} := \begin{pmatrix}x\\
\dot{x}
\end{pmatrix}\in\mathbb{R}^{2}$. The controlled ODE in state space form is
$$\dot{x} = \begin{pmatrix}
0 & 1\\
-\alpha^{2} & 0
\end{pmatrix}x + \begin{pmatrix}
0\\
1
\end{pmatrix}u, \qquad u\in\mathbb{R}.$$

(ii) The OCP in standard form is
$$\underset{u(\cdot)}{\text{minimize}}\quad -x_{1}(T) + \int_{0}^{T}u^{2}\:{\rm{d}}t$$
subject to
$$\dot{x} = \begin{pmatrix}
0 & 1\\
-\alpha^{2} & 0
\end{pmatrix}x + \begin{pmatrix}
0\\
1
\end{pmatrix}u,$$
and final time $T$ fixed. The terminal cost is $\phi(x_{1}(T),T)\equiv -x_{1}(T)$. There is no terminal constraint, i.e., $\psi\equiv 0$.

(iii) The Hamiltonian $H = u^{2} + \lambda_{1}x_{2} + \lambda_{2}\left(-\alpha^{2}x_{1} + u\right)$. The costate ODEs are
$$\dot{\lambda}_{1} = -\dfrac{\partial H}{\partial x_{1}} = \alpha^{2}\lambda_{2}, \quad \dot{\lambda}_{2} = -\dfrac{\partial H}{\partial x_{2}} = -\lambda_{1}.$$
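The PMP gives $0 = \dfrac{\partial H}{\partial u} = 2u + \lambda_{2}$, which implies $u^{\text{opt}} = -\lambda_{2}^{\text{opt}}/2$. Since $\dfrac{\partial^{2} H}{\partial u^{2}} = 2 > 0$ and $u$ is unconstrained, this stationary point is indeed the minimizer of $H$ over $u$.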

Since ${\rm{d}}T=0$ and ${\rm{d}}x(T)\neq 0$, the transversality condition gives $\dfrac{\partial\phi}{\partial x(T)} = \begin{pmatrix}
-1\\
0
\end{pmatrix}=\begin{pmatrix}
\lambda_{1}(T)\\
\lambda_{2}(T)
\end{pmatrix}$.

## (b) [5 + (5 + 2) = 12 points] Solution of the OCP

(i) Find the costates in terms of $t,\alpha,T$.

**Hint:** use the transversality conditions to solve the costate ODE initial value problem.

(ii) Compute the **optimal economic reform** $u^{\text{opt}}(t)$ for the Government. **Also compute the optimal terminal (i.e., election year) reform** $u^{\text{opt}}(T)$.

## Solution for Prob 2(b):
(i) From the costate ODEs, we have $\ddot{\lambda}_{1} = \alpha^{2}\dot{\lambda}_{2} = -\alpha^{2}\lambda_{1}$, that is, $\ddot{\lambda}_{1} + \alpha^{2}\lambda_{1} = 0$, which gives
$$\lambda_{1}(t) = a\cos(\alpha t) + b\sin(\alpha t),$$
where the constants $a,b$ are to be determined from the terminal values of the costates. Consequently,
$$\lambda_{2}(t) = \frac{1}{\alpha^{2}}\dot{\lambda}_{1} = \frac{1}{\alpha}\left(-a\sin(\alpha t) + b\cos(\alpha t)\right).$$
To determine the constants $a, b$, we now use the terminal values of the costates
(coming from transversality): $\lambda_{1}(T)=-1$, $\lambda_{2}(T)=0$. This gives
$$
\left(\begin{array}{cc}
\cos (\alpha T) & \sin (\alpha T) \\
-\frac{1}{\alpha} \sin (\alpha T) & \frac{1}{\alpha} \cos (\alpha T)
\end{array}\right)\left(\begin{array}{l}
a \\
b
\end{array}\right)=\left(\begin{array}{c}
-1 \\
0
\end{array}\right) \Rightarrow\left(\begin{array}{l}
a \\
b
\end{array}\right)=\left(\begin{array}{cc}
\cos (\alpha T) & \sin (\alpha T) \\
-\frac{1}{\alpha} \sin (\alpha T) & \frac{1}{\alpha} \cos (\alpha T)
\end{array}\right)^{-1}\left(\begin{array}{c}
-1 \\
0
\end{array}\right)
$$
$$
\Rightarrow\left(\begin{array}{l}
a \\
b
\end{array}\right)=\frac{1}{\frac{1}{\alpha}\left(\cos ^{2}(\alpha T)+\sin ^{2}(\alpha T)\right)}\left(\begin{array}{cc}
\frac{1}{\alpha} \cos (\alpha T) & -\sin (\alpha T) \\
\frac{1}{\alpha} \sin (\alpha T) & \cos (\alpha T)
\end{array}\right)\left(\begin{array}{c}
-1 \\
0
\end{array}\right)=\left(\begin{array}{l}
-\cos (\alpha T) \\
-\sin (\alpha T)
\end{array}\right).
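$$
Substituting these constants back and using the angle-difference formulas, the costates in terms of $t,\alpha,T$ are
$$\lambda_{1}(t) = -\cos\left(\alpha(T-t)\right), \qquad \lambda_{2}(t) = -\frac{1}{\alpha}\sin\left(\alpha(T-t)\right).$$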
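The constants can also be checked numerically. The sketch below solves the same $2\times 2$ linear system by Cramer's rule for given $\alpha$ and $T$, and evaluates the resulting costates. The function names and the sample values of $\alpha$ and $T$ are hypothetical.

```python
import math

def costate_constants(alpha, T):
    """Solve [[cos, sin], [-sin/alpha, cos/alpha]] (a, b)^T = (-1, 0)^T by Cramer's rule."""
    c, s = math.cos(alpha * T), math.sin(alpha * T)
    m11, m12, m21, m22 = c, s, -s / alpha, c / alpha
    det = m11 * m22 - m12 * m21      # equals 1/alpha
    r1, r2 = -1.0, 0.0               # lambda_1(T), lambda_2(T) from transversality
    a = (r1 * m22 - m12 * r2) / det
    b = (m11 * r2 - m21 * r1) / det
    return a, b

def costates(t, alpha, T):
    """Evaluate lambda_1(t), lambda_2(t) from the general solution with the fitted a, b."""
    a, b = costate_constants(alpha, T)
    lam1 = a * math.cos(alpha * t) + b * math.sin(alpha * t)
    lam2 = (-a * math.sin(alpha * t) + b * math.cos(alpha * t)) / alpha
    return lam1, lam2

# Hypothetical values alpha = 1.7, T = 2.3
print(costate_constants(1.7, 2.3), costates(2.3, 1.7, 2.3))
```

Evaluating `costates` at $t=T$ should reproduce the terminal values $(-1, 0)$ up to rounding.
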
(ii) Combining the PMP and part 2(b)(i), we obtain
$$u^{\text{opt}}(t) = -\lambda_{2}^{\text{opt}}(t)/2 = -\frac{1}{2\alpha}\left(-a\sin(\alpha t) + b\cos(\alpha t)\right) = \frac{1}{2 \alpha}(\sin (\alpha T) \cos (\alpha t)-\cos (\alpha T) \sin (\alpha t)) = \frac{1}{2 \alpha} \sin (\alpha(T-t)).$$
Thus, the optimal terminal reform $u^{\text{opt}}(T) = 0$.
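
To see the optimality numerically, the sketch below simulates the state equation with a fixed-step RK4 scheme and evaluates the objective $x_{1}(T) - \int_{0}^{T}u^{2}\,{\rm{d}}t$ for the optimal control and for two scaled versions of it. The function names, the step count, and the values $\alpha=2$, $T=3$ are assumptions for illustration. The closed-form value used for comparison, $\frac{1}{4\alpha^{2}}\left(\frac{T}{2} - \frac{\sin(2\alpha T)}{4\alpha}\right)$, is not part of the original solution; it follows from substituting $u^{\text{opt}}$ into the objective.

```python
import math

def u_opt(t, alpha, T):
    """Optimal reform effort derived above: sin(alpha*(T - t)) / (2*alpha)."""
    return math.sin(alpha * (T - t)) / (2.0 * alpha)

def objective(u, alpha, T, n=2000):
    """RK4-integrate x1' = x2, x2' = -alpha^2*x1 + u(t), q' = u(t)^2 from zero
    initial conditions, and return x1(T) - q(T)."""
    def rhs(t, s):
        x1, x2, _ = s
        ut = u(t)
        return (x2, -alpha ** 2 * x1 + ut, ut ** 2)

    def step(s, k, scale):
        return tuple(si + scale * ki for si, ki in zip(s, k))

    h = T / n
    s = (0.0, 0.0, 0.0)
    for i in range(n):
        t = i * h
        k1 = rhs(t, s)
        k2 = rhs(t + 0.5 * h, step(s, k1, 0.5 * h))
        k3 = rhs(t + 0.5 * h, step(s, k2, 0.5 * h))
        k4 = rhs(t + h, step(s, k3, h))
        s = tuple(si + h / 6.0 * (a + 2.0 * b + 2.0 * c + d)
                  for si, a, b, c, d in zip(s, k1, k2, k3, k4))
    return s[0] - s[2]

alpha, T = 2.0, 3.0  # hypothetical values
j_opt = objective(lambda t: u_opt(t, alpha, T), alpha, T)
j_half = objective(lambda t: 0.5 * u_opt(t, alpha, T), alpha, T)
j_more = objective(lambda t: 1.5 * u_opt(t, alpha, T), alpha, T)
j_closed = (T / 2.0 - math.sin(2.0 * alpha * T) / (4.0 * alpha)) / (4.0 * alpha ** 2)
print(j_opt, j_closed, j_half, j_more)
```

Both scaled controls should yield a smaller objective than `j_opt`, consistent with $u^{\text{opt}}$ being the maximizer of the Government's objective.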