<h3>6.5 Quasi-Newton Methods</h3>

$\qquad$One problem with Newton's method is the difficulty of evaluating the Hessian $G^{(k)}$. Quasi-Newton methods, which do not require Hessian evaluation, are extensions of the one-dimensional secant method.<br>
$\qquad$In the one-dimensional secant method, the second derivative is approximated by a difference of first derivatives,<br>
$$\large f''(x^{(k)})\approx\frac{f'(x^k)-f'(x^{k-1})}{x^k-x^{k-1}}\quad\Rightarrow\quad G(x^{(k)})\approx\frac{g(x^k)-g(x^{k-1})}{x^k-x^{k-1}}$$<br>
$$\text{when}\;\;x^k,\;x^{k-1}\in\mathbb R.\hspace{11cm}$$<br>
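The one-dimensional idea can be sketched in a few lines of Python. This is only an illustration: the function name `secant_minimize`, its tolerances, and the test function $f(x)=x^4-3x$ are assumptions chosen for the example and do not come from the text.

```python
def secant_minimize(df, x_prev, x_curr, tol=1e-10, max_iter=50):
    """Find a stationary point of f by the secant method applied to f'.

    df             : callable returning f'(x)
    x_prev, x_curr : two distinct starting points
    """
    for _ in range(max_iter):
        g_prev, g_curr = df(x_prev), df(x_curr)
        if abs(g_curr) <= tol:
            break
        # Finite-difference estimate of f''(x_curr), as in the formula above
        curvature = (g_curr - g_prev) / (x_curr - x_prev)
        if curvature == 0:
            raise ZeroDivisionError("secant curvature estimate is zero")
        # Newton step with f'' replaced by the secant estimate
        x_prev, x_curr = x_curr, x_curr - g_curr / curvature
    return x_curr


# Hypothetical test function: f(x) = x**4 - 3*x, so f'(x) = 4*x**3 - 3
x_star = secant_minimize(lambda x: 4 * x**3 - 3, 0.5, 1.0)
```

Only first derivatives are evaluated; the second derivative never appears, which is the property the quasi-Newton methods carry over to $n$ dimensions.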

$\qquad$In the multivariable case the Hessian, and hence any approximation to it, is an $n\times n$ symmetric matrix.<br>
It is possible to use either<br>
i) an approximation $\quad B^{(k)}$ to $G(x^k)=\nabla^2f(x^k)$ or<br>
ii) an approximation $\quad H^{(k)}$ to $G^{-1}(x^k)=[\nabla^2f(x^k)]^{-1}$<br>
$\qquad$An important feature of quasi-Newton methods is that the approximations $B^{(k)}$ or $H^{(k)}$ are positive definite.<br>

Let $g^{(k)}\equiv\nabla f(x^k).$ Then the convex quadratic model<br>
$$q_k(d)=f^{(k)}+d^Tg^{(k)}+\frac{1}{2}d^TB^{(k)}d$$<br>
produces a search direction $d^{(k)}$ as the solution of the linear system<br>
$$\hspace{6.5cm}B^{(k)}d=-g^{(k)}\hspace{4cm}----\;(2)$$<br>

Let $H^{(k)}$ be the approximation to the inverse Hessian, $H^{(k)}=(B^{(k)})^{-1};$ then eq. $(2)$ is rewritten as<br>
$$d^{(k)}=-H^{(k)}g^{(k)}$$<br>

#### $\underline{Algorithm}\quad$
1. $\underline{Initialization}:\;\;$ Given a starting point $x^{(1)},$<br>
$\quad$a symmetric positive definite initial Hessian approximation $B^{(1)},$ tolerances $\epsilon_g\;>\;0$ and $\epsilon_f\;>\;0,$ and a maximum number of iterations $k_{max}\;,$ set $k=1$<br><br>
2. $\underline{Calculate\;the\;objective\;value}$<br>
$\quad f^{(k)}=f(x^k)\quad$ and $\quad$ the gradient $\;g^{(k)}=\nabla f(x^k)$<br><br>
3. Do loop<br>
    a) $\underline{Search\;direction}:$ Calculate the quasi-Newton direction by solving<br>
    $\qquad B^{(k)}d^{(k)}=-g^{(k)}$<br><br>
    b) $\underline{Line\;search}:$ Calculate an exact or approximate minimizer $\alpha^{(k)}$ of $f(x^k+\alpha d^k),$ starting with the trial value $\alpha=1,$ to find<br>
    $\qquad$i) New point<br>
      $\hspace{20mm} x^{k+1}=x^k+\alpha^kd^k$<br>
    $\qquad$ii) New function value and gradient<br>
      $\hspace{20mm} f^{(k+1)}=f(x^{(k+1)}),\;\;g^{k+1}=\nabla f(x^{k+1})$<br><br>
    c) $\underline{Update\;Hessian\;approximation}:$<br>
    $\qquad B^{k+1}=B^k+E^k\quad$ (for $E^k$ see the next topic)<br><br>
4. $\underline{Until}$<br>
$\quad k\;>\;k_{max}\;$ or $\;||g^k||\le\epsilon_g$ or $\;|f^k-f^{k+1}|\le\epsilon_f$<br><br>
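The steps above can be sketched in plain Python as follows. This is a minimal illustration under stated assumptions, not a reference implementation: the helper names (`solve`, `dot`, `bfgs_update`, `quasi_newton`), the default tolerances, and the backtracking (Armijo) line search with constant `1e-4` are choices made here. The correction $E^k$ is taken to be the BFGS update introduced later in this section.

```python
def solve(A, b):
    """Solve A x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    M = [row[:] + [b[i]] for i, row in enumerate(A)]
    for c in range(n):
        p = max(range(c, n), key=lambda r: abs(M[r][c]))
        M[c], M[p] = M[p], M[c]
        for r in range(c + 1, n):
            m = M[r][c] / M[c][c]
            for j in range(c, n + 1):
                M[r][j] -= m * M[c][j]
    x = [0.0] * n
    for i in reversed(range(n)):
        x[i] = (M[i][n] - sum(M[i][j] * x[j] for j in range(i + 1, n))) / M[i][i]
    return x


def dot(u, v):
    return sum(a * b for a, b in zip(u, v))


def bfgs_update(B, s, y):
    """Return B + y y^T / (s^T y) - (B s)(B s)^T / (s^T B s)."""
    Bs = [dot(row, s) for row in B]
    sy, sBs = dot(s, y), dot(s, Bs)
    n = len(s)
    return [[B[i][j] + y[i] * y[j] / sy - Bs[i] * Bs[j] / sBs
             for j in range(n)] for i in range(n)]


def quasi_newton(f, grad, x, eps_g=1e-8, eps_f=1e-14, k_max=100):
    n = len(x)
    B = [[float(i == j) for j in range(n)] for i in range(n)]  # step 1: B^(1) = I
    fx, g = f(x), grad(x)                                      # step 2
    for k in range(1, k_max + 1):
        if dot(g, g) ** 0.5 <= eps_g:
            break
        d = solve(B, [-gi for gi in g])                        # step 3a
        alpha = 1.0                                            # step 3b: backtrack from 1
        while True:
            x_new = [xi + alpha * di for xi, di in zip(x, d)]
            f_new = f(x_new)
            if f_new <= fx + 1e-4 * alpha * dot(g, d) or alpha < 1e-12:
                break
            alpha *= 0.5
        g_new = grad(x_new)
        s = [a - b for a, b in zip(x_new, x)]
        y = [a - b for a, b in zip(g_new, g)]
        if dot(s, y) > 1e-12:          # skip the update if s^T y is not safely positive
            B = bfgs_update(B, s, y)                           # step 3c
        converged = abs(fx - f_new) <= eps_f                   # step 4
        x, fx, g = x_new, f_new, g_new
        if converged:
            break
    return x, fx, k


# Example from this section: f(x) = x1^2 + 2 x2^2 started at (1, 1/4)
x_min, f_min, iters = quasi_newton(
    lambda x: x[0] ** 2 + 2 * x[1] ** 2,
    lambda x: [2 * x[0], 4 * x[1]],
    [1.0, 0.25],
)
```

Because the line search here is inexact, the iterates differ from those of an exact line search, but only function values and gradients are ever evaluated.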

$\qquad$The major advantage of quasi-Newton methods over Newton's method is that only function values and gradients must be calculated; no second derivatives are needed.<br>
If $B^{(k)}$ is positive definite and $g^{(k)}\ne0$ then $d^{(k)}$ is a descent direction, so the line search can produce an $\alpha^{(k)}\;>\;0$ with $f^{(k+1)}\;<\;f^{(k)}.$<br><br>
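A short worked justification of the descent property, using only eq. $(2)$ and the fact that the inverse of a positive definite matrix is itself positive definite:<br>
$$g^{(k)T}d^{(k)}=-g^{(k)T}\big(B^{(k)}\big)^{-1}g^{(k)}\;<\;0\qquad\text{for}\;\;g^{(k)}\ne0$$<br>
Hence the directional derivative of $f$ along $d^{(k)}$ is negative, and $f(x^k+\alpha d^k)<f(x^k)$ for all sufficiently small $\alpha>0.$<br><br>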

$\normalsize\underline{Update\;Hessian\;approximation}$<br><br>
$\qquad$The key question in the quasi-Newton algorithm is how to efficiently update the Hessian approximation $B^{(k)}.$<br>

If $f\in C^2,$ Taylor series expansion of the gradient about $x^{(k)}$ gives<br>
$$\nabla f(x^{k+1})\approx\nabla f(x^k)+\nabla^2f(x^k)(x^{k+1}-x^k)$$<br>
$$\text{or}$$<br>
$$g(x^{k+1})\approx g(x^k)+\nabla g(x^k)(x^{k+1}-x^k)$$<br>
$$\hspace{8cm} g(x^{k+1})-g(x^k)\approx G^{(k)}(x^{k+1}-x^k)\quad\text{when}\;\;\nabla g(x^k)\equiv G^{(k)}\hspace{1.5cm}----\;(6)$$<br>

$\qquad$Since $B^{(k)}$ must be known before $d^{(k)},$ and hence $x^{k+1}$ and $g^{k+1},$ can be calculated, it is not possible to choose $B^{(k)}$ to satisfy eq. $(6)$ with $G^{(k)}=B^{(k)}.$ Instead, the updated approximation $B^{k+1}$ should capture the curvature of $f(\bar x)$ along the direction $(x^{k+1}-x^k),$ that is, $B^{(k+1)}$ takes the place of $G^{(k)}.$ Eq. $(6)$ is then rewritten as<br>
$$\hspace{8cm} g^{(k+1)}-g^{(k)}=B^{(k+1)}(x^{k+1}-x^k)\hspace{6cm}----\;(7)$$<br>

Let $S^{(k)}=x^{k+1}-x^k=(x^k+\alpha^kd^k)-x^k=\alpha^kd^k\;$ and $\;Y^{(k)}=g^{(k+1)}-g^{(k)}.$ Then eq. $(7)$ becomes the quasi-Newton relation that $B^{k+1}$ must satisfy,<br>
$$\hspace{9cm}Y^{(k)}=B^{(k+1)}S^{(k)}\hspace{7.8cm}----\;(8)$$<br>
This is the fundamental way in which curvature information is included in the Hessian approximation.<br><br>

$\qquad$As the Hessian is symmetric the approximation $B^{k+1}$ must also be symmetric, so<br>
$$\hspace{9cm}B^{(k+1)T}=B^{k+1}\hspace{8cm}----\;(9)$$<br>

The updated Hessian approximation $B^{k+1}$ must satisfy three conditions, namely<br>
i) $B^{k+1}$ must satisfy the quasi-Newton relation<br>
$\qquad Y^{(k)}=B^{(k+1)}S^{(k)}$<br><br>
ii) $B^{k+1}$ must be a symmetric matrix<br><br>
iii) $B^{k+1}$ must be positive definite<br><br>
The simplest way to meet i) is to let $B^{k+1}$ differ from $B^{(k)}$ only by a matrix of rank 1. A symmetric rank-1 update can be written as<br>
$$\hspace{4.5cm}B^{k+1}=B^k+\eta uu^T\qquad\text{when}\;\;\eta\in\mathbb R\;\;\text{and}\;\;u\in\mathbb R^{n}$$<br><br>
There is only one symmetric rank-1 update which satisfies the quasi-Newton relation, namely<br>
$$\hspace{6cm}\large B^{k+1}=B^k+\frac{1}{(Y^k-B^kS^k)^TS^k}(Y^k-B^kS^k)(Y^k-B^kS^k)^T\small\hspace{3cm}----\;(11)$$<br>

$\qquad\underline{Note}:$ rank-1 matrix<br>
$\hspace{19mm}$If $\bar u\;$ and $\;\bar v$ are nonzero vectors, then $\bar u\bar v^T$ is a rank-1 matrix, e.g.<br>
$\qquad\quad\bar u=\left[\begin{array}{c}1\\2\end{array}\right]\;\;,\;\;$ $\bar v=\left[\begin{array}{c}3\\5\end{array}\right]\;\;\Rightarrow\;\;$ $\bar{u}\bar{v}^T=\left[\begin{array}{c}3 & 5\\6 & 10\end{array}\right]$ $\left.\begin{array}{c}r_1\\r_2\end{array}\right.\;\;,\;\;\;r_2=2r_1,\;\;\text{so the rank is}\;1$<br><br>
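The symmetric rank-1 update of eq. $(11)$ can be sketched as below, using exact rational arithmetic. The function name `sr1_update` and the convention of returning `None` when the denominator vanishes are assumptions made for this illustration. The data $S^{(1)},\;Y^{(1)}$ are taken from the worked example later in this section.

```python
from fractions import Fraction as F


def sr1_update(B, s, y):
    """Symmetric rank-1 update; returns None if the denominator vanishes."""
    n = len(s)
    Bs = [sum(B[i][j] * s[j] for j in range(n)) for i in range(n)]
    r = [y[i] - Bs[i] for i in range(n)]             # r = Y - B S
    denom = sum(r[i] * s[i] for i in range(n))       # (Y - B S)^T S
    if denom == 0:
        return None                                  # update is undefined
    return [[B[i][j] + r[i] * r[j] / denom for j in range(n)] for i in range(n)]


I2 = [[F(1), F(0)], [F(0), F(1)]]
s = [F(-5, 6), F(-5, 12)]      # S^(1) from the worked example in this section
y = [F(-5, 3), F(-5, 3)]       # Y^(1) from the worked example in this section
B_sr1 = sr1_update(I2, s, y)

# Quasi-Newton relation: B_sr1 applied to s should reproduce y exactly
Bs_new = [sum(B_sr1[i][j] * s[j] for j in range(2)) for i in range(2)]
```

Calling `sr1_update(I2, s, s)` returns `None`, because $Y-BS=0$ makes the denominator zero.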

$\qquad$If the denominator $(Y^k-B^kS^k)^TS^k$ in eq. $(11)$ is zero, $B^{k+1}$ is not defined. Even if the denominator is nonzero, the updated Hessian approximation $B^{k+1}$ may not be positive definite. Therefore a symmetric rank-2 correction is added to $B^{(k)}$ instead, which keeps $B^{k+1}$ positive definite provided $S^{(k)T}Y^{(k)}>0.$ We obtain the Broyden family as<br>
$$\large B^{(k+1)}=B^{(k)}+\frac{Y^{(k)}Y^{(k)T}}{S^{(k)T}Y^{(k)}}-\frac{B^{(k)}S^{(k)}S^{(k)T}B^{(k)}}{S^{(k)T}B^{(k)}S^{(k)}}+\phi(S^{(k)T}B^{(k)}S^{(k)})W^{(k)}W^{(k)T}$$<br>
$$\large\text{when}\quad W^{(k)}=\frac{Y^{(k)}}{S^{(k)T}Y^{(k)}}-\frac{B^{(k)}S^{(k)}}{S^{(k)T}B^{(k)}S^{(k)}}\;\;\text{and}\;\;\phi\in[0,1]\hspace{7cm}$$<br>

$\qquad$For $\phi=0,$ we obtain the so-called $\underline{BFGS\;update},$ which was discovered independently by Broyden, Fletcher, Goldfarb, and Shanno in 1970.<br>
$\qquad$For $\phi=1,$ we obtain the $\underline{DFP\;update},$ which was discovered by Davidon in 1959 and popularized by Fletcher and Powell in 1963. It is now widely believed that the BFGS update is the most effective update in the Broyden family.
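<br><br>
The Broyden family can be written as one small function with $\phi$ as a parameter. The sketch below is illustrative only: the name `broyden_update` is hypothetical, and exact fractions are used so that the quasi-Newton relation can be checked without rounding. The test data are $B^{(1)},\;S^{(1)},\;Y^{(1)}$ from the example that follows.

```python
from fractions import Fraction as F


def broyden_update(B, s, y, phi=0):
    """Broyden-family update: phi = 0 gives BFGS, phi = 1 gives DFP."""
    n = len(s)
    Bs = [sum(B[i][j] * s[j] for j in range(n)) for i in range(n)]
    sy = sum(s[i] * y[i] for i in range(n))           # S^T Y
    sBs = sum(s[i] * Bs[i] for i in range(n))         # S^T B S
    w = [y[i] / sy - Bs[i] / sBs for i in range(n)]   # W
    return [[B[i][j] + y[i] * y[j] / sy - Bs[i] * Bs[j] / sBs
             + phi * sBs * w[i] * w[j]
             for j in range(n)] for i in range(n)]


B1 = [[F(1), F(0)], [F(0), F(1)]]
s = [F(-5, 6), F(-5, 12)]
y = [F(-5, 3), F(-5, 3)]
B_bfgs = broyden_update(B1, s, y, phi=0)
B_dfp = broyden_update(B1, s, y, phi=1)
```

Since $W^{(k)T}S^{(k)}=0,$ the $\phi$ term does not change $B^{(k+1)}S^{(k)},$ so every member of the family satisfies the quasi-Newton relation $(8).$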

$\begin{align}
    \underline{Ex.}\qquad &Min.\quad\;\; f(\bar x)=x^2_1+2x^2_2\nonumber\\
    &\text{Starting pt.}\;\;x^{(1)}=\left[\begin{array}{c}1 \\ \frac{1}{4}\end{array}\right]\quad\text{and}\quad B^{(1)}=I=\left[\begin{array}{c}1 & 0 \\ 0 & 1\end{array}\right]\qquad\text{Using BFGS update and exact line search}\nonumber
\end{align}$<br><br>
$\Longrightarrow\qquad$The gradient $\quad g(\bar x)=\left[\begin{array}{c}2x_1 \\ 4x_2\end{array}\right]$<br>
$\hspace{18mm}$The Hessian $\quad G(\bar x)=\left[\begin{array}{c}2 & 0 \\ 0 & 4\end{array}\right]$<br>

$\hspace{18mm}\underline{Iteration\;1}:\quad$ at $\;\;x^{(1)}=\left[\begin{array}{c}1 \\ \frac{1}{4}\end{array}\right]\;;\qquad g^{(1)}=$ $\left[\begin{array}{c}2(1) \\ 4\big(\frac{1}{4}\big)\end{array}\right]=$ $\left[\begin{array}{c}2 \\ 1\end{array}\right]\quad,\quad B^{(1)}=$ $\left[\begin{array}{c}1 & 0 \\ 0 & 1\end{array}\right]$<br>

$\hspace{18mm}$Calculate the search direction, solve<br>
$$B^{(1)}d^{(1)}=-g^{(1)}$$<br>
$$Id^{(1)}=-g^{(1)}$$<br>
$$d^{(1)}=-g^{(1)}=\left[\begin{array}{c}-2 \\ -1\end{array}\right]$$<br>

$\hspace{18mm}$Do exact line search<br>
$\hspace{18mm}$Find $\alpha^{(1)}$ to minimize $f(x^{(1)}+\alpha d^{(1)}),$ that is<br>
$$g^T(x^{(1)}+\alpha d^{(1)})d^{(1)}=0$$<br>
$$\text{when}\qquad x^{(1)}+\alpha d^{(1)}=\left[\begin{array}{c}1 \\ \frac{1}{4}\end{array}\right]+\alpha
    \left[\begin{array}{c}-2 \\ -1\end{array}\right]\nonumber$$<br>
$\hspace{11.2cm}=\left[\begin{array}{c}1-2\alpha \\ \frac{1}{4}-\alpha\end{array}\right]$<br>
$$\begin{align}
    \text{So}\qquad\left[2(1-2\alpha)\quad 4\left(\frac{1}{4}-\alpha\right)\right]\left[\begin{array}{c}-2 \\ -1\end{array}\right] &= 0\nonumber \\
    -4(1-2\alpha)-4\left(\frac{1}{4}-\alpha\right) &= 0\nonumber \\
    12\alpha &= 5\nonumber \\
    \alpha &= \frac{5}{12}\nonumber
\end{align}$$<br>
$$\therefore\qquad\text{the minimizer is}\;\;\alpha^{(1)}=\frac{5}{12}\hspace{6cm}$$<br>
$$\begin{align}
    \text{The new point}\qquad x^{(2)} &= x^{(1)}+\alpha^{(1)}d^{(1)}\nonumber \\
    &= \left[\begin{array}{c}1 \\ \frac{1}{4}\end{array}\right]+\frac{5}{12}\left[\begin{array}{c}-2 \\ -1\end{array}\right]\nonumber \\
    &= \left[\begin{array}{c}\frac{1}{6} \\ - \frac{1}{6}\end{array}\right]\nonumber
\end{align}$$<br>
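For a quadratic objective the condition $g^T(x+\alpha d)\,d=0$ has the closed-form solution $\alpha=-\dfrac{g^Td}{d^TGd},$ because $g(x+\alpha d)=g(x)+\alpha Gd$ when $G$ is constant. The sketch below checks the step length found above with exact fractions; the function name `exact_step` is an assumption introduced for this illustration.

```python
from fractions import Fraction as F


def exact_step(G, g, d):
    """alpha = -(g^T d) / (d^T G d) for a quadratic with constant Hessian G."""
    n = len(d)
    gd = sum(g[i] * d[i] for i in range(n))
    dGd = sum(d[i] * G[i][j] * d[j] for i in range(n) for j in range(n))
    return -gd / dGd


G = [[2, 0], [0, 4]]                                   # Hessian of x1^2 + 2 x2^2

# Iteration 1: g = (2, 1), d = (-2, -1)
alpha1 = exact_step(G, [F(2), F(1)], [F(-2), F(-1)])

# Iteration 2 of the same example: g = (1/3, -2/3), d = (5/9)(-1, 1)
alpha2 = exact_step(G, [F(1, 3), F(-2, 3)], [F(-5, 9), F(5, 9)])
```

Here `alpha1` equals $\frac{5}{12}$ and `alpha2` equals $\frac{3}{10},$ matching the hand calculations in this example.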

$\hspace{18mm}\underline{Update\;the\;Hessian\;approximation}$<br>
$$\large B^{(2)}=B^{(1)}+\frac{Y^{(1)}Y^{(1)T}}{S^{(1)T}Y^{(1)}}-\frac{B^{(1)}S^{(1)}S^{(1)T}B^{(1)}}{S^{(1)T}B^{(1)}S^{(1)}}$$<br>
$\hspace{2.5cm}$ where<br>
$$S^{(1)}=x^{(2)}-x^{(1)}=(x^{(1)}+\alpha^{(1)}d^{(1)})-x^{(1)}=\alpha^{(1)}d^{(1)}=\frac{5}{12}\left[\begin{array}{c}-2 \\ -1\end{array}\right]$$<br>

$$Y^{(1)}=g^{(2)}-g^{(1)}=\left[\begin{array}{c}2\left(\frac{1}{6}\right) \\ 4\left(-\frac{1}{6}\right)\end{array}\right]-
    \left[\begin{array}{c}2 \\ 1\end{array}\right]=
    \left[\begin{array}{c}\frac{1}{3}-2 \\ -\frac{2}{3}-1\end{array}\right]=
    \frac{5}{3}\left[\begin{array}{c}-1 \\ -1\end{array}\right]$$<br>

$\hspace{2.5cm}$ and<br>
$$S^{(1)T}Y^{(1)}=\frac{5}{12}\left[\begin{array}{c}-2 & -1\end{array}\right]
    \;\frac{5}{3}\left[\begin{array}{c}-1 \\ -1\end{array}\right]=\left(\frac{25}{36}\right)(2+1)=\frac{25}{12}$$<br>

$\hspace{2.5cm}$ Since $\quad B^{(1)}S^{(1)}=IS^{(1)}=S^{(1)}\;,\;\;$ so<br>
$$S^{(1)T}B^{(1)}S^{(1)}=S^{(1)T}S^{(1)}=\frac{5}{12}\left[\begin{array}{c}-2 & -1\end{array}\right]
    \frac{5}{12}\left[\begin{array}{c}-2 \\ -1\end{array}\right]=
    \left(\frac{5}{12}\right)^2(4+1)=\frac{(5)^3}{(12)^2}$$<br>

$\hspace{2.5cm}$ Therefore,<br>
$$\begin{align}
    B^{(2)} &= \left[\begin{array}{c}1&0 \\ 0&1\end{array}\right]\;+\;\left(\frac{12}{25}\right)\left(\frac{5}{3}\right)
        \left[\begin{array}{c}-1 \\ -1\end{array}\right]\left(\frac{5}{3}\right)
        \left[\begin{array}{c}-1&-1\end{array}\right]\;-\;\frac{(12)^2}{(5)^3}\left(\frac{5}{12}\right)
        \left[\begin{array}{c}-2 \\ -1\end{array}\right]\left(\frac{5}{12}\right)
        \left[\begin{array}{c}-2 & -1\end{array}\right]\nonumber \\
    &= \left[\begin{array}{c}1&0 \\ 0&1\end{array}\right]\;+\;\frac{12}{9}
        \left[\begin{array}{c}-1 \\ -1\end{array}\right]
        \left[\begin{array}{c}-1 & -1\end{array}\right]\;-\;\frac{1}{5}
        \left[\begin{array}{c}-2 \\ -1\end{array}\right]
        \left[\begin{array}{c}-2 & -1\end{array}\right]\nonumber \\
    &= \left[\begin{array}{c}1&0 \\ 0&1\end{array}\right]\;+\;\frac{4}{3}
        \left[\begin{array}{c}1&1 \\ 1&1\end{array}\right]\;-\;\frac{1}{5}
        \left[\begin{array}{c}4&2 \\ 2&1\end{array}\right]\nonumber \\
    &= \left[\begin{array}{c}1+\frac{4}{3}-\frac{4}{5}&0+\frac{4}{3}-\frac{2}{5} \\ 0+\frac{4}{3}-\frac{2}{5}&1+
        \frac{4}{3}-\frac{1}{5}\end{array}\right]\nonumber \\
    &= \frac{1}{15}\left[\begin{array}{c}23&14 \\ 14&32\end{array}\right]
        \nonumber
\end{align}$$<br>

$\hspace{18mm}\underline{Iteration\;2}:\quad\;\;x^{(2)}=\left[\begin{array}{c}\frac{1}{6} \\ -\frac{1}{6}\end{array}\right]\;;\qquad g^{(2)}=$ $\left[\begin{array}{c}\frac{1}{3} \\ -\frac{2}{3}\end{array}\right]$ $\quad,\quad B^{(2)}=\frac{1}{15}\left[\begin{array}{c}23&14 \\ 14&32\end{array}\right]$<br>

$\hspace{18mm}$Search direction by solving<br>
$$B^{(2)}d^{(2)}=-g^{(2)}$$<br>
$$\frac{1}{15}\left[\begin{array}{c}23&14 \\ 14&32\end{array}\right]
    \left[\begin{array}{c}d_1 \\ d_2\end{array}\right]=
    -\left[\begin{array}{c}\frac{1}{3} \\ -\frac{2}{3}\end{array}\right]$$<br>
$$d^{(2)}=\left[\begin{array}{c}d_1 \\ d_2\end{array}\right]=\bigg(\frac{1}{15}
    \left[\begin{array}{c}23&14 \\ 14&32\end{array}\right]\bigg)^{-1}\;
    \left[\begin{array}{c}-\frac{1}{3} \\ \frac{2}{3}\end{array}\right]=\frac{15}{540}
    \left[\begin{array}{c}32&-14 \\ -14&23\end{array}\right]
    \left[\begin{array}{c}-\frac{1}{3} \\ \frac{2}{3}\end{array}\right]=\frac{5}{9}
    \left[\begin{array}{c}-1 \\ 1\end{array}\right]$$<br>
    
$\hspace{18mm}$Line search, solve<br>
$$g^T(x^{(2)}+\alpha d^{(2)})d^{(2)}=0$$<br>
$$\text{when}\qquad x^{(2)}+\alpha d^{(2)}=\left[\begin{array}{c}\frac{1}{6} \\ -\frac{1}{6}\end{array}\right]+
    \alpha\left(\frac{5}{9}\right)\left[\begin{array}{c}-1 \\ 1\end{array}\right]\nonumber$$<br>
$\hspace{10.8cm}=\left[\begin{array}{c}\frac{1}{6}-\frac{5}{9}\alpha \\ -\frac{1}{6}+\frac{5}{9}\alpha\end{array}\right]$<br>

$$\begin{align}
    \text{So}\qquad\left[2\left(\frac{1}{6}-\frac{5}{9}\alpha\right)\quad 4\left(-\frac{1}{6}+\frac{5}{9}\alpha\right)\right]\left(\frac{5}{9}\right)\left[\begin{array}{c}-1 \\ 1\end{array}\right] &= 0\nonumber \\
    -2\left(\frac{1}{6}-\frac{5}{9}\alpha\right)+4\left(-\frac{1}{6}+\frac{5}{9}\alpha\right) &= 0\nonumber \\
    \frac{30}{9}\alpha &= \frac{2}{3}+\frac{1}{3}\nonumber \\
    \alpha^{(2)} &= \frac{9}{30}\nonumber \\
    &= \frac{3}{10}\nonumber
\end{align}$$<br>

$$\begin{align}
    \text{The new point}\qquad x^{(3)} &= x^{(2)}+\alpha^{(2)}d^{(2)}\nonumber \\
    &= \left[\begin{array}{c}\frac{1}{6} \\ -\frac{1}{6}\end{array}\right]+\frac{3}{10}
        \left[\begin{array}{c}-\frac{5}{9} \\ \frac{5}{9}\end{array}\right]\nonumber \\
    &= \left[\begin{array}{c}0 \\ 0\end{array}\right]
        \nonumber
\end{align}$$<br>

$\hspace{18mm}$Therefore, $x^{(3)}=\left[\begin{array}{c}0 \\ 0\end{array}\right]=x^*$ is the global minimizer, since $g(x^{(3)})=0$ and $G$ is a constant positive definite matrix.<br><br>

$\hspace{18mm}\underline{Update\;the\;Hessian\;approximation}$<br>
$$\large B^{(3)}=B^{(2)}+\frac{Y^{(2)}Y^{(2)T}}{S^{(2)T}Y^{(2)}}-\frac{B^{(2)}S^{(2)}S^{(2)T}B^{(2)}}{S^{(2)T}B^{(2)}S^{(2)}}$$<br>
$\hspace{2.5cm}$ where<br>
$$S^{(2)}=x^{(3)}-x^{(2)}=(x^{(2)}+\alpha^{(2)}d^{(2)})-x^{(2)}=\alpha^{(2)}d^{(2)}=\left(\frac{3}{10}\right)
    \left(\frac{5}{9}\right)\left[\begin{array}{c}-1 \\ 1\end{array}\right]=\frac{1}{6}
    \left[\begin{array}{c}-1 \\ 1\end{array}\right]$$<br>

$$Y^{(2)}=g^{(3)}-g^{(2)}=\left[\begin{array}{c}0 \\ 0\end{array}\right]-
    \left[\begin{array}{c}\frac{1}{3} \\ -\frac{2}{3}\end{array}\right]=
    \left[\begin{array}{c}-\frac{1}{3} \\ \frac{2}{3}\end{array}\right]$$<br>

$\hspace{2.5cm}$ and<br>
$$S^{(2)T}Y^{(2)}=\frac{1}{6}\left[\begin{array}{c}-1 & 1\end{array}\right]
    \left[\begin{array}{c}-\frac{1}{3} \\ \frac{2}{3}\end{array}\right]=\frac{1}{6}\left(\frac{1}{3}+\frac{2}{3}\right)=\frac{1}{6}$$<br>
    
$\hspace{2.5cm}\begin{align}\text{Since}\quad B^{(2)}S^{(2)}=B^{(2)}\alpha^{(2)}d^{(2)}=\alpha^{(2)}B^{(2)}d^{(2)}
    &= \alpha^{(2)}(-g^{(2)})\nonumber \\
    &= -\frac{3}{10}\left[\begin{array}{c}\frac{1}{3} \\ -\frac{2}{3}\end{array}\right]\nonumber \\
    &= -\frac{1}{10}\left[\begin{array}{c}1 \\ -2\end{array}\right]\nonumber
\end{align}$<br>

$\hspace{2.5cm}$ So<br>
$$\qquad S^{(2)T}B^{(2)}S^{(2)}=\frac{1}{6}\left[\begin{array}{c}-1 & 1\end{array}\right]
    \left(-\frac{1}{10}\right)\left[\begin{array}{c}1 \\ -2\end{array}\right]=-\frac{1}{60}(-3)=\frac{1}{20}$$<br>

$\hspace{2.5cm}$ Therefore,<br>
$$\begin{align}
    B^{(3)} &= \frac{1}{15}\left[\begin{array}{c}23&14 \\ 14&32\end{array}\right]\;+\;6
        \left[\begin{array}{c}-\frac{1}{3} \\ \frac{2}{3}\end{array}\right]
        \left[\begin{array}{c}-\frac{1}{3}&\frac{2}{3}\end{array}\right]\;-\;(20)\left(-\frac{1}{10}\right)
        \left[\begin{array}{c}1 \\ -2\end{array}\right]\left(-\frac{1}{10}\right)
        \left[\begin{array}{c}1 & -2\end{array}\right]\nonumber \\
    &= \frac{1}{15}\left[\begin{array}{c}23&14 \\ 14&32\end{array}\right]\;+\;6
        \left[\begin{array}{c}\frac{1}{9}&-\frac{2}{9} \\ -\frac{2}{9}&\frac{4}{9}\end{array}\right]\;-\;20
        \left(\frac{1}{10}\right)^2\left[\begin{array}{c}1&-2 \\ -2&4\end{array}\right]\nonumber \\
    &= \frac{1}{15}\left[\begin{array}{c}23&14 \\ 14&32\end{array}\right]\;+\;\frac{2}{3}
        \left[\begin{array}{c}1&-2 \\ -2&4\end{array}\right]\;-\;\frac{1}{5}
        \left[\begin{array}{c}1&-2 \\ -2&4\end{array}\right]\nonumber \\
    &= \left[\begin{array}{c}\frac{23}{15}+\frac{2}{3}-\frac{1}{5}&\frac{14}{15}-\frac{4}{3}+\frac{2}{5} \\ \frac{14}{15}-
        \frac{4}{3}+\frac{2}{5}&\frac{32}{15}+\frac{8}{3}-\frac{4}{5}\end{array}\right]\nonumber \\
    &= \left[\begin{array}{c}2&0 \\ 0&4\end{array}\right]
        \nonumber
\end{align}$$<br>
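The result $B^{(3)}=G$ means that after two iterations the BFGS approximation has recovered the exact Hessian of this quadratic. As a cross-check, the whole example can be replayed in exact rational arithmetic. The helper names below (`matvec`, `dot`, `solve2`, `bfgs`) are hypothetical and specific to this $2\times2$ illustration. The step length uses $\alpha=-g^Td/(d^TGd),$ which is the exact line search for a quadratic with constant Hessian.

```python
from fractions import Fraction as F

G = [[F(2), F(0)], [F(0), F(4)]]           # Hessian of f(x) = x1^2 + 2 x2^2


def matvec(A, v):
    return [A[0][0] * v[0] + A[0][1] * v[1], A[1][0] * v[0] + A[1][1] * v[1]]


def dot(u, v):
    return u[0] * v[0] + u[1] * v[1]


def solve2(A, b):
    """Solve a 2x2 linear system by Cramer's rule."""
    det = A[0][0] * A[1][1] - A[0][1] * A[1][0]
    return [(b[0] * A[1][1] - A[0][1] * b[1]) / det,
            (A[0][0] * b[1] - b[0] * A[1][0]) / det]


def bfgs(B, s, y):
    """BFGS update (Broyden family with phi = 0)."""
    Bs, sy = matvec(B, s), dot(s, y)
    sBs = dot(s, Bs)
    return [[B[i][j] + y[i] * y[j] / sy - Bs[i] * Bs[j] / sBs
             for j in range(2)] for i in range(2)]


x = [F(1), F(1, 4)]                        # x^(1)
B = [[F(1), F(0)], [F(0), F(1)]]           # B^(1) = I
history = []
for k in (1, 2):
    g = matvec(G, x)                       # gradient (2 x1, 4 x2) equals G x
    d = solve2(B, [-g[0], -g[1]])          # B d = -g
    alpha = -dot(g, d) / dot(d, matvec(G, d))   # exact line search
    x_new = [x[0] + alpha * d[0], x[1] + alpha * d[1]]
    s = [x_new[0] - x[0], x_new[1] - x[1]]
    y = [a - b for a, b in zip(matvec(G, x_new), g)]
    B = bfgs(B, s, y)
    history.append((alpha, x_new, B))
    x = x_new
```

After the loop, `history` holds $(\alpha^{(k)},\,x^{(k+1)},\,B^{(k+1)})$ for $k=1,2,$ and the final `B` equals `G`.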