-
Notifications
You must be signed in to change notification settings - Fork 0
MATHEMATICS FOR NEURAL NETWORK
Problem 1
Let C :
\begin{align*} \min_{y \in \mathbb{R}^d: ||y||=1} y^T \cdot \nabla C(x_0) \end{align*}
Solution:
To solve the optimization problem \begin{align*} \min_{y \in \mathbb{R}^d: ||y||=1} y^T \cdot \nabla C(x_0) \end{align*}
Using the fact that for any unit vector
\begin{align*} y^T \cdot \nabla C(x_0) \end{align*}
is maximized when
This is because the dot product measures the projection of
Therefore, In obtaining the the optimal
We define our lagrange
Now, we set the two derivative to zero,
Putting this value in
Let's calculate the value of
Now, we evaluate
Therefore, the solution to the optimization problem is
\begin{align*} y^{'} = - \frac{\nabla C(x_0)}{||\nabla C(x_0)||}, \end{align*}
and the optimal value is
\begin{align*}
- ||\nabla C(x_0)||. \end{align*}