In Chapter 2, many references to the gradient descent algorithm cite equation (2.1), but the correct equation number is (2.11).