1. So far we took the rules for derivatives for granted. Using the definition and limits, prove the properties for (i) $f(x) = c$, (ii) $f(x) = x^n$, (iii) $f(x) = e^x$, and (iv) $f(x) = \log x$.

All of these use the definition
$$
f'(x) \;=\;\lim_{h\to0}\frac{f(x+h)-f(x)}{h}.
$$
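
As a quick numeric illustration of this definition (a minimal sketch, not part of the original solution), the difference quotient can be evaluated for shrinking $h$. The helper `difference_quotient` and the test function `cube` are hypothetical names chosen for this example.

```python
def difference_quotient(f, x, h):
    """Return (f(x + h) - f(x)) / h, the quantity whose limit defines f'(x)."""
    return (f(x + h) - f(x)) / h


def cube(x):
    """Example function f(x) = x**3 (an assumption for this sketch)."""
    return x ** 3


# Evaluate the quotient at x = 2 for h = 1e-1, ..., 1e-5.
# The exact derivative is 3 * 2**2 = 12, so the values should approach 12.
steps = [10.0 ** -k for k in range(1, 6)]
quotients = [difference_quotient(cube, 2.0, h) for h in steps]
for h, q in zip(steps, quotients):
    print(f"h = {h:.0e}: quotient = {q:.6f}")
```

The quotient never equals the derivative for any fixed $h>0$; it only approaches it, which is why the proofs below take the limit explicitly.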

##### (i) Constant Rule, $f(x)=C$

By definition,
$$
\frac{d}{dx} C = \lim_{h\to0}\frac{C - C}{h}
= \lim_{h\to0}\frac{0}{h}
= 0.
$$
Since the numerator is identically zero, the limit is zero.

In [1]:
import sympy as sp

x, C = sp.symbols('x C')
f = C
sp.diff(f, x)
# → 0

0

##### (ii) Power Rule, $f(x) = x^n$
**a)** $n$ a positive integer

Use the binomial expansion:
$$
(x+h)^n
= \sum_{k=0}^n \binom nk x^{\,n-k}h^k
= x^n + n x^{\,n-1}h + \sum_{k=2}^n\binom nk x^{\,n-k}h^k.
$$
Then
$$
\frac{(x+h)^n - x^n}{h}
= n\,x^{\,n-1} + O(h),
$$
so taking $h\to0$ gives
$$
\frac{d}{dx}x^n = n\,x^{\,n-1}.
$$
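
The binomial argument can be checked for a concrete case. The sketch below (using only `math.comb` from the standard library; the function name `quotient_coefficients` is a hypothetical choice) lists the coefficients of $\frac{(x+h)^n - x^n}{h}$ as a polynomial in $h$. The constant term is $n\,x^{n-1}$, and every other term carries at least one factor of $h$.

```python
from math import comb


def quotient_coefficients(x, n):
    """Coefficients c_0, c_1, ... of ((x + h)**n - x**n) / h as a polynomial in h.

    The binomial term C(n, k) * x**(n - k) * h**k (k >= 1) contributes to the
    h**(k - 1) coefficient after dividing by h; the k = 0 term cancels x**n.
    """
    return [comb(n, k) * x ** (n - k) for k in range(1, n + 1)]


coeffs = quotient_coefficients(3, 4)  # x = 3, n = 4
print(coeffs)  # [108, 54, 12, 1]; the first entry is 4 * 3**3 = 108

# Cross-check against the quotient itself at h = 2: ((3 + 2)**4 - 3**4) / 2 = 272
h = 2
print(sum(c * h ** i for i, c in enumerate(coeffs)))  # 272
```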

**b)** $n$ a negative integer

Write $n=-m$ with $m>0$, so that $x^n = 1/x^m = (x^m)^{-1}$ for $x\neq0$. By the chain rule (or the quotient rule), using part a) for $\frac{d}{dx}x^m$,
$$
\frac{d}{dx}x^{-m}
= -\,(x^m)^{-2}\,(m\,x^{\,m-1})
= -m\,x^{-2m+m-1}
= -m\,x^{-(m+1)}
= n\,x^{\,n-1}.
$$

In [2]:
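x = sp.symbols('x')
n = sp.symbols('n', integer=True, positive=True)  # only the exponent is an integer
f = x**n
sp.diff(f, x)
# → n*x**n/x, which equals n*x**(n-1); the printed form may vary by SymPy version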


n*x**n/x

##### (iii) Exponential Rule $f(x) = e^x$
Via the power series (it converges for all $x$, so it may be differentiated term by term):

$$
e^x = \sum_{k=0}^\infty \frac{x^k}{k!}
\;\implies\;
\frac{d}{dx}e^x
= \sum_{k=1}^\infty \frac{k x^{k-1}}{k!}
= \sum_{j=0}^\infty \frac{x^j}{j!}
= e^x.
$$

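Via the limit definition, using $\lim_{h\to0}\frac{e^h-1}{h}=1$ (which follows from the series $e^h = 1 + h + \frac{h^2}{2!} + \dots$):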

$$
\frac{d}{dx}e^x
=\lim_{h\to0}\frac{e^{x+h}-e^x}{h}
=e^x\lim_{h\to0}\frac{e^h-1}{h}
=e^x\cdot1
= e^x.
$$
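
The key limit $\lim_{h\to0}\frac{e^h-1}{h}=1$ can be illustrated numerically. This is a minimal sketch, not a proof; the function name `exp_quotient` is a hypothetical choice, and `math.expm1` is used so that $e^h - 1$ is not lost to floating-point cancellation for tiny $h$.

```python
import math


def exp_quotient(h):
    """Return (e**h - 1) / h; math.expm1(h) computes e**h - 1 accurately near 0."""
    return math.expm1(h) / h


# The quotient should approach 1 from both sides as h shrinks.
for h in (1e-1, 1e-3, 1e-6, -1e-6):
    print(f"h = {h:+.0e}: (e^h - 1)/h = {exp_quotient(h):.8f}")
```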

In [3]:
f = sp.exp(x)
sp.diff(f, x)
# → exp(x)


exp(x)

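##### (iv) Logarithm Rule, $f(x) = \log x$
Here $\log$ denotes the natural logarithm $\ln$, defined for $x>0$. Since $y = \ln x$ is the inverse of $x = e^y$, differentiating $x = e^y$ with respect to $x$ and applying the chain rule gives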
$$
1 = \frac{d}{dx}(e^y)
  = e^y\frac{dy}{dx}
  = x\frac{dy}{dx}
\quad\Longrightarrow\quad
\frac{dy}{dx} = \frac1x.
$$
Hence
$$
\frac{d}{dx}\ln x = \frac1x.
$$
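
The argument above relies on the chain rule. A derivation directly from the limit definition is also possible; the sketch below assumes the standard limit $\lim_{t\to0}(1+t)^{1/t}=e$ and the continuity of $\ln$, neither of which is proved here. For $x>0$, substitute $t=h/x$:

$$
\frac{d}{dx}\ln x
=\lim_{h\to0}\frac{\ln(x+h)-\ln x}{h}
=\lim_{h\to0}\frac1h\ln\Bigl(1+\frac hx\Bigr)
=\frac1x\lim_{t\to0}\ln\bigl((1+t)^{1/t}\bigr)
=\frac1x\ln e
=\frac1x.
$$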

In [4]:
f = sp.log(x)
sp.diff(f, x)
# → 1/x


1/x

2. In the same vein, prove the product, sum, and quotient rules from first principles.
##### Product Rule
$$
\frac{d}{dx} [f(x)g(x)] = f(x) \frac{d}{dx} g(x) + g(x) \frac{d}{dx} f(x) \quad \text{Product rule}
$$
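**Proof:** Since $g$ is differentiable, it is continuous, so $g(x+h)\to g(x)$ as $h\to0$; this is used in the fourth line below.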
$$
\begin{aligned}
\frac{d}{dx}\bigl[f(x)g(x)\bigr]
&=\lim_{h\to0}\frac{f(x+h)g(x+h)-f(x)g(x)}{h}\\
&=\lim_{h\to0}\frac{f(x+h)g(x+h)-f(x)g(x+h)+f(x)g(x+h)-f(x)g(x)}{h}\\
&=\lim_{h\to0}\frac{\bigl[f(x+h)-f(x)\bigr]\,g(x+h)}{h}
  +\lim_{h\to0}\frac{f(x)\,\bigl[g(x+h)-g(x)\bigr]}{h}\\
&=\Bigl(\lim_{h\to0}\frac{f(x+h)-f(x)}{h}\Bigr)\,g(x)
  +f(x)\,\Bigl(\lim_{h\to0}\frac{g(x+h)-g(x)}{h}\Bigr)\\
&=f'(x)\,g(x)+f(x)\,g'(x).
\end{aligned}
$$


In [5]:
h = x**2 * sp.sin(x)
sp.diff(h, x)
# → x**2*cos(x) + 2*x*sin(x)


x**2*cos(x) + 2*x*sin(x)

##### Sum Rule
$$
\frac{d}{dx} [f(x) + g(x)] = \frac{d}{dx} f(x) + \frac{d}{dx} g(x) \quad \text{Sum rule}
$$
**Proof:**
For any two differentiable functions $f,g$:
$$
\frac{d}{dx}\bigl[f(x)+g(x)\bigr]
=\lim_{h\to0}\frac{[f(x+h)+g(x+h)]-[f(x)+g(x)]}{h}
=\lim_{h\to0}\frac{f(x+h)-f(x)}{h}
  +\lim_{h\to0}\frac{g(x+h)-g(x)}{h}
=f'(x)+g'(x).
$$

In [6]:
f = x**3 + sp.exp(x)
sp.diff(f, x)
# → 3*x**2 + exp(x)


3*x**2 + exp(x)

##### Quotient Rule
$$
\frac{d}{dx} \left( \frac{f(x)}{g(x)} \right) = \frac{g(x) \frac{d}{dx} f(x) - f(x) \frac{d}{dx} g(x)}{g^2(x)} \quad \text{Quotient rule}
\tag{2.4.4}
$$
**Proof:**
Assume $g(x)\neq0$.  Write
$$
\frac{f(x)}{g(x)} = f(x)\,\bigl[g(x)\bigr]^{-1}.
$$
Then by the product rule and the chain rule (derivative of $u^{-1}$ is $-u^{-2}u'$):
$$
\frac{d}{dx}\frac{f}{g}
=\frac{d}{dx}\bigl(f\cdot g^{-1}\bigr)
=f'\,g^{-1}+f\;\bigl(-g^{-2}g'\bigr)
=\frac{f'}{g}-\frac{f\,g'}{g^2}
=\frac{g\,f' - f\,g'}{g^2}.
$$

In [7]:
q = x**2/(1+x)
sp.diff(q, x)
# → -x**2/(x + 1)**2 + 2*x/(x + 1), which simplifies to x*(x + 2)/(x + 1)**2


-x**2/(x + 1)**2 + 2*x/(x + 1)
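
The product and quotient rules can also be cross-checked numerically, independently of SymPy. The sketch below is an illustration under stated assumptions: $f=\sin$, $g=\exp$, the evaluation point `x0 = 0.7`, and the helper name `central_diff` are all arbitrary choices for this example.

```python
import math


def central_diff(fn, x, h=1e-6):
    """Approximate fn'(x) with the symmetric quotient (fn(x+h) - fn(x-h)) / (2h)."""
    return (fn(x + h) - fn(x - h)) / (2 * h)


f, df = math.sin, math.cos  # f and its known derivative
g, dg = math.exp, math.exp  # g and its known derivative (g is never zero)
x0 = 0.7

# Product rule: (f g)' = f' g + f g'
product_numeric = central_diff(lambda t: f(t) * g(t), x0)
product_rule = df(x0) * g(x0) + f(x0) * dg(x0)

# Quotient rule: (f / g)' = (g f' - f g') / g**2
quotient_numeric = central_diff(lambda t: f(t) / g(t), x0)
quotient_rule = (g(x0) * df(x0) - f(x0) * dg(x0)) / g(x0) ** 2

print(product_numeric, product_rule)
print(quotient_numeric, quotient_rule)
```

Agreement at one point does not prove the rules, but a mismatch would reveal a sign or ordering mistake in the formulas.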

3. Prove that the constant multiple rule follows as a special case of the product rule.

**Statement.** Show that for any constant $C$ and differentiable function $g(x)$,
$$
\frac{d}{dx}\bigl[C\,g(x)\bigr] \;=\; C\,g'(x).
$$

**Proof.**  
Treat $C$ as the constant function $f(x)=C$, so $f'(x)=0$.  By the product rule,
$$
\frac{d}{dx}\bigl[f(x)\,g(x)\bigr]
= f'(x)\,g(x) + f(x)\,g'(x)
= 0\cdot g(x) + C\cdot g'(x)
= C\,g'(x).
$$
Hence the constant multiple rule follows directly.

In [8]:
C, x = sp.symbols('C x')
g = sp.sin(x)
sp.diff(C*g, x)
# → C*cos(x)


C*cos(x)

Exercises 4:

For a real exponent $b$, the power $a^b$ is defined as $e^{b\ln a}$, which requires the base $a$ to be a positive real number. (Already with a rational exponent $\frac{p}{q}$, $q \neq 0$, a negative base can leave the real numbers, e.g. $(-1)^{1/2}$.)

Therefore, for $x^x$ we restrict attention to $x>0$.

Use the facts that

(1): $y=e^{\ln y}$

and

(2): $\ln(a^b)=b\ln a$

to get

$$
x^x = e^{\ln(x^x)} = e^{x\ln x},
$$

where the first equality uses (1) and the second uses (2).

By the chain rule, followed by the product rule for $x\ln x$:

$$
\begin{aligned}
\frac{d}{dx}(x^x)
&=\frac{d}{dx}\bigl(e^{x\ln x}\bigr)\\
&=e^{x\ln x}\,\frac{d}{dx}(x\ln x)\\
&=e^{x\ln x}\Bigl(1\cdot\ln x+x\cdot\frac{1}{x}\Bigr)\\
&=e^{x\ln x}(\ln x+1).
\end{aligned}
$$

Thus the final answer is $x^x(\ln x+1)$.
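
A numeric sanity check of this result (a minimal sketch; the function names and the evaluation point `x0 = 2.0` are hypothetical choices) compares the closed form $x^x(\ln x + 1)$ with a symmetric difference quotient.

```python
import math


def x_to_x(x):
    """f(x) = x**x, defined here for x > 0."""
    return x ** x


def x_to_x_derivative(x):
    """Closed form derived above: x**x * (ln x + 1)."""
    return x ** x * (math.log(x) + 1)


x0, h = 2.0, 1e-6
numeric = (x_to_x(x0 + h) - x_to_x(x0 - h)) / (2 * h)
analytic = x_to_x_derivative(x0)  # equals 4 * (ln 2 + 1)
print(numeric, analytic)
```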

Exercises 5:

The derivative is the rate of change of a function with respect to changes in its argument.

If $f'(x) = 0$ for some $x$, then at that point the instantaneous rate of change of $f$ with respect to $x$ is zero: the tangent to the graph of $f$ is horizontal there.

Such a point is called a stationary (critical) point. It is often a local extremum (a maximum or a minimum), but not always: for $f(x)=x^3$ we have $f'(0)=0$, yet $x=0$ is neither a maximum nor a minimum.

Example:

For $f(x)=x^2$, the derivative is $f'(x)=2x$. Setting $f'(x)=2x=0$ gives $x=0$, which is a local minimum because the function changes from decreasing to increasing there.
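
To see how $f'(x)=0$ plays out in concrete cases, the sketch below classifies a stationary point by the sign of the derivative just to its left and right. It is a rough heuristic rather than a rigorous test: the name `classify_stationary_point` and the offset `eps` are assumptions made for this example, and $f(x)=x^3$ is included as a case where the derivative vanishes without an extremum.

```python
def classify_stationary_point(df, x0, eps=1e-3):
    """Classify x0 (assumed to satisfy df(x0) == 0) by the sign of df on both sides."""
    left, right = df(x0 - eps), df(x0 + eps)
    if left < 0 < right:
        return "local minimum"  # decreasing, then increasing
    if left > 0 > right:
        return "local maximum"  # increasing, then decreasing
    return "neither (no sign change)"


result_square = classify_stationary_point(lambda x: 2 * x, 0.0)     # f(x) = x**2
result_cube = classify_stationary_point(lambda x: 3 * x ** 2, 0.0)  # f(x) = x**3
print(result_square)  # local minimum
print(result_cube)    # neither (no sign change)
```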


Exercises 6:

In [None]:
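import numpy as np  # `plot` is assumed to be the helper defined in the book's section
ff = lambda x: x ** 3 - 1 / x
x = np.arange(0.1, 3, 0.1)  # start at 0.1 because ff is undefined at x = 0
# Tangent at x = 1: ff(1) = 0 and ff'(1) = 3 + 1 = 4, so y = 4 * (x - 1) = 4 * x - 4
plot(x, [ff(x), 4 * x - 4], 'x', 'f(x)', legend=['f(x)', 'Tangent line (x=1)'])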

7.

We have $f(\mathbf x) = 3x_{1}^2 + 5e^{x_2}$, $\mathbf x = (x_1, x_2)^{T}$

Then
- $\partial_{x_1}f(\mathbf x) = 6x_1$
- $\partial_{x_2}f(\mathbf x) = 5e^{x_2}$

According to 2.4.8, 
$$
\nabla_{\mathbf x}f(\mathbf x)
=
\begin{bmatrix}
\partial_{x_1}f(\mathbf x)\\
\partial_{x_2}f(\mathbf x)
\end{bmatrix}
=
\begin{bmatrix}
6x_1 \\
5e^{x_2}
\end{bmatrix}.
$$
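
The gradient can be cross-checked with finite differences. This is a minimal sketch using only the standard library; the helper `numeric_grad` and the evaluation point `(1.0, 0.5)` are hypothetical choices for the example.

```python
import math


def f(x1, x2):
    """f(x) = 3 * x1**2 + 5 * exp(x2)."""
    return 3 * x1 ** 2 + 5 * math.exp(x2)


def grad_f(x1, x2):
    """Analytic gradient derived above: [6 * x1, 5 * exp(x2)]."""
    return [6 * x1, 5 * math.exp(x2)]


def numeric_grad(fn, point, h=1e-6):
    """Approximate each partial derivative with a symmetric difference quotient."""
    grad = []
    for i in range(len(point)):
        up, down = list(point), list(point)
        up[i] += h
        down[i] -= h
        grad.append((fn(*up) - fn(*down)) / (2 * h))
    return grad


point = (1.0, 0.5)
analytic = grad_f(*point)
numeric = numeric_grad(f, point)
print(analytic, numeric)
```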

8.

We have $f(\mathbf x) = \|\mathbf x\|_{2} = \sqrt {x_1^2 + x_2^2 + \dots + x_n^2}$

If $\mathbf x \neq \mathbf 0$, the chain rule gives $\partial_{x_i}f(\mathbf x) = \frac{2x_i}{2\sqrt {x_1^2 + x_2^2 + \dots + x_n^2}} = \frac{x_i}{\|\mathbf x\|_2}$, so

$$
\nabla_{\mathbf x}f(\mathbf x)
=
\begin{bmatrix}
\partial_{x_1}f(\mathbf x)\\
\partial_{x_2}f(\mathbf x)\\
\vdots\\
\partial_{x_n}f(\mathbf x)
\end{bmatrix}
=
\begin{bmatrix}
\frac{x_1}{\sqrt {x_1^2 + x_2^2 + \dots + x_n^2}} \\
\frac{x_2}{\sqrt {x_1^2 + x_2^2 + \dots + x_n^2}} \\
\vdots \\
\frac{x_n}{\sqrt {x_1^2 + x_2^2 + \dots + x_n^2}}
\end{bmatrix}
=
\frac{1}{\sqrt {x_1^2 + x_2^2 + \dots + x_n^2}}
\begin{bmatrix}
x_1 \\
x_2 \\
\vdots \\
x_n
\end{bmatrix}
= \frac{\mathbf x}{\|\mathbf x\|_2}
$$

If $\mathbf x = \mathbf 0$,

$$
\frac{\partial f}{\partial x_i}
=
\lim_{h \to 0} \frac{f(0,...,0,h,0,...,0)-f(\mathbf 0)}{h}
=
\lim_{h \to 0} \frac{\|(0,...,0,h,0,...,0)\|_2-0}{h}
$$

$$
\frac{\partial f}{\partial x_i}
=
\lim_{h \to 0} \frac{\sqrt {0^2+\dots+0^2+h^2+0^2+\dots+0^2}}{h}
=
\lim_{h \to 0} \frac{\sqrt {h^2}}{h}
=
\lim_{h \to 0} \frac{|h|}{h}
$$

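Because the one-sided limits differ,
$$
\lim_{h \to 0^+} \frac{|h|}{h} = 1,
\qquad
\lim_{h \to 0^-} \frac{|h|}{h} = -1,
$$
the limit does not exist, so the partial derivatives are undefined at $\mathbf x = \mathbf 0$ and $f$ is not differentiable there.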
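
Both cases can be illustrated numerically. The sketch below uses the fact that each partial derivative of the Euclidean norm at a nonzero point is $x_i/\|\mathbf x\|_2$ (the factor 2 from the chain rule cancels the $\tfrac12$ from the square root), and it evaluates the one-sided difference quotients at the origin. The helper names and the test point `[3.0, 4.0]` are hypothetical choices.

```python
import math


def l2_norm(v):
    """Euclidean norm of a list of floats."""
    return math.sqrt(sum(c * c for c in v))


def numeric_partial(fn, point, i, h=1e-6):
    """Symmetric difference quotient for the i-th partial derivative."""
    up, down = list(point), list(point)
    up[i] += h
    down[i] -= h
    return (fn(up) - fn(down)) / (2 * h)


point = [3.0, 4.0]  # the norm of this point is 5
numeric = [numeric_partial(l2_norm, point, i) for i in range(len(point))]
analytic = [c / l2_norm(point) for c in point]  # x / ||x||_2
print(numeric, analytic)

# One-sided difference quotients along the first axis at the origin
step = 1e-8
right = (l2_norm([step, 0.0]) - l2_norm([0.0, 0.0])) / step
left = (l2_norm([-step, 0.0]) - l2_norm([0.0, 0.0])) / -step
print(right, left)  # close to +1 and -1, so no two-sided limit exists
```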

#### **Problem 9: Can you write out the chain rule for the case where $u = f(x, y, z)$, where $x = x(a, b)$, $y = y(a, b)$, and $z = z(a, b)$**

According to the **multivariable chain rule**:

- Partial derivative of $u$ with respect to $a$:
  
  $$
  \frac{\partial u}{\partial a} = \frac{\partial f}{\partial x} \frac{\partial x}{\partial a} + \frac{\partial f}{\partial y} \frac{\partial y}{\partial a} + \frac{\partial f}{\partial z} \frac{\partial z}{\partial a}
  $$

- Partial derivative of $u$ with respect to $b$:

  $$
  \frac{\partial u}{\partial b} = \frac{\partial f}{\partial x} \frac{\partial x}{\partial b} + \frac{\partial f}{\partial y} \frac{\partial y}{\partial b} + \frac{\partial f}{\partial z} \frac{\partial z}{\partial b}
  $$
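
A concrete instance makes the two formulas easy to verify. In the sketch below, every specific function is a hypothetical choice made for illustration: $f(x,y,z)=xy+z^2$ with $x=ab$, $y=a+b$, $z=a-b$. The chain-rule values are compared with finite differences of the composed function.

```python
def f(x, y, z):
    """Hypothetical outer function f(x, y, z) = x * y + z**2."""
    return x * y + z ** 2


def inner(a, b):
    """Hypothetical inner functions x(a, b), y(a, b), z(a, b)."""
    return a * b, a + b, a - b


def u(a, b):
    """Composition u(a, b) = f(x(a, b), y(a, b), z(a, b))."""
    return f(*inner(a, b))


def chain_rule_partials(a, b):
    """Evaluate du/da and du/db with the multivariable chain rule."""
    x, y, z = inner(a, b)
    df_dx, df_dy, df_dz = y, x, 2 * z             # partials of f
    du_da = df_dx * b + df_dy * 1 + df_dz * 1     # dx/da = b, dy/da = 1, dz/da = 1
    du_db = df_dx * a + df_dy * 1 + df_dz * (-1)  # dx/db = a, dy/db = 1, dz/db = -1
    return du_da, du_db


a0, b0, h = 2.0, 3.0, 1e-6
numeric_da = (u(a0 + h, b0) - u(a0 - h, b0)) / (2 * h)
numeric_db = (u(a0, b0 + h) - u(a0, b0 - h)) / (2 * h)
print(chain_rule_partials(a0, b0))  # (19.0, 18.0)
print(numeric_da, numeric_db)
```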

#### **Problem 10: Given a differentiable and invertible function $f(x)$, compute the derivative of its inverse $f^{-1}(x)$. Use the identity $f^{-1}(f(x)) = x$ and conversely $f(f^{-1}(y)) = y$ to support your derivation.**

We are given that:
- $f$ is differentiable and invertible,
- $f(f^{-1}(y)) = y$ and $f^{-1}(f(x)) = x$.

To compute the derivative of $f^{-1}(x)$, start from the identity:

$$
f(f^{-1}(x)) = x
$$

Differentiate both sides with respect to $x$:

$$
\frac{d}{dx} \left[ f(f^{-1}(x)) \right] = \frac{d}{dx}(x)
$$

Apply the chain rule:

$$
f'(f^{-1}(x)) \cdot (f^{-1})'(x) = 1
$$

Solve for $(f^{-1})'(x)$, assuming $f'(f^{-1}(x)) \neq 0$:

$$
(f^{-1})'(x) = \frac{1}{f'(f^{-1}(x))}
$$
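
The formula can be checked on a function whose inverse is available in the standard library. The sketch below takes $f=\sinh$ as an illustrative assumption, so $f'=\cosh$ and $f^{-1}=\operatorname{asinh}$; the helper name `inverse_derivative` is hypothetical.

```python
import math


def inverse_derivative(f_prime, f_inverse, x):
    """Evaluate 1 / f'(f^{-1}(x)); assumes f'(f^{-1}(x)) is nonzero."""
    return 1.0 / f_prime(f_inverse(x))


x0, h = 1.5, 1e-6
from_formula = inverse_derivative(math.cosh, math.asinh, x0)
numeric = (math.asinh(x0 + h) - math.asinh(x0 - h)) / (2 * h)
closed_form = 1.0 / math.sqrt(1.0 + x0 ** 2)  # known derivative of asinh
print(from_formula, numeric, closed_form)
```

All three values should agree up to finite-difference error, since $\cosh(\operatorname{asinh} x)=\sqrt{1+x^2}$.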
