# Calculus for Artificial Intelligence

In-depth explanations and examples of calculus topics relevant to AI.

## Differential Calculus

### Functions and Limits

**Functions:**
A function $f: X \to Y$ maps each element of the set $X$ to exactly one element of the set $Y$. For example, $f(x) = x^2$ maps each real number $x$ to its square.

**Limits:**
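The limit of $f(x)$ as $x$ approaches $a$ is $L$, written $\lim_{x \to a} f(x) = L$, if $f(x)$ can be made arbitrarily close to $L$ by taking $x$ sufficiently close to (but not equal to) $a$. The limit describes how a function behaves near a point, regardless of its value at that point.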

### Derivatives

**Definition of a Derivative:**
The derivative of $f$ at $x$, denoted $f'(x)$ or $\frac{df}{dx}$, measures the rate of change of $f$ with respect to $x$:
$$ f'(x) = \lim_{h \to 0} \frac{f(x+h) - f(x)}{h} $$
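As a rough illustration of the limit definition, the difference quotient can be evaluated with a small but nonzero $h$. The helper name `forward_difference` and the step size `1e-6` are assumptions chosen for this sketch, not part of any library.

```python
def forward_difference(f, x, h=1e-6):
    """Approximate f'(x) using the difference quotient with a small h."""
    return (f(x + h) - f(x)) / h


def square(x):
    return x ** 2


# The exact derivative of x^2 is 2x, so the value at x = 3 should be close to 6.
approx = forward_difference(square, 3.0)
print(approx)
```

The result is only an approximation: a smaller `h` reduces truncation error but eventually increases floating-point rounding error.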

**Rules of Differentiation:**
- **Product Rule:** $(uv)' = u'v + uv'$
- **Quotient Rule:** $\left( \frac{u}{v} \right)' = \frac{u'v - uv'}{v^2}$, where $v \neq 0$
- **Chain Rule:** $(f(g(x)))' = f'(g(x)) \cdot g'(x)$
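As a short worked example of the chain rule, take $h(x) = (3x + 1)^2$ with the outer function $f(u) = u^2$ and the inner function $g(x) = 3x + 1$ (both chosen here purely for illustration). Then $f'(u) = 2u$ and $g'(x) = 3$, so:
$$ h'(x) = f'(g(x)) \cdot g'(x) = 2(3x + 1) \cdot 3 = 18x + 6 $$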

**Partial Derivatives:**
For a multivariable function $f(x, y)$, the partial derivative with respect to $x$ is:
$$ \frac{\partial f}{\partial x} = \lim_{h \to 0} \frac{f(x+h, y) - f(x, y)}{h} $$

**Gradient:**
The gradient of $f$ is a vector of its partial derivatives:
$$ \nabla f = \left( \frac{\partial f}{\partial x_1}, \frac{\partial f}{\partial x_2}, \ldots, \frac{\partial f}{\partial x_n} \right) $$
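A gradient can be checked numerically by perturbing one coordinate at a time. The following is a minimal sketch using central differences; `numerical_gradient` and the example function `example_fn` are hypothetical names introduced for illustration.

```python
def numerical_gradient(f, point, h=1e-5):
    """Approximate the gradient of f at `point` with central differences."""
    grad = []
    for i in range(len(point)):
        forward = list(point)
        backward = list(point)
        forward[i] += h   # step forward along coordinate i only
        backward[i] -= h  # step backward along coordinate i only
        grad.append((f(forward) - f(backward)) / (2 * h))
    return grad


def example_fn(p):
    x, y = p
    return x ** 2 + 3 * x * y


# Analytically, the gradient is (2x + 3y, 3x), which equals (8, 3) at (1, 2).
grad_at_point = numerical_gradient(example_fn, [1.0, 2.0])
print(grad_at_point)
```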

### Applications
- **Optimization:** Finding the maxima and minima of functions.
- **Gradient Descent:** An iterative method to find local minima of a function by moving in the direction opposite to the gradient.
- **Taylor Series Expansion:** Approximating a function near a point by a polynomial whose coefficients are determined by the function's derivatives at that point.
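Gradient descent, as described in the list above, repeatedly applies the update $x \leftarrow x - \eta \, f'(x)$. The sketch below assumes a one-dimensional objective $f(x) = (x - 3)^2$, a fixed learning rate of `0.1`, and 100 steps; all of these are illustrative choices rather than recommended settings.

```python
def gradient_descent(grad, x0, learning_rate=0.1, steps=100):
    """Take `steps` steps opposite to the gradient, starting from x0."""
    x = x0
    for _ in range(steps):
        x = x - learning_rate * grad(x)
    return x


def grad_f(x):
    # Derivative of f(x) = (x - 3)^2, whose minimum is at x = 3.
    return 2 * (x - 3)


x_min = gradient_descent(grad_f, x0=0.0)
print(x_min)
```

For this particular function each step shrinks the distance to the minimum by a constant factor, so the iterate ends up very close to 3. On non-convex functions the same procedure may only reach a local minimum.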

## Integral Calculus

### Indefinite Integrals

**Antiderivatives:**
An antiderivative of $f$ is a function $F$ such that $F' = f$. The indefinite integral $\int f(x) \, dx$ denotes the whole family of antiderivatives. For example, $\int x^2 \, dx = \frac{x^3}{3} + C$, where $C$ is the constant of integration.

**Basic Integration Rules:**
- **Power Rule:** $\int x^n \, dx = \frac{x^{n+1}}{n+1} + C$ for $n \neq -1$
- **Integration by Parts:** $\int u \, dv = uv - \int v \, du$
- **Substitution:** $\int f(g(x))g'(x) \, dx = \int f(u) \, du$, where $u = g(x)$
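As a worked example of integration by parts, consider $\int x e^x \, dx$ with the illustrative choice $u = x$ and $dv = e^x \, dx$, so that $du = dx$ and $v = e^x$:
$$ \int x e^x \, dx = x e^x - \int e^x \, dx = x e^x - e^x + C $$
Differentiating the result with the product rule gives $e^x + x e^x - e^x = x e^x$, which confirms the answer.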

### Definite Integrals

**Definition and Properties:**
The definite integral of $f$ from $a$ to $b$ is:
$$ \int_a^b f(x) \, dx $$
It represents the signed area between the curve of $f$ and the $x$-axis from $x = a$ to $x = b$: regions where $f$ is negative count as negative area.

**Numerical Integration:**
- **Trapezoidal Rule:** Approximating the area under $f$ as a series of trapezoids.
- **Simpson's Rule:** Using parabolic segments to approximate the area.
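Both rules listed above can be sketched in a few lines. The function names `trapezoidal` and `simpson` and the default of 100 subintervals are assumptions made for this example; the composite forms on a uniform grid are used.

```python
def trapezoidal(f, a, b, n=100):
    """Composite trapezoidal rule with n equal subintervals."""
    h = (b - a) / n
    total = 0.5 * (f(a) + f(b))
    for i in range(1, n):
        total += f(a + i * h)
    return total * h


def simpson(f, a, b, n=100):
    """Composite Simpson's rule; n must be even."""
    if n % 2 != 0:
        raise ValueError("n must be even for Simpson's rule")
    h = (b - a) / n
    total = f(a) + f(b)
    for i in range(1, n):
        weight = 4 if i % 2 == 1 else 2  # odd interior points get weight 4
        total += weight * f(a + i * h)
    return total * h / 3


def square(x):
    return x ** 2


# The exact value of the integral of x^2 over [0, 1] is 1/3.
print(trapezoidal(square, 0.0, 1.0))
print(simpson(square, 0.0, 1.0))
```

Simpson's rule is exact (up to rounding) for polynomials of degree three or lower, so it reproduces $1/3$ here, while the trapezoidal rule carries a small error that shrinks as `n` grows.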

### Applications
- **Probability:** Computing cumulative distribution functions.
- **Expectation:** Calculating expected values in probability theory.
- **Area and Volume Calculations:** Used in spatial data analysis.

## Multivariable Calculus

### Functions of Several Variables

**Vector-Valued Functions:**
Functions with vector outputs, e.g., $\mathbf{r}(t) = \langle x(t), y(t), z(t) \rangle$.

**Level Sets:**
Contours where a multivariable function takes on constant values, e.g., $f(x, y) = c$.

### Partial Derivatives and Gradients

**Gradient Vector:**
For $f(x, y)$, the gradient is:
$$ \nabla f = \left( \frac{\partial f}{\partial x}, \frac{\partial f}{\partial y} \right) $$

**Directional Derivatives:**
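The rate of change of $f$ in the direction of a unit vector $\mathbf{v}$ (with $\|\mathbf{v}\| = 1$) is:
$$ D_{\mathbf{v}} f = \nabla f \cdot \mathbf{v} $$
If $\mathbf{v}$ is not a unit vector, normalize it first by dividing by $\|\mathbf{v}\|$.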
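A small sketch of the directional derivative follows, assuming the usual convention that the direction is normalized to unit length before taking the dot product. The function name `directional_derivative` and the example values are illustrative.

```python
import math


def directional_derivative(grad, v):
    """Dot product of the gradient with the unit vector in the direction of v."""
    norm = math.sqrt(sum(c * c for c in v))
    if norm == 0:
        raise ValueError("direction vector must be nonzero")
    unit = [c / norm for c in v]
    return sum(g * u for g, u in zip(grad, unit))


# For f(x, y) = x^2 + y^2, the gradient at (1, 2) is (2, 4).
# The direction (3, 4) normalizes to (0.6, 0.8).
rate = directional_derivative([2.0, 4.0], [3.0, 4.0])
print(rate)
```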

### Multiple Integrals

**Double and Triple Integrals:**
Integrals over regions in two or three dimensions:
$$ \iint_R f(x, y) \, dA $$
$$ \iiint_W f(x, y, z) \, dV $$

**Change of Variables:**
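Using polar, cylindrical, or spherical coordinates to simplify integration. The integrand must be multiplied by the absolute value of the Jacobian determinant of the transformation, e.g., $dA = r \, dr \, d\theta$ in polar coordinates.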
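As a worked example, consider integrating $e^{-(x^2 + y^2)}$ over a disk $R$ of radius $a$ centered at the origin (an illustrative choice). In polar coordinates $x^2 + y^2 = r^2$ and the area element is $r \, dr \, d\theta$, so:
$$ \iint_R e^{-(x^2 + y^2)} \, dA = \int_0^{2\pi} \int_0^a e^{-r^2} \, r \, dr \, d\theta = 2\pi \cdot \frac{1 - e^{-a^2}}{2} = \pi \left( 1 - e^{-a^2} \right) $$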

### Vector Calculus

**Divergence and Curl:**
- **Divergence:** $\nabla \cdot \mathbf{F}$
- **Curl:** $\nabla \times \mathbf{F}$

**Line and Surface Integrals:**
Integrals over curves and surfaces, used in fields like electromagnetism.

## Differential Equations

### Ordinary Differential Equations (ODEs)

**First-Order ODEs:**
Equations involving the first derivative, e.g., $\frac{dy}{dx} = f(x, y)$.
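One simple way to approximate the solution of a first-order ODE is the forward Euler method, which steps along the slope $f(x, y)$. The sketch below is a minimal version with a fixed step count; the name `euler` and the test problem $\frac{dy}{dx} = y$, $y(0) = 1$ are assumptions for illustration.

```python
import math


def euler(f, x0, y0, x_end, n=1000):
    """Forward Euler: advance y by h * f(x, y) over n equal steps."""
    h = (x_end - x0) / n
    x, y = x0, y0
    for _ in range(n):
        y += h * f(x, y)
        x += h
    return y


def rhs(x, y):
    # dy/dx = y has the exact solution y = e^x when y(0) = 1.
    return y


approx = euler(rhs, 0.0, 1.0, 1.0)
print(approx, math.e)
```

Forward Euler is first-order accurate, so halving the step size roughly halves the error. Higher-order schemes such as Runge-Kutta methods are usually preferred in practice.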

**Higher-Order ODEs:**
Equations involving higher derivatives, e.g., $\frac{d^2y}{dx^2} + p(x)\frac{dy}{dx} + q(x)y = g(x)$.

### Partial Differential Equations (PDEs)

**Common PDEs:**
- **Heat Equation:** $u_t = \alpha \nabla^2 u$
- **Wave Equation:** $u_{tt} = c^2 \nabla^2 u$

**Numerical Solutions:**
Using finite difference methods to approximate solutions to PDEs.
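As a sketch of a finite difference method, the one-dimensional heat equation $u_t = \alpha u_{xx}$ can be advanced with an explicit scheme. Here `r` stands for $\alpha \, \Delta t / \Delta x^2$, and the grid size, initial spike, and fixed zero boundary values are all assumptions made for this example.

```python
def heat_step(u, r):
    """One explicit time step; the endpoints are held fixed (Dirichlet boundary)."""
    new_u = u[:]
    for i in range(1, len(u) - 1):
        # Discrete Laplacian: u[i+1] - 2*u[i] + u[i-1]
        new_u[i] = u[i] + r * (u[i + 1] - 2 * u[i] + u[i - 1])
    return new_u


# Initial condition: a single spike of heat in the middle of the rod.
u = [0.0] * 11
u[5] = 1.0

r = 0.25  # this explicit scheme is stable in 1D only when r <= 0.5
for _ in range(50):
    u = heat_step(u, r)

print(u)
```

Over time the spike spreads out and flattens, which matches the smoothing behavior expected of the heat equation.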

## Applications in AI

### Optimization

**Cost Functions:**
Using calculus to minimize or maximize cost functions in machine learning models.

**Backpropagation:**
Calculating the gradient of the loss with respect to every weight in a neural network by applying the chain rule layer by layer, propagating errors backward from the output.
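To make the idea concrete, the following sketch applies the chain rule by hand to a single sigmoid neuron with a squared-error loss. The model, the loss, and all names (`forward`, `loss`, `backward`) are assumptions chosen for a minimal illustration, not a general backpropagation implementation.

```python
import math


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def forward(w, b, x):
    """Single neuron: a = sigmoid(w * x + b)."""
    return sigmoid(w * x + b)


def loss(w, b, x, y):
    """Squared error: L = 0.5 * (a - y)^2."""
    return 0.5 * (forward(w, b, x) - y) ** 2


def backward(w, b, x, y):
    """Return (dL/dw, dL/db) using the chain rule, from the output backward."""
    a = forward(w, b, x)
    dL_da = a - y            # derivative of the loss w.r.t. the activation
    da_dz = a * (1 - a)      # derivative of the sigmoid w.r.t. its input
    dL_dz = dL_da * da_dz
    return dL_dz * x, dL_dz  # dz/dw = x and dz/db = 1


w, b, x, y = 0.5, -0.2, 1.5, 1.0
dw, db = backward(w, b, x, y)
print(dw, db)
```

A useful sanity check is to compare these analytic gradients with finite-difference estimates of `loss`.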

**Convex Optimization:**
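Solving optimization problems where the objective function is convex. For a convex function, any local minimum is also a global minimum, so gradient-based methods cannot get stuck in a suboptimal local minimum.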

### Machine Learning Algorithms

**Support Vector Machines (SVMs):**
Using derivatives, typically through Lagrange multipliers or subgradient methods, to find the separating hyperplane that maximizes the margin between classes.

**Regularization:**
Techniques like L2 and L1 regularization to prevent overfitting by adding penalty terms to the cost function.
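The penalty terms can be written out directly. In this sketch `lam` is the regularization strength and the helper names are hypothetical; the L2 penalty is taken as $\lambda \sum_i w_i^2$ and the L1 penalty as $\lambda \sum_i |w_i|$, though some texts scale the L2 term by $\frac{1}{2}$.

```python
def l2_penalty(weights, lam):
    """L2 penalty: lam * sum of squared weights."""
    return lam * sum(w * w for w in weights)


def l1_penalty(weights, lam):
    """L1 penalty: lam * sum of absolute weights."""
    return lam * sum(abs(w) for w in weights)


def l2_penalty_grad(weights, lam):
    """Gradient of the L2 penalty with respect to each weight: 2 * lam * w."""
    return [2 * lam * w for w in weights]


weights = [1.0, -2.0]
print(l2_penalty(weights, 0.1))
print(l1_penalty(weights, 0.1))
print(l2_penalty_grad(weights, 0.1))
```

The L2 gradient shrinks every weight in proportion to its size. The L1 penalty is not differentiable at zero, so it is usually handled with subgradients or proximal methods, and it tends to push some weights exactly to zero.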

**Neural Networks:**
Understanding how weights are adjusted using gradients to train models, and how the derivatives of activation functions shape those gradients.

### Probabilistic Models

**Expectation-Maximization (EM):**
An algorithm for finding maximum likelihood estimates in models with latent variables.

**Bayesian Inference:**
Calculating posterior distributions using integration, often requiring numerical methods.
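As a small example of numerical integration in Bayesian inference, the sketch below estimates the posterior mean of a coin's bias $\theta$ under a uniform prior, using a midpoint-rule grid on $[0, 1]$. The coin-flip model, the function name `posterior_mean`, and the grid size are assumptions for illustration.

```python
def posterior_mean(heads, tails, n=10000):
    """Posterior mean of theta under a uniform prior, by midpoint integration."""
    h = 1.0 / n
    evidence = 0.0  # integral of likelihood * prior (the normalizing constant)
    weighted = 0.0  # integral of theta * likelihood * prior
    for i in range(n):
        theta = (i + 0.5) * h
        # Bernoulli likelihood times a uniform prior density of 1
        unnormalized = theta ** heads * (1 - theta) ** tails
        evidence += unnormalized * h
        weighted += theta * unnormalized * h
    return weighted / evidence


# With 7 heads and 3 tails the posterior is Beta(8, 4), whose mean is 8/12.
estimate = posterior_mean(7, 3)
print(estimate)
```

Grid integration like this works in one or two dimensions. Higher-dimensional posteriors typically call for sampling methods such as Markov chain Monte Carlo or for variational approximations.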
