# Essential Mathematics for Data Analysts

Data analysis rests on a fairly small set of mathematical ideas. This guide summarizes the key topics and gives the defining equation for each.

## 1. **Statistics**
### Mean
The mean (average) of a dataset is calculated as:
$ \text{Mean} = \frac{\sum_{i=1}^{n} x_i}{n} $

### Median
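The median is the middle value of a sorted dataset. If the dataset has an odd number of elements, it is the single middle value:
$ \text{Median} = x_{\frac{n+1}{2}} $

If the dataset has an even number of elements, it is the average of the two middle values:
$ \text{Median} = \frac{x_{\frac{n}{2}} + x_{\frac{n}{2} + 1}}{2} $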

### Mode
The mode is the value(s) that appear most frequently in a dataset.
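
As a quick illustration of the three measures above, here is a minimal sketch using Python's standard `statistics` module. The `data` list is made up for the example.

```python
import statistics

# Illustrative values only, not taken from any real dataset.
data = [2, 3, 3, 5, 7, 10]

mean_value = statistics.mean(data)      # sum of the values divided by n
median_value = statistics.median(data)  # n is even, so the two middle values (3 and 5) are averaged
modes = statistics.multimode(data)      # list of every value tied for most frequent

print(mean_value, median_value, modes)
```

`multimode` returns a list because a dataset can have more than one mode.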

### Variance
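Variance measures how far data points spread around the mean $ \mu $. For a full population of $ n $ values:
$ \text{Variance} = \frac{\sum_{i=1}^{n} (x_i - \mu)^2}{n} $

When the data is a sample drawn from a larger population, divide by $ n - 1 $ instead of $ n $ to get an unbiased estimate.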

### Standard Deviation
Standard deviation is the square root of the variance:
$ \text{Standard Deviation} = \sqrt{\frac{\sum_{i=1}^{n} (x_i - \mu)^2}{n}} $
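
The formulas above divide by $ n $, which is the population form. The sketch below computes them by hand on a made-up `data` list and, for comparison, calls `statistics.variance`, which divides by $ n - 1 $ (the sample form).

```python
import math
import statistics

# Illustrative values only.
data = [2, 4, 4, 4, 5, 5, 7, 9]

mu = sum(data) / len(data)

# Population variance: mean of the squared deviations from mu.
population_variance = sum((x - mu) ** 2 for x in data) / len(data)
population_std = math.sqrt(population_variance)

# Sample variance from the standard library (divides by n - 1).
sample_variance = statistics.variance(data)

print(population_variance, population_std, sample_variance)
```

The sample variance is always slightly larger than the population variance computed on the same numbers, and the gap shrinks as $ n $ grows.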

### Covariance
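Covariance measures how two variables vary together. A positive value means they tend to move in the same direction; a negative value means they tend to move in opposite directions: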
$ \text{Cov}(X, Y) = \frac{\sum_{i=1}^{n} (x_i - \mu_X)(y_i - \mu_Y)}{n} $

### Correlation Coefficient
The correlation coefficient divides the covariance by the standard deviations $ \sigma_X $ and $ \sigma_Y $, which scales it to lie between -1 and 1:
$ r = \frac{\text{Cov}(X, Y)}{\sigma_X \sigma_Y} $
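
A minimal sketch of both formulas, written out term by term so each step maps onto the equations above. The paired lists `x` and `y` are made up for the example.

```python
import math

# Illustrative paired observations.
x = [1, 2, 3, 4, 5]
y = [2, 4, 5, 4, 5]

n = len(x)
mu_x = sum(x) / n
mu_y = sum(y) / n

# Population covariance: mean of the products of paired deviations.
cov_xy = sum((a - mu_x) * (b - mu_y) for a, b in zip(x, y)) / n

# Population standard deviations of each variable.
sigma_x = math.sqrt(sum((a - mu_x) ** 2 for a in x) / n)
sigma_y = math.sqrt(sum((b - mu_y) ** 2 for b in y) / n)

# Correlation coefficient: covariance scaled by both standard deviations.
r = cov_xy / (sigma_x * sigma_y)

print(cov_xy, r)
```

Unlike covariance, `r` has no units, so it can be compared across pairs of variables measured on different scales.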

---

## 2. **Probability**
### Probability of an Event
When all outcomes are equally likely, the probability of an event $ A $ is:
$ P(A) = \frac{\text{Number of favorable outcomes}}{\text{Total number of outcomes}} $

### Conditional Probability
The probability of $ A $ given that $ B $ has occurred:
$ P(A \mid B) = \frac{P(A \cap B)}{P(B)} $, where $ P(B) > 0 $.

### Bayes' Theorem
$ P(A \mid B) = \frac{P(B \mid A) P(A)}{P(B)} $
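
A worked example can make the theorem more concrete. The sketch below uses a diagnostic-test scenario in which all three input probabilities are hypothetical numbers chosen for illustration. The denominator $ P(B) $ is obtained from the law of total probability.

```python
# Hypothetical inputs, chosen only for illustration.
p_disease = 0.01            # P(A): prior probability of having the condition
p_pos_given_disease = 0.95  # P(B | A): test is positive when the condition is present
p_pos_given_healthy = 0.05  # P(B | not A): false-positive rate

# P(B) via the law of total probability.
p_pos = (p_pos_given_disease * p_disease
         + p_pos_given_healthy * (1 - p_disease))

# Bayes' theorem: P(A | B) = P(B | A) P(A) / P(B).
p_disease_given_pos = p_pos_given_disease * p_disease / p_pos

print(round(p_disease_given_pos, 4))
```

With these numbers the posterior is roughly 16%, far below the 95% figure, because the condition is rare to begin with.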

### Expected Value
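For a discrete random variable $ X $, the expected value is the probability-weighted average of its possible values:
$ \mathbb{E}[X] = \sum_{i} x_i P(x_i) $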
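
As an illustration, the expected value of a fair six-sided die can be computed directly from the sum. The sketch uses `fractions.Fraction` so the result is exact; the die is an assumed example, not part of the original text.

```python
from fractions import Fraction

# A fair six-sided die: each face has probability 1/6.
outcomes = [1, 2, 3, 4, 5, 6]
probability = Fraction(1, 6)

# E[X] = sum of x_i * P(x_i).
expected_value = sum(x * probability for x in outcomes)

print(expected_value)  # 7/2
```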

---

## 3. **Linear Algebra**
### Matrix Operations
- **Matrix Addition**: $ A + B = C $, where $ c_{ij} = a_{ij} + b_{ij} $
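- **Matrix Multiplication**: $ C = AB $, where $ c_{ij} = \sum_{k} a_{ik} b_{kj} $. The product is defined only when the number of columns of $ A $ equals the number of rows of $ B $.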
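
The two element-wise definitions above can be sketched in plain Python with nested lists. The helper names `mat_add` and `mat_mul` and the matrices `A` and `B` are made up for the example; in practice a library such as NumPy would normally be used instead.

```python
def mat_add(A, B):
    """Element-wise sum: c_ij = a_ij + b_ij (A and B must have the same shape)."""
    return [[a + b for a, b in zip(row_a, row_b)] for row_a, row_b in zip(A, B)]


def mat_mul(A, B):
    """Matrix product: c_ij = sum over k of a_ik * b_kj."""
    inner = len(B)    # rows of B, must equal the columns of A
    cols = len(B[0])  # columns of the result
    return [[sum(row[k] * B[k][j] for k in range(inner)) for j in range(cols)]
            for row in A]


A = [[1, 2], [3, 4]]
B = [[5, 6], [7, 8]]

print(mat_add(A, B))  # [[6, 8], [10, 12]]
print(mat_mul(A, B))  # [[19, 22], [43, 50]]
```

Swapping the arguments to `mat_mul` gives a different result, since matrix multiplication is not commutative in general.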

### Transpose
The transpose of a matrix $ A = [a_{ij}] $ swaps its rows and columns:
$ A^T = [a_{ji}] $

### Determinant (for a 2x2 matrix)
$ \text{det}(A) = ad - bc, \text{where } A = \begin{bmatrix} a & b \\ c & d \end{bmatrix} $
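
A short sketch of the transpose and the 2x2 determinant in plain Python. The matrix `A` is an arbitrary example.

```python
A = [[1, 2],
     [3, 4]]

# Transpose: rows become columns, so entry (i, j) moves to (j, i).
A_T = [list(column) for column in zip(*A)]

# 2x2 determinant: ad - bc.
(a, b), (c, d) = A
det_A = a * d - b * c

print(A_T)    # [[1, 3], [2, 4]]
print(det_A)  # -2
```

A determinant of zero would indicate that the matrix is not invertible.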

### Eigenvalues and Eigenvectors
For a square matrix $ A $:
$ A v = \lambda v $, where $ \lambda $ is an eigenvalue and $ v \neq 0 $ is a corresponding eigenvector.
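
For a 2x2 matrix the eigenvalues can be found by hand as the roots of the characteristic polynomial $ \lambda^2 - \text{tr}(A)\lambda + \text{det}(A) = 0 $. The sketch below does this for an example symmetric matrix and then checks $ A v = \lambda v $ directly. The matrix, the candidate eigenvectors, and the helper `mat_vec` are all chosen for illustration.

```python
import math

A = [[2, 1],
     [1, 2]]

trace = A[0][0] + A[1][1]
det = A[0][0] * A[1][1] - A[0][1] * A[1][0]

# Roots of lambda^2 - trace * lambda + det = 0 (real for this symmetric matrix).
discriminant = math.sqrt(trace ** 2 - 4 * det)
lambda_1 = (trace + discriminant) / 2
lambda_2 = (trace - discriminant) / 2


def mat_vec(M, v):
    """Multiply matrix M by column vector v."""
    return [sum(m * x for m, x in zip(row, v)) for row in M]


# Candidate eigenvectors for this particular matrix.
v1 = [1, 1]
v2 = [1, -1]

print(lambda_1, mat_vec(A, v1))  # A v1 equals lambda_1 * v1
print(lambda_2, mat_vec(A, v2))  # A v2 equals lambda_2 * v2
```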

---

## 4. **Calculus**
### Derivatives
The derivative of a function measures its rate of change:
$ f'(x) = \lim_{h \to 0} \frac{f(x + h) - f(x)}{h} $
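
The limit definition suggests a simple numerical approximation: pick a small but nonzero $ h $ and evaluate the quotient. This is a rough sketch; the step size `h=1e-6` and the test function `square` are assumptions for the example.

```python
def derivative(f, x, h=1e-6):
    """Forward-difference approximation of f'(x) with a small step h."""
    return (f(x + h) - f(x)) / h


def square(x):
    return x * x


# The exact derivative of x^2 at x = 3 is 6.
approx = derivative(square, 3.0)
print(approx)
```

The result is close to 6 but not exact, because `h` is finite and floating-point arithmetic rounds.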

### Partial Derivatives
Partial derivatives are used when dealing with multivariable functions:
$ \frac{\partial f}{\partial x} = \lim_{h \to 0} \frac{f(x + h, y) - f(x, y)}{h} $

### Gradient
The gradient is a vector of partial derivatives:
$ \nabla f(x, y) = \begin{bmatrix} \frac{\partial f}{\partial x} \\ \frac{\partial f}{\partial y} \end{bmatrix} $
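
The partial-derivative limit can be approximated in the same way, one variable at a time, and the results stacked into a gradient. In this sketch the function `f`, the evaluation point, and the step `h` are all assumptions chosen for illustration.

```python
def f(x, y):
    # Example function: df/dx = 2x and df/dy = 3.
    return x ** 2 + 3 * y


def gradient(func, x, y, h=1e-6):
    """Approximate [df/dx, df/dy] by nudging one variable at a time."""
    df_dx = (func(x + h, y) - func(x, y)) / h  # y held fixed
    df_dy = (func(x, y + h) - func(x, y)) / h  # x held fixed
    return [df_dx, df_dy]


grad = gradient(f, 2.0, 1.0)
print(grad)  # close to [4, 3]
```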

### Integration
The integral of a function represents the area under its curve:
$ \int_a^b f(x) \, dx $
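
A definite integral can be approximated numerically by slicing the interval into thin strips and summing their areas. The sketch below uses the trapezoid rule, which is one of several possible schemes; the helper name `trapezoid`, the integrand, and the strip count `n=1000` are assumptions for the example.

```python
def trapezoid(f, a, b, n=1000):
    """Approximate the integral of f over [a, b] using n trapezoids."""
    h = (b - a) / n
    total = 0.5 * (f(a) + f(b))  # the two endpoints count half
    for i in range(1, n):
        total += f(a + i * h)    # interior points count fully
    return total * h


# The exact integral of x^2 from 0 to 3 is 9.
area = trapezoid(lambda x: x ** 2, 0.0, 3.0)
print(area)
```

Increasing `n` tightens the approximation at the cost of more function evaluations.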

---

## 5. **Optimization**
### Gradient Descent
Gradient descent minimizes a function by repeatedly stepping in the direction opposite to its gradient:
$ \theta \leftarrow \theta - \alpha \nabla J(\theta) $
Where:
- $ \theta $: Parameters
- $ \alpha $: Learning rate
- $ \nabla J(\theta) $: Gradient of the cost function
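
The update rule can be sketched on a one-parameter problem. The cost function $ J(\theta) = (\theta - 3)^2 $, the starting point, the learning rate, and the iteration count below are all assumptions chosen so the behaviour is easy to follow.

```python
def grad_J(theta):
    """Gradient of the example cost J(theta) = (theta - 3)^2."""
    return 2 * (theta - 3)


theta = 0.0  # initial guess
alpha = 0.1  # learning rate

for _ in range(100):
    # One gradient descent step: move against the gradient.
    theta = theta - alpha * grad_J(theta)

print(theta)  # close to the minimizer, 3
```

With this cost each step shrinks the distance to the minimum by a factor of 0.8. A learning rate that is too large (above 1.0 here) would make the iterates diverge instead.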

---

## 6. **Discrete Mathematics**
### Summation Notation
$ \sum_{i=1}^{n} x_i $

### Factorial
$ n! = n \times (n-1) \times (n-2) \cdots 1 $, with $ 0! = 1 $ by convention.

### Permutations
The number of ordered arrangements of $ r $ items chosen from $ n $:
$ P(n, r) = \frac{n!}{(n-r)!} $

### Combinations
The number of ways to choose $ r $ items from $ n $ when order does not matter:
$ C(n, r) = \frac{n!}{r!(n-r)!} $
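
Python's standard `math` module covers all three counting formulas (`math.perm` and `math.comb` require Python 3.8 or later). The values of `n` and `r` below are arbitrary.

```python
import math

n, r = 5, 2

factorial_n = math.factorial(n)  # 5! = 120
permutations = math.perm(n, r)   # ordered selections: 5! / 3! = 20
combinations = math.comb(n, r)   # unordered selections: 5! / (2! * 3!) = 10

print(factorial_n, permutations, combinations)
```

Each combination of 2 items can be ordered in $ 2! $ ways, which is why there are twice as many permutations as combinations here.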

---

## 7. **Probability Distributions**
### Normal Distribution
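The probability density function of a normal distribution with mean $ \mu $ and standard deviation $ \sigma $:
$ f(x) = \frac{1}{\sqrt{2 \pi \sigma^2}} e^{-\frac{(x-\mu)^2}{2\sigma^2}} $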
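
The density can be coded directly from the formula. As a sanity check, the sketch compares the hand-written `normal_pdf` (a name chosen for this example) against `statistics.NormalDist` from the standard library, available in Python 3.8 and later.

```python
import math
from statistics import NormalDist


def normal_pdf(x, mu=0.0, sigma=1.0):
    """Normal density written directly from the formula."""
    exponent = -((x - mu) ** 2) / (2 * sigma ** 2)
    return math.exp(exponent) / math.sqrt(2 * math.pi * sigma ** 2)


# Density of the standard normal at its mean, which is its highest point.
peak = normal_pdf(0.0)
reference = NormalDist(mu=0.0, sigma=1.0).pdf(0.0)

print(peak, reference)
```

Note that the density is not itself a probability; probabilities come from integrating it over an interval.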

### Binomial Distribution
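The probability of exactly $ k $ successes in $ n $ independent trials, each with success probability $ p $:
$ P(X = k) = \binom{n}{k} p^k (1-p)^{n-k} $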
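
A minimal sketch of the formula, using fair coin flips as an assumed example. The function name `binomial_pmf` is made up for the illustration.

```python
import math


def binomial_pmf(k, n, p):
    """Probability of exactly k successes in n independent trials."""
    return math.comb(n, k) * p ** k * (1 - p) ** (n - k)


# Probability of exactly 3 heads in 10 fair coin flips.
p_three_heads = binomial_pmf(3, 10, 0.5)

# The probabilities over all possible k should sum to 1.
total = sum(binomial_pmf(k, 10, 0.5) for k in range(11))

print(p_three_heads, total)
```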

---

These concepts come up throughout everyday analysis work, so a firm grasp of them makes it easier to summarize data, reason about uncertainty, and understand how models are fitted.
