## Stat Cheat Sheet - Basics

#### Sample Mean
$$\overline x = \frac{x_1 + x_2 + ... + x_n}{n} = \frac{\sum_{i = 1}^n x_i}{n}$$

#### Deviation from average
$$d_i = x_i - \overline x$$
$$\sum_{i=1}^n d_i = \sum_{i=1}^n (x_i - \overline x) = 0$$
$$\sum_{i=1}^n (x_i - \overline x)^2 = \sum_{i=1}^n (x_i)^2 - n(\overline x)^2$$
$$\sum_{i=1}^n (x_i - \overline x)(y_i - \overline y) = \sum_{i=1}^n (x_i - \overline x)y_i = \sum_{i=1}^n x_i(y_i - \overline y) = \sum_{i=1}^n x_iy_i - n\overline x \, \overline y$$
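
The identities above can be checked numerically. The following is a minimal sketch on a small made-up data set (the values of `x` and `y` are illustrative only): it computes each sum both directly and through the shortcut form.

```python
import math

# Illustrative data, chosen only so the arithmetic is easy to follow.
x = [2.0, 4.0, 4.0, 7.0, 8.0]
y = [1.0, 3.0, 2.0, 6.0, 8.0]
n = len(x)
x_bar = sum(x) / n
y_bar = sum(y) / n

# Deviations from the mean sum to zero.
dev_sum = sum(xi - x_bar for xi in x)

# Sum of squared deviations: direct form and shortcut form.
sxx_direct = sum((xi - x_bar) ** 2 for xi in x)
sxx_shortcut = sum(xi ** 2 for xi in x) - n * x_bar ** 2

# Sum of cross-products: direct form and shortcut form.
sxy_direct = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y))
sxy_shortcut = sum(xi * yi for xi, yi in zip(x, y)) - n * x_bar * y_bar

print(dev_sum, sxx_direct, sxx_shortcut, sxy_direct, sxy_shortcut)
```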

#### Sample median 
Obtained by first ordering the n observations from smallest to largest (with any repeated values included so that every sample observation appears in the ordered list). Then,
$$\tilde x = \begin{cases}
\text{the single middle value if } n \text{ is odd} = \biggl(\frac{n + 1}{2}\biggr)^{th} \text{ ordered value} \\
\text{the average of the two middle values if } n \text{ is even} = \text{average of the } \biggl(\frac{n}{2}\biggr)^{th} \text{ and } \biggl(\frac{n}{2} + 1 \biggr)^{th} \text{ ordered values}
\end{cases} $$
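
As a sketch of the rule above, the helper below (the name `sample_median` is hypothetical) sorts the observations and picks the middle value or the average of the two middle values. Python lists are 0-based, so the $\bigl(\frac{n+1}{2}\bigr)^{th}$ ordered value sits at index `n // 2` when n is odd.

```python
def sample_median(values):
    """Return the sample median of a non-empty sequence of numbers."""
    ordered = sorted(values)  # repeated values stay in the ordered list
    n = len(ordered)
    if n == 0:
        raise ValueError("median of an empty sample is undefined")
    mid = n // 2
    if n % 2 == 1:
        # n odd: the single middle value
        return ordered[mid]
    # n even: average of the (n/2)-th and (n/2 + 1)-th ordered values
    return (ordered[mid - 1] + ordered[mid]) / 2


print(sample_median([5, 1, 3]))     # odd n
print(sample_median([4, 1, 3, 2]))  # even n
```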

#### Sample Variance
$$s^2 = \frac{\sum (x_i - \overline x)^2}{n - 1} = \frac{S_{xx}}{n - 1}$$
If $y_1 = x_1 + c, y_2 = x_2 + c, \dots, y_n = x_n + c$, then $s_y^2 = s_x^2$ <BR>
If $y_1 = cx_1, y_2 = cx_2, \dots, y_n = cx_n$, then $s_y^2 = c^2s_x^2$

#### Sample Standard Deviation
$$s = \sqrt {s^2}$$
If $y_1 = cx_1, y_2 = cx_2, \dots, y_n = cx_n$, then $s_y = |c|s_x$
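
The shift and scale rules for the variance and standard deviation can be verified with the standard library. This is a small sketch on made-up data, with a negative constant `c` chosen to show why the absolute value appears in $s_y = |c|s_x$.

```python
import math
import statistics

x = [2.0, 4.0, 4.0, 7.0, 8.0]  # illustrative data
c = -3.0                        # illustrative constant (negative on purpose)

shifted = [xi + c for xi in x]
scaled = [c * xi for xi in x]

# statistics.variance and statistics.stdev use the n - 1 denominator.
var_x = statistics.variance(x)
var_shifted = statistics.variance(shifted)
var_scaled = statistics.variance(scaled)
sd_x = statistics.stdev(x)
sd_scaled = statistics.stdev(scaled)

print(var_x, var_shifted, var_scaled)
print(sd_x, sd_scaled)
```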

#### Sample Covariance
$$cov(x,y) = \frac{\sum_i(x_i - \overline x)(y_i - \overline y)}{n - 1}$$

#### Sample Correlation
$$r = \frac{cov(x,y)}{s_xs_y}$$
Correlation 
* varies between -1 and 1
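* as a rough rule of thumb, $|r|$ above .5 is often read as a strong linear relationship (the cutoff depends on the field)
* doesn't change when we add constants to, or multiply by positive constants, the x or y variables (multiplying one variable by a negative constant flips the sign of $r$)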
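
A minimal sketch of the covariance and correlation formulas, using hypothetical helper names `sample_cov` and `sample_corr` and made-up data. It also checks how $r$ responds to adding a constant, multiplying by a positive constant, and multiplying by a negative constant (the last one flips the sign).

```python
import math


def sample_cov(x, y):
    """Sample covariance with the n - 1 denominator."""
    n = len(x)
    x_bar, y_bar = sum(x) / n, sum(y) / n
    return sum((a - x_bar) * (b - y_bar) for a, b in zip(x, y)) / (n - 1)


def sample_corr(x, y):
    """Sample correlation: cov(x, y) / (s_x * s_y)."""
    return sample_cov(x, y) / math.sqrt(sample_cov(x, x) * sample_cov(y, y))


x = [2.0, 4.0, 4.0, 7.0, 8.0]  # illustrative data
y = [1.0, 3.0, 2.0, 6.0, 8.0]

r = sample_corr(x, y)
r_shifted = sample_corr([a + 10 for a in x], y)  # add a constant to x
r_scaled = sample_corr([3 * a for a in x], y)    # multiply x by a positive constant
r_flipped = sample_corr([-3 * a for a in x], y)  # multiply x by a negative constant

print(r, r_shifted, r_scaled, r_flipped)
```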

#### Variance Covariance Matrix
$$vcov(x,y) = \begin{bmatrix}
var(x) & cov(x,y) \\
cov(x,y) & var(y)
\end{bmatrix}$$

$$var(X) = \begin{bmatrix}
var(X_1) & cov(X_1, X_2) & \dots & cov(X_1, X_n) \\
cov(X_2, X_1) & var(X_2) & \dots & cov(X_2, X_n) \\
\vdots & \vdots & \ddots & \vdots \\
cov(X_n, X_1) & cov(X_n, X_2) & \dots & var(X_n) \\
\end{bmatrix}$$
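
The matrix above can be built by applying the sample covariance to every pair of variables. The sketch below uses plain Python lists and hypothetical helper names (`sample_cov`, `vcov_matrix`); the three variables are made-up data.

```python
def sample_cov(x, y):
    """Sample covariance with the n - 1 denominator."""
    n = len(x)
    x_bar, y_bar = sum(x) / n, sum(y) / n
    return sum((a - x_bar) * (b - y_bar) for a, b in zip(x, y)) / (n - 1)


def vcov_matrix(variables):
    """Entry (i, j) is cov(X_i, X_j); the diagonal holds the variances."""
    return [[sample_cov(a, b) for b in variables] for a in variables]


x1 = [2.0, 4.0, 4.0, 7.0, 8.0]  # illustrative data
x2 = [1.0, 3.0, 2.0, 6.0, 8.0]
x3 = [9.0, 7.0, 8.0, 3.0, 3.0]

V = vcov_matrix([x1, x2, x3])
for row in V:
    print(row)
```

The result is symmetric because $cov(X_i, X_j) = cov(X_j, X_i)$.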

#### Linear Relations
$$y=\beta_0 + \beta_1 x$$

- A one-unit change in $x$ always results in the same change in $y$ (namely $\beta_1$), regardless of the initial value of $x$
- Marginal effect of $x$ on $y$ is constant

**Average Propensity**
$$\frac{y}{x} = \frac{\beta_0}{x} + \beta_1$$
where $\beta_1$ is the **marginal propensity**. When $\beta_0 > 0$ and $x > 0$, the average propensity is always greater than the marginal propensity and gets closer to it as $x$ increases.
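
A quick numeric illustration of the average propensity, assuming hypothetical coefficients $\beta_0 = 100$ and $\beta_1 = 0.8$ (both positive, which is what makes the average exceed the marginal propensity here):

```python
beta0, beta1 = 100.0, 0.8  # hypothetical coefficients, not taken from any data


def average_propensity(x):
    """y / x for the linear relation y = beta0 + beta1 * x."""
    return beta0 / x + beta1


xs = [500.0, 1000.0, 10000.0]
aps = [average_propensity(x) for x in xs]

for x, ap in zip(xs, aps):
    print(x, ap)  # the average propensity falls toward beta1 as x grows
```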

#### Non Linear Relations
$$y=\beta_0 + \beta_1 x + \beta_2 x^2$$
- Change in $y$ for a given change in $x$ depends on the starting value of $x$.

Turning point of the function (a maximum if $\beta_2 < 0$, a minimum if $\beta_2 > 0$) occurs at
$$x = \frac{\beta_1}{-2\beta_2}$$

Slope
$$=\frac{\Delta y}{\Delta x} \approx \beta_1 + 2\beta_2 x$$
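
The turning point and slope formulas can be checked on a concrete quadratic. This sketch assumes hypothetical coefficients with $\beta_2 < 0$, so the turning point is a maximum, and compares the slope formula with a finite-difference estimate.

```python
beta0, beta1, beta2 = 1.0, 8.0, -2.0  # hypothetical coefficients, beta2 < 0


def f(x):
    return beta0 + beta1 * x + beta2 * x ** 2


def slope(x):
    """Approximate change in y per unit change in x at the point x."""
    return beta1 + 2 * beta2 * x


turning_point = -beta1 / (2 * beta2)

# Finite-difference estimate of the slope at x = 1 for comparison.
h = 1e-6
fd_slope = (f(1.0 + h) - f(1.0)) / h

print(turning_point, slope(turning_point))
print(slope(1.0), fd_slope)
```

The slope depends on the starting value of $x$ and is zero at the turning point.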

#### Logarithmic and Exponential Functions

For small changes in x
$$\Delta log(x) = log(x_1) - log(x_0) \approx \frac{x_1 - x_0}{x_0} = \frac{\Delta x}{x_0}$$
$$100 \cdot \Delta log(x) \approx \text{%} \Delta x$$

$$log[exp(x)] = x$$
$$log(y) = \beta_0 + \beta_1 x \iff y = exp(\beta_0 + \beta_1 x)$$
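
To see how good the log approximation is, the sketch below compares $100 \cdot \Delta log(x)$ with the exact percentage change for a small and a large change in $x$ (the numbers are illustrative). Here `math.log` is the natural logarithm.

```python
import math


def log_approx_pct(x0, x1):
    """100 * (log(x1) - log(x0)), the log approximation to the % change."""
    return 100 * (math.log(x1) - math.log(x0))


def exact_pct(x0, x1):
    """Exact percentage change from x0 to x1."""
    return 100 * (x1 - x0) / x0


small = (log_approx_pct(100, 102), exact_pct(100, 102))
large = (log_approx_pct(100, 150), exact_pct(100, 150))

print(small)  # the two numbers are close for a small change
print(large)  # the approximation degrades for a large change
```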

#### Elasticity

Elasticity of $y$ with respect to $x$ is the percentage change in $y$ when $x$ increases by 1%.

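$$\frac{\Delta y}{\Delta x} \cdot \frac{x}{y} = \frac{\text{%} \Delta y}{\text{%} \Delta x} \approx \frac{\Delta log(y)}{\Delta log(x)}$$
For the linear relation $y = \beta_0 + \beta_1 x$, the elasticity depends on $x$:
$$\frac{\Delta y}{\Delta x} \cdot \frac{x}{y} = \beta_1 \cdot \frac{x}{y} =  \beta_1 \cdot \frac{x}{\beta_0 + \beta_1 x}$$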

A **constant elasticity model** is approximated by the equation 
$$log(y) = \beta_0 + \beta_1log(x)$$ 
where $\beta_1$ is the elasticity of $y$ with respect to $x$ (assuming that $x, y \gt 0$)
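
In the log-log model the ratio $\Delta log(y) / \Delta log(x)$ is the same at every $x$. A minimal sketch, assuming hypothetical coefficients $\beta_0 = 0.5$ and $\beta_1 = 1.2$:

```python
import math

beta0, beta1 = 0.5, 1.2  # hypothetical coefficients


def y_of(x):
    """y implied by log(y) = beta0 + beta1 * log(x), for x > 0."""
    return math.exp(beta0 + beta1 * math.log(x))


def log_elasticity(x0, x1):
    """Change in log(y) divided by change in log(x) between x0 and x1."""
    return (math.log(y_of(x1)) - math.log(y_of(x0))) / (math.log(x1) - math.log(x0))


e_low = log_elasticity(10.0, 10.1)       # 1% increase at a low x
e_high = log_elasticity(1000.0, 1010.0)  # 1% increase at a high x

print(e_low, e_high)  # both are (up to rounding) beta1
```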

**Semi-elasticity** of $y$ with respect to $x$ is the percentage change in $y$ when $x$ increases by one unit. For the model $log(y) = \beta_0 + \beta_1 x$,
$$\frac{\text{%} \Delta y}{\Delta x} \approx 100 \beta_1$$
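
The $100 \beta_1$ figure is the log approximation. Assuming the log-level model $log(y) = \beta_0 + \beta_1 x$ with hypothetical coefficients, the sketch below compares it with the exact percentage change for a one-unit increase in $x$, which is $100 \cdot (exp(\beta_1) - 1)$.

```python
import math

beta0, beta1 = 1.0, 0.05  # hypothetical coefficients


def y_of(x):
    """y implied by log(y) = beta0 + beta1 * x."""
    return math.exp(beta0 + beta1 * x)


def pct_change_per_unit(x):
    """Exact % change in y when x increases from x to x + 1."""
    return 100 * (y_of(x + 1) - y_of(x)) / y_of(x)


approx = 100 * beta1                  # log approximation
exact = 100 * (math.exp(beta1) - 1)   # exact value, independent of x

print(approx, exact, pct_change_per_unit(3.0), pct_change_per_unit(40.0))
```

The two agree closely when $\beta_1$ is small and drift apart as $|\beta_1|$ grows.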

#### Normalization Vs Standardization

**Normalization** - To scale a variable to have values between 0 and 1. This is one common form of *feature scaling* (often called *min-max scaling*). One possible formula to achieve this is:
$$x_{new} = \frac{x - x_{min}}{x_{max} - x_{min}}$$

**Standardization** -  To transform data using *z-score* or *t-score* (usually to have a mean of zero and a standard deviation of 1)
$$z_i = \frac{x_i - \overline x}{s}$$
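
Both rescalings are short to write out. A sketch with hypothetical helper names (`min_max_scale`, `z_scores`) on made-up data; `statistics.stdev` uses the $n - 1$ denominator, matching the sample standard deviation $s$ defined earlier.

```python
import statistics


def min_max_scale(values):
    """Rescale values linearly so the minimum maps to 0 and the maximum to 1."""
    lo, hi = min(values), max(values)
    if hi == lo:
        raise ValueError("all values are equal; min-max scaling is undefined")
    return [(v - lo) / (hi - lo) for v in values]


def z_scores(values):
    """Subtract the sample mean and divide by the sample standard deviation."""
    mean = statistics.mean(values)
    s = statistics.stdev(values)
    return [(v - mean) / s for v in values]


x = [2.0, 4.0, 4.0, 7.0, 8.0]  # illustrative data
normalized = min_max_scale(x)
standardized = z_scores(x)

print(normalized)
print(standardized)
```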

**Normalizing vectors (in linear algebra) to a norm of one** - Normalization in this sense means to transform a vector so that it has a length of one.
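
A minimal sketch of this third meaning, assuming "length" means the Euclidean (L2) norm; the helper name `normalize` is hypothetical.

```python
import math


def normalize(vector):
    """Divide each component by the Euclidean length of the vector."""
    length = math.sqrt(sum(v * v for v in vector))
    if length == 0:
        raise ValueError("the zero vector cannot be normalized")
    return [v / length for v in vector]


unit = normalize([3.0, 4.0])  # [0.6, 0.8]
unit_length = math.sqrt(sum(v * v for v in unit))

print(unit, unit_length)
```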