# Descriptive Statistics: Measures of Dispersion (Absolute)


## Variance

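Variance is the average of the squared deviations of the data points from their mean.

It measures how spread out the data is: the larger the variance, the farther the data points tend to lie from the mean. Because the deviations are squared, variance is expressed in the squared units of the data.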

### Formula (variance)

#### Population Variance

$$
\sigma^2 = \frac{\sum_{i=1}^{N} (x_i - \mu)^2}{N}
$$

Calculate the difference between each data point ($x_i$) and the population mean ($\mu$), square each difference, and sum the squares.

Finally, divide by the number of data points ($N$).
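The steps above can be sketched in plain Python. This is a minimal illustration rather than a library implementation; the function name `population_variance` and the variable `values` are assumptions chosen for this example.

```python
def population_variance(data):
    """Population variance: mean of the squared deviations from the mean."""
    n = len(data)
    mu = sum(data) / n                             # population mean
    squared_devs = [(x - mu) ** 2 for x in data]   # (x_i - mu)^2 for each point
    return sum(squared_devs) / n                   # divide by N


values = [0, 1, 3, 5, 6]   # mean is 3; squared deviations are 9, 4, 0, 4, 9
print(population_variance(values))   # 26 / 5
```

The list used here is the same data as Example 1 below, so the printed value can be compared with the library results shown there.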

#### Sample Variance

$$
s^2 = \frac{\sum_{i=1}^{n} (x_i - \bar{x})^2}{n-1}
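$$

Here $\bar{x}$ is the sample mean and $n$ is the sample size. The steps are the same as for the population variance, except that the sum of squared differences is divided by $n-1$ instead of $n$ (Bessel's correction). This compensates for measuring deviations from the sample mean rather than the true population mean, which would otherwise make the estimate too small on average.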
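A minimal sketch of the sample formula, assuming at least two data points. The function name `sample_variance` is hypothetical, and the standard-library `statistics.variance` is used only as a cross-check.

```python
import statistics as sts


def sample_variance(data):
    """Sample variance: sum of squared deviations divided by n - 1."""
    n = len(data)
    if n < 2:
        raise ValueError("sample variance needs at least two data points")
    x_bar = sum(data) / n                                   # sample mean
    return sum((x - x_bar) ** 2 for x in data) / (n - 1)    # divide by n - 1


values = [0, 1, 3, 5, 6]
print(sample_variance(values))   # 26 / 4
print(sts.variance(values))      # standard-library equivalent
```

With NumPy, the same divisor can be requested through the `ddof` argument, for example `np.var(values, ddof=1)`.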

### Examples (variance)

In [23]:
import numpy as np
from scipy import stats
import statistics as sts

#### Example 1 (variance)

In [24]:
ex_one = [0, 1, 3, 5, 6]

In [25]:
sts.pvariance(ex_one) # population variance

5.2

In [26]:
sts.variance(ex_one) # sample variance

6.5

In [35]:
np.var(ex_one) # population variance (NumPy's default is ddof=0)

np.float64(5.2)

In [34]:
stats.tvar(ex_one) # sample variance (trimmed variance with no limits set, ddof=1 by default)

np.float64(6.5)
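
As a quick check on the outputs above: for the same data, the sample and population variances share the same sum of squared deviations, so they differ only by the factor $n/(n-1)$. For `ex_one`, with $n = 5$:

$$
s^2 = \frac{n}{n-1}\,\sigma^2 = \frac{5}{4} \times 5.2 = 6.5
$$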

#### Example 2 (variance)

In [43]:
ex_two = [2, 3, 4, 3, 3]

In [44]:
sts.pvariance(ex_two) # population variance

0.4

In [45]:
sts.variance(ex_two) # sample variance

0.5

In [46]:
np.var(ex_two) # population variance

np.float64(0.4)

#### Example 3 (variance)

In [78]:
hurricanes = [18, 21, 13, 19, 24, 17, 14, 12, 15, 14]

In [86]:
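hurr_mean = np.mean(hurricanes)                # mean of the data
hurr_sub = np.subtract(hurricanes, hurr_mean)  # deviation of each value from the mean
np.sum(np.square(hurr_sub)) / len(hurricanes)  # population variance: divide by N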

np.float64(13.210000000000003)

In [81]:
sts.variance(hurricanes) # sample variance

14.677777777777777

In [82]:
sts.pvariance(hurricanes) # population variance

13.21

In [83]:
np.var(hurricanes) # population variance

np.float64(13.210000000000003)

In [84]:
stats.tvar(hurricanes) # sample variance

np.float64(14.67777777777778)

#### Example 4 (variance)

In [63]:
students = [18, 22, 25, 26, 15]

In [87]:
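stud_mean = np.mean(students)                # mean of the data
stud_sub = np.subtract(students, stud_mean)  # deviation of each value from the mean
np.sum(np.square(stud_sub)) / len(students)  # population variance: divide by N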

np.float64(17.36)
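
For comparison with the earlier examples, the same data can be run through the standard-library functions used above. This is a small sketch that treats the five values first as a full population and then as a sample; the variable names `pop_var` and `samp_var` are illustrative.

```python
import statistics as sts

students = [18, 22, 25, 26, 15]

pop_var = sts.pvariance(students)   # divides by N = 5
samp_var = sts.variance(students)   # divides by n - 1 = 4

print(pop_var)
print(samp_var)
```

The population result should agree with the manual calculation above. The sample variance comes out larger because the same sum of squared deviations (86.8) is divided by 4 instead of 5.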