
### Mean Vector
The first step in analyzing multivariate data is computing the **mean vector** and the **variance-covariance matrix**.

Consider the following matrix:

$$ X = \begin{bmatrix}
4.0 & 2.0 & 0.60 \\
4.2 & 2.1 & 0.59 \\
3.9 & 2.0 & 0.58 \\
4.3 & 2.1 & 0.62 \\
4.1 & 2.2 & 0.63 
\end{bmatrix} $$

These 5 observations of 3 variables can be summarized by their **mean vector** and **variance-covariance matrix**. Suppose, for example, that the three variables are, from left to right, the length, width, and height of a certain object. Each row vector $X_i$ is one observation of the three variables (or components).

The **mean vector** consists of the mean of each variable, where the mean is simply the sum of the data points divided by the number of data points:

$$ \bar{Y} = \sum_{i=1}^N \frac{Y_i}{N} $$

The mean is the value most commonly referred to as the **average**; the two terms are used as synonyms.
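As a minimal sketch, the mean vector of the example matrix can be computed column by column in plain Python. The helper name `mean_vector` and the variable `x_bar` are illustrative choices, not part of the original text.

```python
# Data matrix from the example: 5 observations (rows) of 3 variables (columns).
X = [
    [4.0, 2.0, 0.60],
    [4.2, 2.1, 0.59],
    [3.9, 2.0, 0.58],
    [4.3, 2.1, 0.62],
    [4.1, 2.2, 0.63],
]


def mean_vector(rows):
    """Return the list of column means of a row-major data matrix."""
    n = len(rows)
    # zip(*rows) transposes the matrix, yielding one tuple per variable.
    return [sum(column) / n for column in zip(*rows)]


x_bar = mean_vector(X)
print([round(m, 3) for m in x_bar])  # [4.1, 2.08, 0.604]
```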

### Variance
If you're analyzing data and notice a **high variance**, it means there's a large spread between the numbers, with many data points far away from the mean. This can signal diverse behavior among the subjects or elements you're studying.

The variance is defined as

$$ s^2 = \sum_{i=1}^N \frac{(Y_i - \bar{Y})^2}{N - 1} $$

where $\bar{Y}$ is the mean of the data.

The variance is roughly the **arithmetic average of the squared distance from the mean**; "roughly" because the sum is divided by $N - 1$ rather than $N$. Squaring the distance from the mean gives greater weight to values that are farther from the mean. For example, a point 2 units from the mean adds 4 to the above sum, while a point 10 units from the mean adds 100.
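The variance formula can be sketched directly in Python. The function name `sample_variance` is an illustrative assumption, and the input is the first column (length) of the example matrix.

```python
def sample_variance(values):
    """Sample variance with the N - 1 denominator used in the formula."""
    n = len(values)
    if n < 2:
        raise ValueError("at least two data points are required")
    mean = sum(values) / n
    # Each term is the squared distance of one data point from the mean.
    squared_deviations = [(v - mean) ** 2 for v in values]
    return sum(squared_deviations) / (n - 1)


length = [4.0, 4.2, 3.9, 4.3, 4.1]  # first column of the example matrix
print(round(sample_variance(length), 4))  # 0.025
```

The standard library's `statistics.variance` uses the same $N - 1$ denominator, so it could be used in place of the hand-written function.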


### Covariance Matrix

The variance-covariance matrix consists of:
1. the variances of the variables along the main diagonal
2. the covariances between each pair of variables in the other matrix positions

The formula for computing the covariance of two variables $X$ and $Y$ (here $X$ denotes a single variable, not the data matrix) is

$$ COV = \frac{ \sum_{i=1}^N (X_i - \bar{X})(Y_i - \bar{Y}) }{N - 1} $$

with $\bar{X}$ and $\bar{Y}$ denoting the means of features $X$ and $Y$, respectively.
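A small sketch of this formula for a single pair of variables, using the length and width columns of the example matrix. The name `sample_covariance` is an illustrative assumption.

```python
def sample_covariance(xs, ys):
    """Sample covariance of two equally long sequences (N - 1 denominator)."""
    if len(xs) != len(ys):
        raise ValueError("both variables need the same number of observations")
    n = len(xs)
    x_mean = sum(xs) / n
    y_mean = sum(ys) / n
    # Sum of the products of paired deviations from the respective means.
    cross_products = sum((x - x_mean) * (y - y_mean) for x, y in zip(xs, ys))
    return cross_products / (n - 1)


length = [4.0, 4.2, 3.9, 4.3, 4.1]  # first column of the example matrix
width = [2.0, 2.1, 2.0, 2.1, 2.2]   # second column of the example matrix

print(round(sample_covariance(length, width), 4))   # 0.0075
print(round(sample_covariance(length, length), 4))  # 0.025
```

The second call illustrates that the covariance of a variable with itself is its variance, which is why the variances sit on the main diagonal of the matrix.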

For the data matrix $X$ above, the mean vector and variance-covariance matrix are:

$$ \bar{X} = \begin{bmatrix} 4.10  & 2.08 & 0.604 \end{bmatrix} $$


$$ COV = \begin{bmatrix}
0.025 & 0.0075 & 0.00175 \\
0.0075 & 0.0070 & 0.00135 \\
0.00175 & 0.00135 & 0.00043
\end{bmatrix} $$

Thus:
* $0.025$ is the variance of the *length* variable
* $0.0075$ is the covariance between the *length* and the *width* variables
* $0.00175$ is the covariance between the *length* and the *height* variables
* $0.0070$ is the variance of the *width* variable
* $0.00135$ is the covariance between the *width* and *height* variables
* and $0.00043$ is the variance of the *height* variable.
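The entries listed above can be reproduced with a short sketch that builds the whole matrix from the centered columns. The function name `covariance_matrix` and the variable `S` are illustrative assumptions; in practice a numerical library would typically be used instead.

```python
X = [
    [4.0, 2.0, 0.60],
    [4.2, 2.1, 0.59],
    [3.9, 2.0, 0.58],
    [4.3, 2.1, 0.62],
    [4.1, 2.2, 0.63],
]


def covariance_matrix(rows):
    """Sample variance-covariance matrix of a row-major data matrix."""
    n = len(rows)
    columns = [list(column) for column in zip(*rows)]
    means = [sum(column) / n for column in columns]
    # Subtract each variable's mean from its column.
    centered = [[v - m for v in column] for column, m in zip(columns, means)]
    p = len(columns)
    # Entry (j, k) is the covariance between variable j and variable k.
    return [
        [
            sum(a * b for a, b in zip(centered[j], centered[k])) / (n - 1)
            for k in range(p)
        ]
        for j in range(p)
    ]


S = covariance_matrix(X)
for row in S:
    print([round(v, 5) for v in row])
# [0.025, 0.0075, 0.00175]
# [0.0075, 0.007, 0.00135]
# [0.00175, 0.00135, 0.00043]
```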


The mean vector is often referred to as the **centroid** and the variance-covariance matrix as the **dispersion** or dispersion matrix. Also, the terms variance-covariance matrix and covariance matrix are used interchangeably. 
