## Properties of the Expectation and Variance

Now that we are familiar with the concepts of a joint distribution, marginal distributions, variance and covariance, we can discuss some of their properties.

### Expectation

1. The expectation of a constant is the constant itself:

$$
E(c) = c
$$

2. The expectation of a sum of random variables is the sum of the expectations:

$$
E(X + Y) = E(X) + E(Y)
$$

To prove this, we can use the definition of the expectation. Let $f_{XY}(x, y)$ be the joint probability mass function of $X$ and $Y$. Then

$$
\begin{align*}
E(X + Y) & = \sum_x \sum_y (x + y) f_{XY}(x, y) \\
         & = \sum_x \sum_y x f_{XY}(x, y) + \sum_x \sum_y y f_{XY}(x, y) \\
         & = \sum_x x \sum_y f_{XY}(x, y) + \sum_y y \sum_x f_{XY}(x, y) \\
         & = \sum_x x f_X(x) + \sum_y y f_Y(y) \\
         & = E(X) + E(Y)
\end{align*}
$$

In the derivation above we use the fact that the marginal distribution of $X$ is

$$
f_X(x) = \sum_y f_{XY}(x, y)
$$

and the marginal distribution of $Y$ is

$$
f_Y(y) = \sum_x f_{XY}(x, y)
$$
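The derivation above can be checked numerically on a small example. The sketch below assumes a hypothetical joint probability mass function on $\{0, 1\} \times \{0, 1\}$ (the probabilities are made up for illustration), computes both marginals by summing out the other variable, and compares $E(X + Y)$ with $E(X) + E(Y)$ using exact fractions.

```python
from fractions import Fraction as F

# Hypothetical joint pmf f_XY(x, y); the probabilities are illustrative only.
joint = {
    (0, 0): F(1, 8), (0, 1): F(1, 4),
    (1, 0): F(3, 8), (1, 1): F(1, 4),
}

# Marginals: sum the joint pmf over the other variable.
f_x, f_y = {}, {}
for (x, y), p in joint.items():
    f_x[x] = f_x.get(x, 0) + p
    f_y[y] = f_y.get(y, 0) + p

# Expectations from the marginals, and E(X + Y) from the joint pmf.
e_x = sum(x * p for x, p in f_x.items())
e_y = sum(y * p for y, p in f_y.items())
e_sum = sum((x + y) * p for (x, y), p in joint.items())

print(e_x, e_y, e_sum)  # 5/8 1/2 9/8
```

The joint pmf in this example is not a product of its marginals, so the check also illustrates that the property does not require independence.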

3. The expectation of a constant times a random variable is the constant times the expectation of the random variable:

$$
E(cX) = cE(X)
$$

To prove this, we can use the definition of the expectation. Let $f_{X}(x)$ be the probability mass function of $X$. Then

$$
\begin{align*}
E(cX) & = \sum_x c x f_{X}(x) \\
       & = c \sum_x x f_{X}(x) \\
       & = c E(X)
\end{align*}
$$

Here we use the fact that the constant does not depend on the summation index $x$ and can therefore be taken out of the summation.

Now we can combine these two properties to show that the expectation of a linear combination of random variables is the linear combination of the expectations:

$$
E(aX + bY) = aE(X) + bE(Y)
$$

### Variance

1. The variance of a constant is zero:

$$
Var(c) = 0
$$

To see this, we can apply the definition of the variance to the constant $c$:

$$
\begin{align*}
Var(c) & = E((c - E(c))^2) \\
       & = E((c - c)^2) \\
       & = E(0) \\
       & = 0
\end{align*}
$$

Here we use the first property of the expectation (the expectation of a constant is the constant itself), both for $E(c) = c$ and for $E(0) = 0$.

2. The variance of a sum of random variables is the sum of the variances plus twice the covariance:

$$
Var(X + Y) = Var(X) + Var(Y) + 2Cov(X, Y)
$$

To prove this, we can use the definition of the variance and the properties of the expectation:

$$
\begin{align*}
Var(X + Y) & = E((X + Y - E(X + Y))^2) \\
           & = E((X + Y - E(X) - E(Y))^2) \\
           & = E((X - E(X))^2 + (Y - E(Y))^2 + 2(X - E(X))(Y - E(Y))) \\
           & = E((X - E(X))^2) + E((Y - E(Y))^2) + 2E((X - E(X))(Y - E(Y))) \\
           & = Var(X) + Var(Y) + 2Cov(X, Y)
\end{align*}
$$

Here we use the fact that the expectation of a sum is the sum of the expectations, first to expand $E(X + Y)$ and then to split the expectation of the expanded square into three terms. The last step follows from the definitions of the variance and the covariance.
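As a numerical illustration of this property, the sketch below computes each term from its definition for a hypothetical joint pmf (the probabilities and the helper `expect` are assumptions made for the example, not part of the text above).

```python
from fractions import Fraction as F

# Hypothetical joint pmf of (X, Y); the probabilities are illustrative only.
joint = {
    (0, 0): F(1, 8), (0, 1): F(1, 4),
    (1, 0): F(3, 8), (1, 1): F(1, 4),
}

def expect(g):
    """Return E[g(X, Y)] under the joint pmf."""
    return sum(g(x, y) * p for (x, y), p in joint.items())

mu_x = expect(lambda x, y: x)
mu_y = expect(lambda x, y: y)

# Variances and covariance, each computed from its definition.
var_x = expect(lambda x, y: (x - mu_x) ** 2)
var_y = expect(lambda x, y: (y - mu_y) ** 2)
cov_xy = expect(lambda x, y: (x - mu_x) * (y - mu_y))

# Variance of the sum, also computed from the definition.
mu_s = expect(lambda x, y: x + y)
var_sum = expect(lambda x, y: (x + y - mu_s) ** 2)

print(var_sum, var_x + var_y + 2 * cov_xy)  # 23/64 23/64
```

In this example the covariance is negative ($-1/16$), so the variance of the sum is smaller than the sum of the variances.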

3. The variance of a sum of random variables is the sum of the variances if the random variables are uncorrelated (i.e., the covariance is zero):

$$
Var(X + Y) = Var(X) + Var(Y)
$$

This follows directly from the second property of the variance. If the random variables are uncorrelated, the covariance is zero, and the variance of the sum is the sum of the variances.

4. The variance of a constant times a random variable is the constant squared times the variance of the random variable:

$$
Var(cX) = c^2 Var(X)
$$

This follows directly from the definition of the variance and the expectation of a constant times a random variable.

$$
\begin{align*}
Var(cX) & = E((cX - E(cX))^2) \\
        & = E((cX - cE(X))^2) \\
        & = E(c^2(X - E(X))^2) \\
        & = c^2 E((X - E(X))^2) \\
        & = c^2 Var(X)
\end{align*}
$$

The proof only uses the property of the expectation (the expectation of a constant times a random variable is the constant times the expectation of the random variable) and the definition of the variance.

5. The variance of a linear combination of random variables is the sum of the variances, each weighted by its squared coefficient, plus twice the product of the coefficients times the covariance:

$$
Var(aX + bY) = a^2 Var(X) + b^2 Var(Y) + 2ab Cov(X, Y)
$$

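This follows from properties 2 and 4, together with the fact that constants factor out of the covariance, $Cov(aX, bY) = ab Cov(X, Y)$:

$$
\begin{align*}
Var(aX + bY) & = Var(aX) + Var(bY) + 2Cov(aX, bY) \\
             & = a^2 Var(X) + b^2 Var(Y) + 2ab Cov(X, Y)
\end{align*}
$$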
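The formula for the variance of a linear combination can be checked against a direct computation. The sketch below assumes a hypothetical joint pmf and two illustrative helper functions, `var_direct` and `var_formula`, which are names introduced only for this example.

```python
from fractions import Fraction as F

# Hypothetical joint pmf of (X, Y); the probabilities are illustrative only.
joint = {
    (0, 0): F(1, 8), (0, 1): F(1, 4),
    (1, 0): F(3, 8), (1, 1): F(1, 4),
}

def expect(g):
    """Return E[g(X, Y)] under the joint pmf."""
    return sum(g(x, y) * p for (x, y), p in joint.items())

mu_x = expect(lambda x, y: x)
mu_y = expect(lambda x, y: y)
var_x = expect(lambda x, y: (x - mu_x) ** 2)
var_y = expect(lambda x, y: (y - mu_y) ** 2)
cov_xy = expect(lambda x, y: (x - mu_x) * (y - mu_y))

def var_direct(a, b):
    """Var(aX + bY) computed from the definition of the variance."""
    mu = expect(lambda x, y: a * x + b * y)
    return expect(lambda x, y: (a * x + b * y - mu) ** 2)

def var_formula(a, b):
    """Var(aX + bY) computed as a^2 Var(X) + b^2 Var(Y) + 2ab Cov(X, Y)."""
    return a**2 * var_x + b**2 * var_y + 2 * a * b * cov_xy

print(var_direct(2, -3), var_formula(2, -3))  # 63/16 63/16
```

Setting `a = b = 1` recovers property 2, and setting `b = 0` recovers property 4.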

6. The variance can be rewritten as:

$$
Var(X) = E(X^2) - E(X)^2
$$

This follows directly from the definition of the variance, because $E(X)$ is a constant and can be taken out of the expectation:

$$
\begin{align*}
Var(X) & = E((X - E(X))^2) \\
       & = E(X^2 - 2XE(X) + E(X)^2) \\
       & = E(X^2) - 2E(X)E(X) + E(X)^2 \\
       & = E(X^2) - E(X)^2
\end{align*}
$$

7. The covariance can be rewritten as:

$$
Cov(X, Y) = E(XY) - E(X)E(Y)
$$

This follows directly from the definition of the covariance.

$$
\begin{align*}
Cov(X, Y) & = E((X - E(X))(Y - E(Y))) \\
          & = E(XY - XE(Y) - YE(X) + E(X)E(Y)) \\
          & = E(XY) - E(X)E(Y) - E(Y)E(X) + E(X)E(Y) \\
          & = E(XY) - E(X)E(Y)
\end{align*}
$$
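The two rewritten forms (properties 6 and 7) are often more convenient for computation than the definitions. As a minimal sketch, assuming a hypothetical joint pmf, the code below evaluates both the definition and the rewritten form of each quantity and confirms that they agree.

```python
from fractions import Fraction as F

# Hypothetical joint pmf of (X, Y); the probabilities are illustrative only.
joint = {
    (0, 0): F(1, 8), (0, 1): F(1, 4),
    (1, 0): F(3, 8), (1, 1): F(1, 4),
}

def expect(g):
    """Return E[g(X, Y)] under the joint pmf."""
    return sum(g(x, y) * p for (x, y), p in joint.items())

mu_x = expect(lambda x, y: x)
mu_y = expect(lambda x, y: y)

# Variance of X: definition vs. E(X^2) - E(X)^2.
var_def = expect(lambda x, y: (x - mu_x) ** 2)
var_short = expect(lambda x, y: x**2) - mu_x**2

# Covariance: definition vs. E(XY) - E(X)E(Y).
cov_def = expect(lambda x, y: (x - mu_x) * (y - mu_y))
cov_short = expect(lambda x, y: x * y) - mu_x * mu_y

print(var_def, var_short)  # 15/64 15/64
print(cov_def, cov_short)  # -1/16 -1/16
```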

## Independence and Covariance

If two random variables are independent, their covariance is zero. This follows directly from the definition of independence and the definition of the covariance. By definition, independence means that the joint probability mass function can be written as the product of the marginal probability mass functions:


$$
P(X = x, Y = y) = P(X = x)P(Y = y)
$$

This means that the expected value of the product of the random variables is the product of the expected values:

$$
\begin{align*}
E(XY) & = \sum_x \sum_y xy P(X = x, Y = y) \\
       & = \sum_x \sum_y xy P(X = x)P(Y = y) \\
       & = \sum_x x P(X = x) \sum_y y P(Y = y) \\
       & = E(X)E(Y)
\end{align*}
$$

As shown above (property 7), the covariance can be written as:

$$
Cov(X, Y) = E(XY) - E(X)E(Y)
$$

Substitute the expected value of the product of the random variables:

$$
Cov(X, Y) = E(X)E(Y) - E(X)E(Y) = 0
$$
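The argument above can be illustrated by constructing an independent pair explicitly. The sketch below assumes two hypothetical marginal pmfs, builds the joint pmf as their product, and checks that $E(XY) = E(X)E(Y)$, so that the covariance is zero.

```python
from fractions import Fraction as F

# Hypothetical marginal pmfs; the values and probabilities are illustrative only.
f_x = {0: F(1, 4), 1: F(1, 2), 2: F(1, 4)}
f_y = {-1: F(1, 3), 1: F(2, 3)}

# Independence: the joint pmf is the product of the marginals.
joint = {(x, y): px * py for x, px in f_x.items() for y, py in f_y.items()}

e_x = sum(x * p for x, p in f_x.items())
e_y = sum(y * p for y, p in f_y.items())
e_xy = sum(x * y * p for (x, y), p in joint.items())

cov_xy = e_xy - e_x * e_y

print(e_xy, e_x * e_y, cov_xy)  # 1/3 1/3 0
```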

