# Learning Goals 
1. Be able to compute the variance and standard deviation of a random variable.
2. Understand that standard deviation is a measure of scale or spread.
3. Be able to compute variance using the properties of scaling and linearity. 

## Spread 
The expected value (mean) of a random variable is a measure of location or central tendency.
If you had to summarize a random variable with a single number, the mean would be a good
choice. Still, the mean leaves out a good deal of information. For example, the random
variables $X$ and $Y$ below both have mean 0, but their probability mass is spread out about
the mean quite differently. 

$$
\begin{array}{c|ccccc}
\text { values } X & -2 & -1 & 0 & 1 & 2 \\
\hline \operatorname{pmf} p(x) & 1 / 10 & 2 / 10 & 4 / 10 & 2 / 10 & 1 / 10
\end{array}
$$

$$
\begin{array}{c|cc}
\text { values } Y & -3 & 3 \\
\hline \operatorname{pmf} p(y) & 1 / 2 & 1 / 2
\end{array}
$$
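As a quick numerical check of the claim above, the following minimal sketch stores each pmf as a dictionary and computes its mean. The helper names `mean` and `prob_within` are illustrative choices, not part of the notes. The second helper gives a rough first look at spread by asking how much probability lies within distance 1 of the common mean 0.

```python
from fractions import Fraction as F

# pmfs copied from the two tables above, stored as {value: probability}
pmf_x = {-2: F(1, 10), -1: F(2, 10), 0: F(4, 10), 1: F(2, 10), 2: F(1, 10)}
pmf_y = {-3: F(1, 2), 3: F(1, 2)}


def mean(pmf):
    """Expected value: sum of value * probability."""
    return sum(x * p for x, p in pmf.items())


def prob_within(pmf, r):
    """Probability of landing within distance r of 0 (the common mean here)."""
    return sum(p for x, p in pmf.items() if abs(x) <= r)


print("means:", mean(pmf_x), mean(pmf_y))
print("P(|X| <= 1):", prob_within(pmf_x, 1))
print("P(|Y| <= 1):", prob_within(pmf_y, 1))
```

Both means come out equal, yet most of the mass of $X$ sits close to 0 while none of the mass of $Y$ does. A single-number measure of spread should capture exactly this difference.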


## Variance and standard deviation 
Taking the mean as the center of a random variable's probability distribution, the variance
is a measure of how much the probability mass is spread out around this center. We'll start
with the formal definition of variance and then unpack its meaning. 

\begin{definition}
If $X$ is a random variable with mean $E(X)=\mu,$ then the variance of $X$ is defined by
$$
\operatorname{Var}(X)=E\left((X-\mu)^{2}\right)
$$
\end{definition}


The **standard deviation** $\sigma$ of $X$ is defined by
$$
\sigma=\sqrt{\operatorname{Var}(X)}
$$

\begin{example}
Compute the mean, variance and standard deviation of the random variable
$X$ with the following table of values and probabilities.
  
$$
\begin{array}{c|ccc}
\text { value } x & 1 & 3 & 5 \\
\hline \operatorname{pmf} p(x) & 1 / 4 & 1 / 4 & 1 / 2
\end{array}
$$

\end{example}


**answer:** First we compute $E(X)=1 \cdot \frac{1}{4}+3 \cdot \frac{1}{4}+5 \cdot \frac{1}{2}=7 / 2$. Then we extend the table to include $(X-7 / 2)^{2}$.
$$
\begin{array}{c|ccc}
\text { value } x & 1 & 3 & 5 \\
\hline p(x) & 1 / 4 & 1 / 4 & 1 / 2 \\
\hline(x-7 / 2)^{2} & 25 / 4 & 1 / 4 & 9 / 4
\end{array}
$$
Now the computation of the variance is similar to that of expectation:
$$
\operatorname{Var}(X)=\frac{25}{4} \cdot \frac{1}{4}+\frac{1}{4} \cdot \frac{1}{4}+\frac{9}{4} \cdot \frac{1}{2}=\frac{11}{4}
$$
Taking the square root, we have the standard deviation $\sigma=\sqrt{11 / 4} \approx 1.66$.
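The same computation can be sketched in a few lines of Python. This is only an illustration of the definition; the function names `mean` and `variance` are our own, and exact fractions are used so the results can be compared directly with the hand computation.

```python
from fractions import Fraction as F
from math import sqrt

# pmf from the example, stored as {value: probability}
pmf = {1: F(1, 4), 3: F(1, 4), 5: F(1, 2)}


def mean(pmf):
    """E(X) = sum of x * p(x)."""
    return sum(x * p for x, p in pmf.items())


def variance(pmf):
    """Var(X) = E((X - mu)^2) = sum of (x - mu)^2 * p(x)."""
    mu = mean(pmf)
    return sum((x - mu) ** 2 * p for x, p in pmf.items())


mu = mean(pmf)
var = variance(pmf)
sigma = sqrt(var)  # standard deviation, converted to a float

print(mu, var, sigma)
```

Here `mu` and `var` are exact fractions, and `sigma` is the floating-point square root of `var`.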

## The variance of a Bernoulli $(p)$ random variable
Bernoulli random variables are fundamental, so we should know their variance.
If $X \sim$ Bernoulli $(p)$ then
$$
\operatorname{Var}(X)=p(1-p)
$$
Proof: We know that $E(X)=p .$ We compute $\operatorname{Var}(X)$ using a table.
$$
\begin{array}{c|cc}
\text { values } X & 0 & 1 \\
\hline \operatorname{pmf} p(x) & 1-p & p \\
\hline(X-\mu)^{2} & (0-p)^{2} & (1-p)^{2}
\end{array}
$$
$$
\operatorname{Var}(X)=(1-p) p^{2}+p(1-p)^{2}=(1-p) p(p+1-p)=(1-p) p
$$
As with all things Bernoulli, you should remember this formula.
Think: For what value of $p$ does Bernoulli $(p)$ have the highest variance? Try to answer this by plotting the PMF for various $p$.
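If you prefer to explore the question numerically rather than graphically, here is a minimal sketch that tabulates the variance over a grid of $p$ values. The grid spacing of 0.1 and the names `bernoulli_variance`, `grid` and `best_p` are arbitrary choices made for illustration. The variance is computed straight from the table in the proof, not from the closed formula.

```python
def bernoulli_variance(p):
    """Variance from the definition: values 0 and 1, mean mu = p."""
    mu = p
    return (1 - p) * (0 - mu) ** 2 + p * (1 - mu) ** 2


# Grid of p values 0.0, 0.1, ..., 1.0 (an arbitrary illustrative choice)
grid = [k / 10 for k in range(11)]

for p in grid:
    print(f"p = {p:.1f}   Var = {bernoulli_variance(p):.2f}")

# Grid point with the largest variance
best_p = max(grid, key=bernoulli_variance)
print("largest variance on the grid at p =", best_p)
```

Try to predict the shape of the table before running it, then compare with what the pmf plots suggest.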

## Independence

Informally, two random variables $X$ and $Y$ are independent if knowing the value of $X$
gives you no information about the value of $Y$. The formal definition says that joint
probabilities factor.

\begin{definition}
The discrete random variables $X$ and $Y$ are independent if
$$
P(X=a, Y=b)=P(X=a) P(Y=b)
$$
for any values $a, b.$ That is, the probabilities multiply.
\end{definition}
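The definition can be tested mechanically on a small joint pmf. The sketch below is one possible way to do it, assuming the joint pmf is given as a dictionary keyed by pairs `(a, b)`. The helper names and the two sample joint pmfs are made up for illustration.

```python
from fractions import Fraction as F
from itertools import product


def marginals(joint):
    """Marginal pmfs of X and Y from a joint pmf {(a, b): probability}."""
    px, py = {}, {}
    for (a, b), p in joint.items():
        px[a] = px.get(a, 0) + p
        py[b] = py.get(b, 0) + p
    return px, py


def is_independent(joint):
    """True if P(X=a, Y=b) = P(X=a) P(Y=b) for every pair (a, b)."""
    px, py = marginals(joint)
    return all(joint.get((a, b), 0) == px[a] * py[b]
               for a, b in product(px, py))


# Two separate fair coin flips: every pair has probability 1/4
two_flips = {(a, b): F(1, 4) for a, b in product([0, 1], repeat=2)}

# Y is a copy of X: only the pairs (0, 0) and (1, 1) can occur
copied_flip = {(0, 0): F(1, 2), (1, 1): F(1, 2)}

print(is_independent(two_flips))
print(is_independent(copied_flip))
```

In the second joint pmf, knowing $X$ determines $Y$ completely, so the product rule fails, for example at the pair $(0, 1)$.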

## Properties of variance 
The three most useful properties for computing variance are:

1. If $X$ and $Y$ are **independent** then $\operatorname{Var}(X+Y)=\operatorname{Var}(X)+\operatorname{Var}(Y)$.
2. For constants $a$ and $b$, $\operatorname{Var}(a X+b)=a^{2} \operatorname{Var}(X)$.
3. $\operatorname{Var}(X)=E\left(X^{2}\right)-E(X)^{2}$.

\begin{example}
Suppose $X$ and $Y$ are independent and $\operatorname{Var}(X)=3$ and $\operatorname{Var}(Y)=5 .$ Find:
(i) $\operatorname{Var}(X+Y)$
(ii) $\operatorname{Var}(3 X+4)$
(iii) $\operatorname{Var}(X+X)$
(iv) $\operatorname{Var}(X+3 Y)$
\end{example}



**answer:** To compute these variances we make use of Properties 1 and 2.

(i) Since $X$ and $Y$ are independent, $\operatorname{Var}(X+Y)=\operatorname{Var}(X)+\operatorname{Var}(Y)=8$.

(ii) Using Property 2, $\operatorname{Var}(3 X+4)=9 \cdot \operatorname{Var}(X)=27$.

(iii) Don't be fooled! Property 1 fails since $X$ is certainly not independent of itself. We can use Property 2: $\operatorname{Var}(X+X)=\operatorname{Var}(2 X)=4 \cdot \operatorname{Var}(X)=12$. (Note: if we mistakenly used Property 1, we would get the wrong answer of 6.)

(iv) We use both Properties 1 and 2. Since $X$ and $Y$ are independent, so are $X$ and $3 Y$:
$$
\operatorname{Var}(X+3 Y)=\operatorname{Var}(X)+\operatorname{Var}(3 Y)=3+9 \cdot 5=48
$$
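The identities used in this example can be checked by brute force on concrete distributions. The sketch below does not use variables with variances 3 and 5. As an assumption made purely for illustration, it takes two small pmfs from earlier in these notes, treats them as independent, and verifies each identity exactly. The helper `pmf_of` is our own construction.

```python
from fractions import Fraction as F
from itertools import product

# Illustrative pmfs (not the X and Y of the example, whose pmfs are not given)
pmf_x = {1: F(1, 4), 3: F(1, 4), 5: F(1, 2)}
pmf_y = {-3: F(1, 2), 3: F(1, 2)}


def variance(pmf):
    mu = sum(v * p for v, p in pmf.items())
    return sum((v - mu) ** 2 * p for v, p in pmf.items())


def pmf_of(g, pmf_x, pmf_y):
    """pmf of g(X, Y) for independent X, Y (joint = product of marginals)."""
    out = {}
    for (x, px), (y, py) in product(pmf_x.items(), pmf_y.items()):
        v = g(x, y)
        out[v] = out.get(v, 0) + px * py
    return out


vx, vy = variance(pmf_x), variance(pmf_y)

var_sum = variance(pmf_of(lambda x, y: x + y, pmf_x, pmf_y))        # (i)
var_affine = variance(pmf_of(lambda x, y: 3 * x + 4, pmf_x, pmf_y))  # (ii)
var_double = variance(pmf_of(lambda x, y: x + x, pmf_x, pmf_y))      # (iii)
var_mixed = variance(pmf_of(lambda x, y: x + 3 * y, pmf_x, pmf_y))   # (iv)

print(var_sum == vx + vy)
print(var_affine == 9 * vx)
print(var_double == 4 * vx, var_double == vx + vx)
print(var_mixed == vx + 9 * vy)
```

The third line is the instructive one: $\operatorname{Var}(X+X)$ agrees with $4 \operatorname{Var}(X)$ and not with $\operatorname{Var}(X)+\operatorname{Var}(X)$, matching part (iii).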

\begin{example}
Use Property 3 to compute the variance of $X \sim \text{Bernoulli}(p)$.
\end{example}

**answer:** Since $X$ takes only the values 0 and 1, we have $X^{2}=X$ and therefore $E\left(X^{2}\right)=E(X)=p.$ So Property 3 gives
$$
\operatorname{Var}(X)=E\left(X^{2}\right)-E(X)^{2}=p-p^{2}=p(1-p)
$$
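Property 3 and the definition should always agree. The following sketch compares the two on a Bernoulli pmf and on the three-value table from the earlier worked example. The value $p = 1/3$ is an arbitrary choice, and the function names are hypothetical.

```python
from fractions import Fraction as F


def expect(pmf, h=lambda x: x):
    """E(h(X)) = sum of h(x) * p(x) over the pmf."""
    return sum(h(x) * p for x, p in pmf.items())


def var_by_definition(pmf):
    mu = expect(pmf)
    return expect(pmf, lambda x: (x - mu) ** 2)


def var_by_property_3(pmf):
    return expect(pmf, lambda x: x ** 2) - expect(pmf) ** 2


p = F(1, 3)  # arbitrary illustrative value
bernoulli = {0: 1 - p, 1: p}
table = {1: F(1, 4), 3: F(1, 4), 5: F(1, 2)}

for pmf in (bernoulli, table):
    print(var_by_definition(pmf), var_by_property_3(pmf))
```

In hand computations Property 3 is often quicker because it avoids subtracting $\mu$ from every value before squaring.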

## Variance of binomial $(n, p)$

Suppose $X \sim$ binomial $(n, p).$ Since $X$ is the sum of $n$ independent Bernoulli $(p)$ variables and each Bernoulli variable has variance $p(1-p)$, Property 1 gives
$$
X \sim \operatorname{binomial}(n, p) \Rightarrow \operatorname{Var}(X)=n p(1-p)
$$
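The formula can also be confirmed directly from the binomial pmf, without appealing to the sum of Bernoulli variables. This is a minimal sketch, with $n = 10$ and $p = 3/10$ chosen arbitrarily for illustration.

```python
from fractions import Fraction as F
from math import comb


def binomial_pmf(n, p):
    """pmf of binomial(n, p): p(k) = C(n, k) p^k (1 - p)^(n - k)."""
    return {k: comb(n, k) * p ** k * (1 - p) ** (n - k) for k in range(n + 1)}


def mean(pmf):
    return sum(k * q for k, q in pmf.items())


def variance(pmf):
    mu = mean(pmf)
    return sum((k - mu) ** 2 * q for k, q in pmf.items())


n, p = 10, F(3, 10)  # arbitrary illustrative parameters
pmf = binomial_pmf(n, p)

print("total probability:", sum(pmf.values()))
print("mean:", mean(pmf), "expected:", n * p)
print("variance:", variance(pmf), "expected:", n * p * (1 - p))
```

Because the arithmetic uses exact fractions, the computed variance matches $n p(1-p)$ exactly and not merely up to rounding.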

## Tables of Distributions and Properties 

$$
\begin{array}{|l|c|c|c|c|}
\hline \text { Distribution } & \text { range } X & \operatorname{pmf} p(x) & \operatorname{mean} E(X) & \text { variance } \operatorname{Var}(X) \\
\hline \text { Bernoulli }(p) & 0,1 & p(0)=1-p, \quad p(1)=p & p & p(1-p) \\
\hline \text { Binomial }(n, p) & 0,1, \ldots, n & p(k)=\left(\begin{array}{c}
n \\
k
\end{array}\right) p^{k}(1-p)^{n-k} & n p & n p(1-p) \\
\hline \text { Uniform }(n) & 1,2, \ldots, n & p(k)=\frac{1}{n} & \frac{n+1}{2} & \frac{n^{2}-1}{12} \\
\hline \text { Geometric }(p) & 0,1,2, \ldots & p(k)=p(1-p)^{k} & \frac{1-p}{p} & \frac{1-p}{p^{2}} \\
\hline
\end{array}
$$
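The uniform and geometric rows of the table can be spot-checked numerically. In this sketch the uniform case is exact. The geometric case has infinite range, so the sum is truncated after `K` terms, which is an approximation we introduce here. The parameters $n = 6$, $p = 0.3$ and `K = 500` are arbitrary.

```python
from fractions import Fraction as F


def mean_var(pmf):
    """Return (E(X), Var(X)) for a pmf stored as {value: probability}."""
    mu = sum(k * q for k, q in pmf.items())
    var = sum((k - mu) ** 2 * q for k, q in pmf.items())
    return mu, var


# Uniform(n) on 1, 2, ..., n: exact arithmetic
n = 6
uniform = {k: F(1, n) for k in range(1, n + 1)}
u_mean, u_var = mean_var(uniform)
print(u_mean, F(n + 1, 2))
print(u_var, F(n * n - 1, 12))

# Geometric(p) on 0, 1, 2, ...: truncated after K terms (approximation)
p = 0.3
K = 500
geometric = {k: p * (1 - p) ** k for k in range(K)}
g_mean, g_var = mean_var(geometric)
print(g_mean, (1 - p) / p)
print(g_var, (1 - p) / p ** 2)
```

The neglected geometric tail has total probability $(1-p)^{K}$, which is far below floating-point precision for these parameters.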

Let $X$ be a discrete random variable with range $x_{1}, x_{2}, \ldots$ and pmf $p\left(x_{j}\right)$.

$$
\begin{array}{|l|l|l|}
\hline & \text { Expected value } & \text { Variance } \\
\hline \text { Synonyms: } & \text { mean, average } & \\
\hline \text { Notation: } & E(X), \mu & \operatorname{Var}(X), \sigma^{2} \\
\hline \text { Definition: } & E(X)=\sum_{j} p\left(x_{j}\right) x_{j} & E\left((X-\mu)^{2}\right)=\sum_{j} p\left(x_{j}\right)\left(x_{j}-\mu\right)^{2} \\
\hline \text { Scale and shift: } & E(a X+b)=a E(X)+b & \operatorname{Var}(a X+b)=a^{2} \operatorname{Var}(X) \\
\hline \text { Linearity: } & (\text { for any } X, Y) & (\text { for } X, Y \text { independent }) \\
& E(X+Y)=E(X)+E(Y) & \operatorname{Var}(X+Y)=\operatorname{Var}(X)+\operatorname{Var}(Y) \\
\hline \text { Functions of } X: & E(h(X))=\sum_{j} p\left(x_{j}\right) h\left(x_{j}\right) & \\
\hline \text { Alternative formula: } & & \operatorname{Var}(X)=E\left(X^{2}\right)-E(X)^{2}=E\left(X^{2}\right)-\mu^{2} \\
\hline
\end{array}
$$