```{index} axioms of probability; corollaries
```

# Corollaries to the Axioms of Probability

Corollaries are results that can be proven from more fundamental theorems or properties. In this case, we are interested in what additional properties or relations we can develop for our probability measure, based on the Axioms of Probability. 

Let $A \in \mathcal{F}$ and $B \in \mathcal{F}$.  Then the following
  properties of $P$ can be derived from the axioms and the
  mathematical structure of $\mathcal{F}$:


**Corollary 1.** Let $\overline{A}$ denote the complement of $A$; i.e., $\overline{A}$ contains every outcome in $S$ that is  not in $A$. Then

$$
P\left( \overline{A} \right) = 1 - P\left(A \right).
$$

*Proof:*

The proof uses Axioms II and III, as well as properties of sets. First, note that $A \cup \overline{A} = S$ and that $A$ and $\overline{A}$ are disjoint by definition. Then by Axioms II and III,
\begin{align*}
P(S) &= 1\\
P(A \cup \overline{A}) &=1 \\
P(A) + P(\overline{A}) &= 1 \\
P(\overline{A}) &= 1 - P(A).
\end{align*}




**Example**

A fair six-sided die is rolled two times and the top faces are recorded. What is the probability that neither roll is less than 3?

Let $E_i$ be the event that the outcome of roll $i$ is less than 3. Then we are asked to find $P\left(\overline{E_1} \cap \overline{E_2} \right)$. 

By De Morgan's laws,

$$
P\left(\overline{E_1} \cap \overline{E_2} \right) = P\left(\overline{E_1 \cup E_2} \right).
$$

We can apply Corollary 1 to get

$$
P\left(\overline{E_1 \cup E_2} \right) = 1 - P\left(E_1 \cup E_2 \right).
$$

But we already found the probability on the right-hand side to be 5/9 in {doc}`Axiomatic Probability<axiomatic-prob>`. Thus, the probability we are looking for is $1-5/9=4/9$.
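
As a quick check of this result, the following minimal sketch enumerates the 36 outcomes of the two rolls and counts directly, assuming all outcomes are equally likely. The variable names (`S`, `union`, `neither`) are chosen here for illustration only:

```python
from fractions import Fraction
from itertools import product

# Sample space: all ordered pairs of faces from two rolls of a six-sided die
S = list(product(range(1, 7), repeat=2))

# Outcomes in E1 ∪ E2: at least one roll is less than 3
union = [s for s in S if s[0] < 3 or s[1] < 3]

# Outcomes in the complement: neither roll is less than 3
neither = [s for s in S if s[0] >= 3 and s[1] >= 3]

# Assumes equally likely outcomes, so probability = count / total
p_union = Fraction(len(union), len(S))
p_neither = Fraction(len(neither), len(S))

print(p_union, p_neither, 1 - p_union)
```

Counting the complement directly and applying Corollary 1 to the union should give the same value.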


  
**Corollary 2.** $P(A)\le 1$ 

As previously noted, this restriction is not included in the axioms. 

*Proof:* 

By Corollary 1, we have

$$
P(A) = 1 - P(\overline{A}).
$$

By Axiom I, $P(\overline{A}) \ge 0$, so it must be that $P(A) \le 1$.



**Corollary 3.** $P(\emptyset)=0$

*Proof:*

By Corollary 1, we have

$$
P(\emptyset) = 1 - P(\overline{\emptyset}).
$$

But $\overline{\emptyset} =S$. Thus, 
\begin{align*}
P(\emptyset) &= 1 - P(S) \\
&= 1-1 \\
&=0.
\end{align*}




**Corollary 4.** If $A_1, A_2, \ldots, A_n$ are pairwise mutually exclusive, then

$$
P\left(\bigcup_{k=1}^{n} A_k \right)= \sum_{k=1}^{n} P(A_k).
$$
      
The proof is by induction and is omitted.
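
For readers who want the idea, the induction step can be sketched as follows. This sketch assumes that Axiom III is stated for two mutually exclusive events, which serves as the base case $n=2$, and that the result already holds for $n-1$ events. Because $A_n$ is disjoint from each of $A_1, \ldots, A_{n-1}$, it is also disjoint from their union, so
\begin{align*}
P\left(\bigcup_{k=1}^{n} A_k \right) &= P\left( \left[\bigcup_{k=1}^{n-1} A_k \right] \cup A_n \right) \\
&= P\left(\bigcup_{k=1}^{n-1} A_k \right) + P(A_n) \\
&= \sum_{k=1}^{n-1} P(A_k) + P(A_n) \\
&= \sum_{k=1}^{n} P(A_k).
\end{align*}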

```{index} union; probabilities of
```


**Corollary 5.** $P(A \cup B) = P(A) +P(B) - P(A \cap B)$

*Proof:*

The proof requires a bit of work with sets and applying Axiom III. It is based on the following Venn diagram for the event $A \cup B$:

<!--![Venn Diagram with Disjoint Parts Labeled](prob-unions.png) -->

<img src="prob-unions.png" alt="Venn Diagram with Disjoint Regions Labeled" width="400px" style="margin-left:auto;margin-right:auto;">

Note that the regions  $A \cap \overline{B}$, $A \cap B$, and $B \cap \overline{A}$ are disjoint.

In addition, note that we can write 
\begin{align*}
A \cup B &= \biggl(A \cap \overline{B} \biggr) \cup  \biggl ( A \cap B \biggr)  \cup \biggl( B \cap \overline{A} \biggr) \\
A &=\biggl(A \cap \overline{B} \biggr) \cup  \biggl ( A \cap B \biggr), \mbox{ and} \\
 B &=    \biggl( B \cap \overline{A} \biggr) \cup \biggl ( A \cap B \biggr).  \\
\end{align*}

Applying Axiom III to each of these identities yields:
\begin{align*}
P(A \cup B) &= P\biggl(A \cap \overline{B} \biggr) + P\biggl ( A \cap B \biggr)+ P \biggl( B \cap \overline{A} \biggr) \\
P(A) &= P\biggl(A \cap \overline{B} \biggr) +  P\biggl ( A \cap B \biggr), \mbox{ and} \\
P(B) &=    P\biggl( B \cap \overline{A} \biggr) + P \biggl ( A \cap B \biggr).  \\
\end{align*}

We can write the last two equations as
\begin{align*}
P\biggl(A \cap \overline{B} \biggr) = P(A) -  P\biggl ( A \cap B \biggr), \mbox{ and} \\
P\biggl( B \cap \overline{A} \biggr) = P(B) - P \biggl ( A \cap B \biggr).  
\end{align*}

Substituting into the remaining equation yields:
\begin{align*}
P(A \cup B) &= \biggl[ P(A) -  P\biggl ( A \cap B \biggr) \biggr] + P\biggl ( A \cap B \biggr)+ \biggl[P(B) - P \biggl ( A \cap B \biggr) \biggr],
\end{align*}
which simplifies to the desired result.



**Example**

In the United States, most high school students applying for college take a standardized achievement test called the SAT, which is administered by the College Board. The main test consists of a Verbal part and a Math part, each of which is scored on a scale from 200 to 800. The following probability information is inferred from data found online[^sat_data].
* The probability of getting over a 600 on the Verbal part is 0.24.
* The probability of getting over a 600 on the Math part is 0.25.
* The probability of getting over a 600 on both the Math and Verbal parts is 0.16.

**(a)** If a college requires that a student get a score over 600 on at least one of the Math and Verbal parts of the SAT to be eligible for admission, what is the probability that a randomly chosen student will meet the college's SAT criterion for admission?

Let 

* $V$ = the event that the Verbal score is over 600
* $M$ = the event that the Math score is over 600

We are looking for $P\left(V \cup M \right)$. Applying Corollary 5 gives
\begin{align*}
P\left( V \cup M \right) &= P(V) + P(M) - P(V \cap M) \\
 &= 0.24 + 0.25 -   0.16 \\
 &= 0.33
\end{align*}

About 1/3 of students who take the SAT will meet the college's criterion.

[^sat_data]: Data on correlation is from https://eportfolios.macaulay.cuny.edu/liufall2013/files/2013/10/New_Perspectives.pdf . Data on mean and variance is from https://blog.prepscholar.com/sat-standard-deviation .


**(b)** What is the probability that a randomly chosen student does not make over a 600 on either test?

This probability can be written as $P( \overline{V} \cap \overline{M})$.  It is the complement of the event in part (a):
\begin{align*}
P( \overline{V} \cap \overline{M}) &= P\left( \overline{V \cup M} \right) \\
&= 1- P\left( V \cup M \right) \\
&= 0.67
\end{align*}

**(c)** What is the probability that a randomly chosen student makes over a 600 on the Math part but a 600 or less on the Verbal part?

This probability can be written as $P(M \cap \overline{V})$. I will provide a purely mathematical answer, but it is helpful to draw a Venn diagram to visualize this scenario. Note that
\begin{align*}
P(M) &= P \left(M \cap V \right) + P\left(M \cap \overline{V} \right)\\
P\left(M \cap \overline{V} \right) &= P(M) - P \left(M \cap V \right) \\
&= 0.25 - 0.16 \\
&= 0.09
\end{align*}
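
The arithmetic in parts (a) through (c) can be reproduced with a few lines of Python. This is only a sketch: the variable names are illustrative, and exact fractions are used so that no floating-point rounding enters the calculation.

```python
from fractions import Fraction

# Probabilities given in the example
p_v = Fraction(24, 100)    # P(V): Verbal score over 600
p_m = Fraction(25, 100)    # P(M): Math score over 600
p_vm = Fraction(16, 100)   # P(V ∩ M): both scores over 600

# (a) Corollary 5: P(V ∪ M) = P(V) + P(M) - P(V ∩ M)
p_a = p_v + p_m - p_vm

# (b) Corollary 1 applied to the union: P(not V and not M) = 1 - P(V ∪ M)
p_b = 1 - p_a

# (c) P(M ∩ not V) = P(M) - P(M ∩ V)
p_c = p_m - p_vm

print(float(p_a), float(p_b), float(p_c))
```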


**Example**

**(Take 2)** A fair six-sided die is rolled twice.  What is the probability that either of the rolls is a value less than 3? 

As before, let $E_i$ be the event that the top face on roll $i$ is less than 3, for $i=1,2$.
   
Referring back to {doc}`axiomatic-prob`, note that it is much easier to count the outcomes in $E_1 \cap E_2$ than to count the outcomes in $E_1 \cup E_2$. (An intersection is never bigger than the smallest of its constituent sets, whereas a union is never smaller than the largest of its constituent sets.)

Here is the code that displays the intersection:

```python
from termcolor import colored

print("Outcomes in both E1 and E2 are in red:")
for j in range(1, 7):          # face on roll 1
    for k in range(1, 7):      # face on roll 2
        outcome = f"({j}, {k})   "
        if j < 3 and k < 3:    # both rolls are less than 3
            print(colored(outcome, "red"), end="")
        else:
            print(outcome, end="")
    print()
```

```text
Outcomes in both E1 and E2 are in red:
(1, 1)   (1, 2)   (1, 3)   (1, 4)   (1, 5)   (1, 6)
(2, 1)   (2, 2)   (2, 3)   (2, 4)   (2, 5)   (2, 6)
(3, 1)   (3, 2)   (3, 3)   (3, 4)   (3, 5)   (3, 6)
(4, 1)   (4, 2)   (4, 3)   (4, 4)   (4, 5)   (4, 6)
(5, 1)   (5, 2)   (5, 3)   (5, 4)   (5, 5)   (5, 6)
(6, 1)   (6, 2)   (6, 3)   (6, 4)   (6, 5)   (6, 6)
```

When the code is run in a terminal or notebook, the four outcomes (1, 1), (1, 2), (2, 1), and (2, 2) are printed in red.


We see that $| E_1 \cap E_2| = 4$, which means that $P(E_1 \cap E_2) = 4/36=1/9$.  Then we can calculate the desired probability as
\begin{align*}
P(E_1 \cup E_2) &= P(E_1) + P(E_2) - P(E_1 \cap E_2) \\
&= \frac{2}{6} + \frac{2}{6} - \frac{1}{9}\\
&= \frac{5}{9}
\end{align*}
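
The same calculation can be checked with Python sets. The sketch below assumes equally likely outcomes and uses a hypothetical helper `prob` that divides the size of an event by the size of the sample space; it then compares the direct count of $E_1 \cup E_2$ against the right-hand side of Corollary 5:

```python
from fractions import Fraction
from itertools import product

# Sample space for two rolls of a fair six-sided die
S = set(product(range(1, 7), repeat=2))

# E1: roll 1 is less than 3; E2: roll 2 is less than 3
E1 = {s for s in S if s[0] < 3}
E2 = {s for s in S if s[1] < 3}

def prob(event):
    """Probability of an event, assuming equally likely outcomes."""
    return Fraction(len(event), len(S))

lhs = prob(E1 | E2)                         # direct count of the union
rhs = prob(E1) + prob(E2) - prob(E1 & E2)   # Corollary 5

print(len(E1 & E2), lhs, rhs)
```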

**Corollary 6.**   If $A \subset B$, then $P(A) \le P(B)$.

*Proof:*

It may be helpful to refer to the Venn diagram for some intuition:


<img src="prob-subset.png" alt="Venn Diagram of Two Sets A and B, where A is a subset of B" width="250px" style="margin-left:auto;margin-right:auto;">

Note that $A \subset B$ implies that $A \cap B = A$. Because $B \cap A$ and $B \cap \overline{A}$ are disjoint and their union is $B$, Axiom III gives
\begin{align*}
P(B) &= P\left(B \cap A \right) + P \left(B \cap \overline{A} \right) \\
     &= P\left( A \right) + P \left(B \cap \overline{A} \right) \\
P\left( A \right) &= P(B) - P \left(B \cap \overline{A} \right) 
\end{align*}
Since $P \left(B \cap \overline{A} \right) \ge 0$, $P(A) \le P(B)$.


**Corollary 7.** For any events $A_1, A_2, \ldots, A_n$,

\begin{align*}
P\left( \bigcup_{k=1}^{n} A_k \right) &=
\sum_{j=1}^{n} P\left(A_j\right)
    -\sum_{j<k} P \left( A_j \cap A_k \right) + \cdots \\
    &\quad +
    (-1)^{n+1} P\left(A_1 \cap A_2 \cap \cdots \cap A_n \right).
\end{align*}

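Here the ellipsis indicates that the pattern continues until all combinations of events are exhausted: add the probabilities of all single events, subtract the probabilities of all intersections of pairs of events, add the probabilities of all intersections of three events, and so on, with alternating signs.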

The proof is by induction and is omitted.
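
To see the alternating pattern in action, here is a minimal sketch that evaluates the right-hand side of Corollary 7 for events represented as Python sets. It assumes a finite sample space with equally likely outcomes; the function names `prob` and `union_prob_inclusion_exclusion` are hypothetical and chosen only for this illustration. The test case extends the die example to three rolls, with event $i$ being that roll $i$ is less than 3.

```python
from fractions import Fraction
from itertools import combinations, product

def prob(event, sample_space):
    """Probability of an event, assuming equally likely outcomes."""
    return Fraction(len(event), len(sample_space))

def union_prob_inclusion_exclusion(events, sample_space):
    """Evaluate the right-hand side of Corollary 7 for a list of sets."""
    n = len(events)
    total = Fraction(0)
    for r in range(1, n + 1):
        sign = (-1) ** (r + 1)          # + for odd r, - for even r
        for group in combinations(events, r):
            intersection = set.intersection(*group)
            total += sign * prob(intersection, sample_space)
    return total

# Three rolls of a fair six-sided die
S = set(product(range(1, 7), repeat=3))
events = [{s for s in S if s[i] < 3} for i in range(3)]

direct = prob(set.union(*events), S)                   # count the union directly
formula = union_prob_inclusion_exclusion(events, S)    # Corollary 7

print(direct, formula)
```

Both values should agree, since the direct count and the formula describe the same event.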

Note that the set of Corollaries is not unique, and this set is not meant to be comprehensive. Rather, this represents some common tools that we will use in our work on probability.