# Probability Theory

This notebook will walk through essential concepts of probability theory with practical examples.


### $\sigma$-Algebra

A $\sigma$-algebra is a set of sets that is closed under the usual set operations: it contains every set difference of its members, as well as every countable union and every countable intersection of its members. More formally, according to Kolmogoroff:

Let $E$ be a space of elementary events. Consider the power set $2^E$ and let $\Im \subseteq 2^E$ be a set of subsets of $E$. Elements of $\Im$ are called random events. If $\Im$ satisfies the following properties, it is called a $\sigma$-algebra.

1. $E \in \Im$
2. $A, B \in \Im \Rightarrow A - B \in \Im$
3. $A_1, A_2, \ldots \in \Im \Rightarrow \left( \bigcup_{i=1}^\infty A_i \in \Im \wedge \bigcap_{i=1}^\infty A_i \in \Im \right)$

An example of such a set is the power set of a finite space, which can be constructed as follows:

In [1]:
from itertools import chain, combinations

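def powerset(iterable):
    """Return every subset of `iterable` as a tuple, ordered by size."""
    s = list(iterable)
    # combinations(s, r) yields all subsets of size r; chain them for r = 0..len(s)
    return list(chain.from_iterable(combinations(s, r) for r in range(len(s)+1)))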

E = {"a", "b", "c"}
powerset_of_E = powerset(E)
powerset_of_E

[(),
 ('b',),
 ('c',),
 ('a',),
 ('b', 'c'),
 ('b', 'a'),
 ('c', 'a'),
 ('b', 'c', 'a')]
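
For a finite space, the three defining properties can be checked mechanically. The following is a minimal sketch, not a general-purpose implementation: the helper name `is_sigma_algebra` is hypothetical, subsets are represented as `frozenset`s instead of tuples, and it relies on the fact that on a finite space countable unions and intersections reduce to pairwise ones.

```python
from itertools import chain, combinations

def powerset(iterable):
    """Return all subsets of `iterable` as a set of frozensets."""
    s = list(iterable)
    subsets = chain.from_iterable(combinations(s, r) for r in range(len(s) + 1))
    return {frozenset(subset) for subset in subsets}

def is_sigma_algebra(E, family):
    """Check the defining properties for a family of subsets of a finite E.

    Hypothetical helper: on a finite space it is enough to test pairs,
    because countable unions/intersections reduce to finite ones.
    """
    E = frozenset(E)
    family = {frozenset(A) for A in family}
    # Every member must be a subset of E, and E itself must be a member.
    if any(not A <= E for A in family) or E not in family:
        return False
    for A in family:
        for B in family:
            # Closure under difference, union and intersection.
            if A - B not in family:
                return False
            if A | B not in family or A & B not in family:
                return False
    return True

E = {"a", "b", "c"}
print(is_sigma_algebra(E, powerset(E)))                    # full power set
print(is_sigma_algebra(E, [set(), {"a"}, {"b", "c"}, E]))  # a coarser family
print(is_sigma_algebra(E, [set(), {"a"}, E]))              # {"b", "c"} is missing
```

The first two families pass the check, while the third fails because the difference $E - \{a\}$ is not contained in it.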

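The reason why the $\sigma$-algebra is the set of interest for probability theory is that it collects exactly the events to which a probability is assigned. Because it is closed under differences, unions and intersections, any event built from events with known probabilities again has a probability. In particular, for a finite space such as the one above, knowing the probability of every elementary event determines the probability of every possible event.

Formally, this is made precise by the notion of a measure.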

### Probability Measure
Let $\Im$ be a $\sigma$-algebra on $E$. A non-negative real function $P: \Im \rightarrow \mathbb{R}_{0, +}$
is called a measure if it satisfies the following properties (and a probability measure if, in addition, $P(E) = 1$):
 
1. $P(\emptyset) = 0$
2. For any countable sequence $\{A_i \in \Im\}_{i=1,2,\ldots}$ of pairwise disjoint sets, i.e. $A_i \cap A_j = \emptyset$ if $i \neq j$, $P$ satisfies countable additivity ($\sigma$-additivity):
$$P \left( \bigcup_{i=1}^\infty A_i  \right) = \sum_{i=1}^\infty P(A_i)$$ 
3. $P(A \cup B) = P(A) + P(B) - P(A \cap B)$
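
The identity for two overlapping sets can be derived from additivity alone. As a short sketch, assuming $P$ is finite (as it is for a probability measure): $B - A$ and $A \cap B$ belong to $\Im$ by the closure properties of the $\sigma$-algebra, and both $A \cup B$ and $B$ split into disjoint pieces,

$$P(A \cup B) = P(A) + P(B - A), \qquad P(B) = P(B - A) + P(A \cap B)$$

Eliminating $P(B - A)$ from the two equations gives

$$P(A \cup B) = P(A) + P(B) - P(A \cap B)$$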


The probability measure simply states that, for non-intersecting sets, you can determine the probability of the union by adding the individual probabilities. For intersecting sets, you have to subtract the probability of the intersection, because it would otherwise be counted twice. A common way to visualize this is with Venn diagrams.
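
These rules can be tried out on a small finite space. The sketch below assumes a probability measure on $E = \{a, b, c\}$ that is defined by summing elementary probabilities; the values in `weights` and the function name `P` are illustrative assumptions, not part of the definition. `Fraction` is used so that the equalities hold exactly, without floating-point rounding.

```python
from fractions import Fraction

# Hypothetical elementary probabilities on E = {"a", "b", "c"}; they sum to 1.
weights = {"a": Fraction(1, 2), "b": Fraction(1, 3), "c": Fraction(1, 6)}

def P(event):
    """Probability of an event (a subset of E) as the sum of its elementary probabilities."""
    return sum((weights[e] for e in event), Fraction(0))

A = {"a", "b"}
B = {"b", "c"}
C = {"c"}

# The empty event has probability zero, the whole space has probability one.
print(P(set()), P(set(weights)))

# A and C are disjoint, so their probabilities simply add up.
print(P(A | C) == P(A) + P(C))

# A and B overlap in {"b"}, so the intersection has to be subtracted once.
print(P(A | B) == P(A) + P(B) - P(A & B))

# Without the correction the overlap is counted twice.
print(P(A) + P(B) - P(A | B) == P(A & B))
```

All comparisons evaluate to `True`; dropping the subtraction in the overlapping case would overstate $P(A \cup B)$ by exactly $P(A \cap B)$.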