# Lecture 1: Probability Models and Axioms

A probabilistic model is a quantitative description of a situation, a phenomenon, or an experiment.

A probabilistic model involves two steps.

- Sample Space: We need to describe the possible outcomes of an experiment.

- Probability Laws: We need to specify how probabilities are assigned to outcomes or to collections of outcomes.
 - Axioms: Probabilities need to satisfy basic properties to be meaningful.
   - e.g. Probabilities cannot be negative.
 - Properties that follow from the axioms

Two types of probabilistic models:
 - Discrete
 - Continuous

## Sample Space

Whatever the experiment is, for example
 - flipping a coin
 - flipping a coin 5 times
 - rolling a die

...there will be a set of possible outcomes.

We denote the set of possible outcomes of an experiment by $\Omega$ and call it the **sample space**.

The elements of a sample space should be
 - Mutually exclusive
  - At the end of the experiment, only one of the outcomes can have happened.
 - Collectively exhaustive
  - Together, all the elements of the sample space exhaust all the possibilities.
 - At the "right" granularity
  - The right granularity keeps the information that is relevant to the model and nothing more.
  - For example, when flipping a coin, the weather at the location of the experiment is irrelevant information.
    - Hence $\Omega = \{H, T\}$ is a better sample space than $\Omega' = \{\text{H and rain}, \text{H and no rain}, \text{T and rain}, \text{T and no rain}\}$, although the elements of the latter are also mutually exclusive and collectively exhaustive.
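The granularity point can be sketched in code. The snippet below is a minimal illustration and not part of the lecture; the names `omega` and `omega_fine` are made up for this example. It builds both sample spaces as Python sets and shows that dropping the irrelevant weather component collapses the finer space onto the coarser one.

```python
from itertools import product

# Coarse sample space: only the coin face is recorded.
omega = {"H", "T"}

# Finer sample space: each outcome also records the (irrelevant) weather.
omega_fine = set(product(("H", "T"), ("rain", "no rain")))

# Forgetting the weather maps every fine outcome to exactly one coarse outcome.
coarsened = {coin for coin, _weather in omega_fine}

print(len(omega_fine))     # 4
print(coarsened == omega)  # True
```

Both sets are valid sample spaces; the coarser one is preferred only because the weather adds nothing to a model of the coin.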

### Sample Space Examples

Sample spaces are sets. A sample space can be finite or infinite, discrete or continuous, and so on.

#### Example: 2 rolls of a tetrahedral die

One possible representation of the sample space is the following:

![](assets/sample_space_tetrahedral_die_xy.png)

...and the order of the rolls matters. For example, $(2, 3)$ is a different outcome from $(3, 2)$.

This is an example of a model in which the probabilistic experiment can be described in phases or stages.

It is useful to describe such an experiment sequentially, in terms of a tree.

![](assets/sample_space_tetrahedral_die_tree.png)

In both descriptions, we have $16$ possible outcomes.
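The count of $16$ can be checked with a short sketch. The code below is an illustration under the assumption that the die faces are labelled $1$ to $4$; the names `FACES`, `omega`, and `tree` are hypothetical. It builds the grid description and the tree description and confirms that they contain the same outcomes.

```python
from itertools import product

FACES = (1, 2, 3, 4)  # assumed labels for the faces of a tetrahedral die

# Grid description: ordered pairs (first roll, second roll).
omega = list(product(FACES, repeat=2))

# Sequential (tree) description: one branch per first roll,
# then one leaf per second roll under that branch.
tree = {first: [(first, second) for second in FACES] for first in FACES}
leaves = [leaf for branch in tree.values() for leaf in branch]

print(len(omega), len(leaves))    # 16 16
print(set(omega) == set(leaves))  # True
print((2, 3) == (3, 2))           # False: the order of the rolls matters
```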

#### Example: Throwing a dart at a unit-square target

An outcome is the point $(x, y)$ where the dart hits the target, where $x$ and $y$ are real numbers with $0 \leq x, y \leq 1$.

![](assets/sample_space_dart.png)
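A continuous sample space cannot be listed element by element, but membership can still be tested. The sketch below is a rough illustration: `in_sample_space` and `throw_dart` are hypothetical helper names, and drawing the coordinates uniformly is an assumption made only to produce an example outcome, since the sample space by itself does not say how likely each region is.

```python
import random

def in_sample_space(point):
    """Return True if point = (x, y) lies in the unit square [0, 1] x [0, 1]."""
    x, y = point
    return 0.0 <= x <= 1.0 and 0.0 <= y <= 1.0

def throw_dart(rng):
    """Draw one outcome. Uniform coordinates are an assumption of this sketch."""
    return (rng.random(), rng.random())

rng = random.Random(0)  # fixed seed so the run is repeatable
outcome = throw_dart(rng)
print(outcome, in_sample_space(outcome))
```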

## Probability Axioms

### Definition: Event

Recall the dart example above. What is the probability that the dart hits an exact point $(x, y)$, for any particular $x$ and $y$?

The probability of such an outcome is essentially $0$. In a continuous model it is natural that any individual point has probability $0$.

In this case, instead of assigning probabilities to individual points, we assign probabilities to **subsets** of the sample space.

A subset of the sample space is called an **event**.

The probability of an event $A$ is denoted as $P(A)$.

Why is this called an event? Because at the end of the experiment, the outcome is either inside the subset $A$ (then we say event $A$ has occurred) or outside $A$ (then we say event $A$ did not occur).

![](assets/continous_event.png)
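In a continuous model an event is usually described by a condition rather than by listing its points. As a minimal sketch, the code below encodes a hypothetical event $A = \{(x, y) : x + y \leq 1/2\}$ on the dart target (this particular region is chosen only for illustration) and reports whether $A$ occurred for two sample outcomes.

```python
def event_a(point):
    """Hypothetical event A: the set of points (x, y) with x + y <= 1/2."""
    x, y = point
    return x + y <= 0.5

outcomes = [(0.1, 0.2), (0.9, 0.9)]
for outcome in outcomes:
    status = "occurred" if event_a(outcome) else "did not occur"
    print(outcome, "-> event A", status)
```

The first outcome lies inside $A$ and the second lies outside it, which matches the "occurred" and "did not occur" wording above.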

By convention, probabilities are always given between $0$ and $1$.

Intuitively, probability $0$ means that the event practically cannot happen, and probability $1$ means that the event is practically certain to happen.

### Axioms

The rules that all probabilities should satisfy are called the axioms of probability.

1. Non-negativity: $P(A) \geq 0$
2. Normalization: $P(\Omega) = 1$
3. (Finite) Additivity (to be strengthened later): If $A \cap B = \emptyset$, then $P(A \cup B) = P(A) + P(B)$.

![](assets/additivity_axiom.png)
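The three axioms can be checked numerically on a small discrete model. The sketch below assumes a uniform law on the two-roll tetrahedral die sample space, so each of the $16$ outcomes gets probability $1/16$; this law is an assumption for illustration and is not specified in the notes. The function `prob` and the events `A` and `B` are hypothetical names.

```python
from fractions import Fraction
from itertools import product

# Sample space: two rolls of a four-sided die, as ordered pairs.
omega = frozenset(product((1, 2, 3, 4), repeat=2))

def prob(event):
    """P(event) under the assumed uniform law: |event| / |omega|."""
    event = frozenset(event)
    if not event <= omega:
        raise ValueError("an event must be a subset of the sample space")
    return Fraction(len(event), len(omega))

A = {o for o in omega if o[0] == 1}  # first roll is 1
B = {o for o in omega if o[0] == 4}  # first roll is 4, so A and B are disjoint

print(prob(A) >= 0)                     # non-negativity: True
print(prob(omega) == 1)                 # normalization: True
print(A & B == set())                   # A and B are disjoint: True
print(prob(A | B) == prob(A) + prob(B)) # additivity: True
```

Exact fractions are used instead of floats so that the equality checks are not affected by rounding.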