# ML 101

## 1. Probabilities

### Introduction

- Statistics and probability theory form the branch of mathematics that deals with uncertainty. Probability theory provides the basis for statistical inference from data.
- Sample: a set of $n$ observations drawn from a parent population that is assumed to be described by a probability distribution
- Descriptive statistics: description of the sample
- Inferential statistics: making a decision or drawing an inference about the population from a sample

### Probabilities

A set of probability values for an experiment with sample space $S = \\{ O_1, O_2, \cdots, O_n \\}$ consists of probabilities $p_1, p_2, \cdots, p_n$ that satisfy:
$$ 0 \leq p_i \leq 1, \hspace{0.5cm} i = 1, 2, \cdots, n $$
and
$$ p_1 + p_2 + \cdots + p_n = 1 $$

The probability of outcome $O_i$ occurring is said to be $p_i$ and it is written:

$$ P(O_i) = p_i $$

If the $n$ outcomes are equally likely, each probability has the value $\frac{1}{n}$.
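The two conditions above can be checked mechanically. The following is a minimal sketch, assuming a fair six-sided die as the experiment; the die, the helper name `is_valid_distribution`, and the use of `Fraction` for exact arithmetic are illustrative choices, not part of the notes.

```python
from fractions import Fraction

def is_valid_distribution(probs):
    """Check that every p_i lies in [0, 1] and that the p_i sum to 1."""
    in_range = all(0 <= p <= 1 for p in probs.values())
    return in_range and sum(probs.values()) == 1

# Hypothetical experiment: a fair die with n = 6 equally likely outcomes,
# so each outcome gets probability 1/n.
outcomes = [1, 2, 3, 4, 5, 6]
fair_die = {o: Fraction(1, len(outcomes)) for o in outcomes}

print(is_valid_distribution(fair_die))  # True
print(fair_die[3])                      # 1/6
```

Exact fractions avoid the rounding error that can make a floating-point sum differ slightly from 1.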

### Events
- Event: a subset of the sample space
- The probability of an event $A$, $P(A)$, is obtained by summing the probabilities of the outcomes contained within the event $A$
- An event is said to occur if one of the outcomes contained within the event occurs
- Complement of an event: $A'$ is the event consisting of everything in the sample space $S$ that is not contained within $A$:
$$P(A) + P(A') = 1$$
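As a small illustration, events can be modelled as Python sets and $P(A)$ as a sum over the outcomes in the set. This sketch assumes a fair die; the event "the roll is even" and the helper `prob` are hypothetical names chosen for the example.

```python
from fractions import Fraction

sample_space = {1, 2, 3, 4, 5, 6}
# Assumed probabilities: a fair die, each outcome has probability 1/6.
p = {o: Fraction(1, 6) for o in sample_space}

def prob(event):
    """P(event): sum of the probabilities of the outcomes in the event."""
    return sum((p[o] for o in event), Fraction(0))

A = {2, 4, 6}                    # event: the roll is even
A_complement = sample_space - A  # event A': the roll is odd

print(prob(A))                       # 1/2
print(prob(A) + prob(A_complement))  # 1
```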


### Combinations of Events

1. Intersections
- $A \cap B$ consists of the outcomes contained within both events $A$ and $B$
- Probability of the intersection, $P(A \cap B) $, is the probability that both events occur simultaneously
- Properties:
    - $P(A \cap B) +P(A \cap B') = P(A)$
    - Mutually exclusive events: $A$ and $B$ are mutually exclusive if $A \cap B = \emptyset$, in which case $P(A \cap B) = 0$
    - $A \cap (B \cap C) = (A \cap B) \cap C $
2. Union
- Union of events: $A \cup B$ consists of the outcomes contained within at least one of the events $A$ and $B$
- The probability of this event, $P(A \cup B)$, is the probability that at least one of the events $A$ and $B$ occurs
- Properties:
    - If the events are mutually exclusive, then $P(A \cup B) = P(A) + P(B)$
    - $P( A \cup B) = P(A \cap B') + P(A' \cap B) + P(A \cap B)$
    - $P( A \cup B) = P(A) + P(B) - P(A \cap B)$
    - $P(A \cup B \cup C) = P(A) + P(B) + P(C) - P(A \cap B) - P( B \cap C) - P( A \cap C) + P(A \cap B \cap C)$
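The intersection and union properties can be verified numerically on a small sample space. The sketch below assumes a fair die with equally likely outcomes; the particular events `A`, `B`, and `C` are hypothetical and chosen only to exercise the identities.

```python
from fractions import Fraction

sample_space = set(range(1, 7))  # assumed: fair six-sided die

def prob(event):
    """P(event) for equally likely outcomes: |event| / |S|."""
    return Fraction(len(event), len(sample_space))

A = {2, 4, 6}     # even roll
B = {4, 5, 6}     # roll greater than 3
C = {1, 2, 3, 4}  # roll at most 4
B_complement = sample_space - B

# P(A and B) + P(A and B') = P(A)
split_holds = prob(A & B) + prob(A & B_complement) == prob(A)

# P(A or B) = P(A) + P(B) - P(A and B)
two_event_holds = prob(A | B) == prob(A) + prob(B) - prob(A & B)

# Inclusion-exclusion for three events
three_event_rhs = (
    prob(A) + prob(B) + prob(C)
    - prob(A & B) - prob(B & C) - prob(A & C)
    + prob(A & B & C)
)
three_event_holds = prob(A | B | C) == three_event_rhs

print(prob(A | B))  # 2/3
print(split_holds, two_event_holds, three_event_holds)  # True True True
```

Python's set operators `&` and `|` map directly onto $\cap$ and $\cup$, which keeps the code close to the notation.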

### Conditional Probability
- Conditional probability: the probability of an event $A$ conditional on an event $B$ is:
$$P(A \mid B) = \frac{P(A \cap B)}{P(B)} \hspace{0.5cm}  \text{for } P(B) >0$$
- Properties:
    - $P (A \mid B) = \frac{P(A \cap B)}{P(B)} \Longrightarrow P(A \cap B) = P(B)P (A \mid B)$
    - $P (A \mid B \cap C) = \frac{P(A \cap B \cap C)}{P(B \cap C)} \Longrightarrow P(A \cap B \cap C) = P(B \cap C)P (A \mid B \cap C)$
    - In general, for a sequence of events $A_1, A_2, \cdots, A_n$:
    $$P(A_1 \cap A_2 \cap \cdots \cap A_n) = P(A_1)P(A_2 \mid A_1)P(A_3 \mid A_1 \cap A_2) \cdots P(A_n \mid A_1 \cap \cdots \cap A_{n-1})$$
- Two events $A$ and $B$ are independent if any one of the following equivalent conditions holds:
    - $P(A \mid B) = P(A)$
    - $P(B \mid A) = P(B)$
    - $P(A \cap B) = P(A) \times P(B)$
    - Interpretation: events are independent if the knowledge about one event does not affect the probability of the other event
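The definition, the chain rule, and the independence test can be sketched on the same kind of toy sample space. This example assumes a fair die; the events and the helper names `cond_prob` and `independent` are hypothetical.

```python
from fractions import Fraction

sample_space = set(range(1, 7))  # assumed: fair six-sided die

def prob(event):
    """P(event) for equally likely outcomes."""
    return Fraction(len(event), len(sample_space))

def cond_prob(a, b):
    """P(a | b) = P(a and b) / P(b), defined only when P(b) > 0."""
    if prob(b) == 0:
        raise ValueError("P(B) must be positive")
    return prob(a & b) / prob(b)

def independent(a, b):
    """Independence test: P(a and b) == P(a) * P(b)."""
    return prob(a & b) == prob(a) * prob(b)

A = {2, 4, 6}     # even roll
B = {1, 2, 3, 4}  # roll at most 4
C = {4, 5, 6}     # roll greater than 3

print(cond_prob(A, B), independent(A, B))  # 1/2 True
print(cond_prob(A, C), independent(A, C))  # 2/3 False

# Chain rule: P(A and B and C) = P(A) * P(B | A) * P(C | A and B)
chain = prob(A) * cond_prob(B, A) * cond_prob(C, A & B)
print(chain == prob(A & B & C))  # True
```

Knowing that the roll is at most 4 leaves the chance of an even roll at 1/2, so `A` and `B` are independent; knowing that the roll exceeds 3 raises it to 2/3, so `A` and `C` are not.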

### Posterior Probabilities
- Law of total probability: Given $\{ A_1, A_2, \cdots, A_n \}$ a partition of sample space $S$, the probability of an event $B$, $P(B)$ can be expressed as:
$$P(B) = \sum_{i=1}^n P(A_i)P(B \mid A_i)$$
- Bayes' Theorem: Given $\{ A_1, A_2, \cdots, A_n \}$ a partition of a sample space, the posterior probabilities of the event $A_i$ conditional on an event $B$ can be obtained from the prior probabilities $P(A_i)$ and the conditional probabilities $P(B \mid A_i)$ using the formula:
$$ P(A_i \mid B) = \frac{P(A_i)P(B \mid A_i)}{\sum_{j=1}^n P(A_j)P(B \mid A_j)}$$
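A short sketch of both formulas follows. The scenario is hypothetical: three machines (the partition $A_1, A_2, A_3$) produce a part, and $B$ is the event that the part is defective. The priors and defect rates are made-up numbers used only to illustrate the computation.

```python
from fractions import Fraction

def total_probability(priors, likelihoods):
    """P(B) = sum_i P(A_i) * P(B | A_i) over a partition {A_i}."""
    return sum(priors[k] * likelihoods[k] for k in priors)

def posteriors(priors, likelihoods):
    """P(A_i | B) for every A_i in the partition, via Bayes' theorem."""
    evidence = total_probability(priors, likelihoods)
    if evidence == 0:
        raise ValueError("P(B) must be positive")
    return {k: priors[k] * likelihoods[k] / evidence for k in priors}

# Hypothetical priors P(A_i): share of parts made by each machine.
priors = {"m1": Fraction(1, 2), "m2": Fraction(3, 10), "m3": Fraction(1, 5)}
# Hypothetical likelihoods P(B | A_i): defect rate of each machine.
likelihoods = {"m1": Fraction(1, 100), "m2": Fraction(2, 100), "m3": Fraction(5, 100)}

print(total_probability(priors, likelihoods))  # 21/1000
post = posteriors(priors, likelihoods)
print(post["m1"], post["m2"], post["m3"])      # 5/21 2/7 10/21
print(sum(post.values()))                      # 1
```

Although machine `m3` makes only a fifth of the parts in this made-up setup, its higher defect rate makes it the most likely source of a defective part.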