## Conditional probability tables
When the variables are discrete, we may think of the factors $p(x_i\mid x_{A_i})$ as probability tables: rows correspond to assignments to $x_{A_i}$, columns correspond to values of $x_i$, and the entries contain the actual probabilities $p(x_i\mid x_{A_i})$. Each row is a distribution over $x_i$ and therefore sums to 1.
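
As a minimal sketch of such a table, the snippet below stores one row per parent assignment and one column per value of $x_i$. The variable names (`rain`, `wet`), the numbers, and the helpers `lookup` and `rows_are_normalized` are all made up for illustration and do not come from any library.

```python
# Hypothetical CPT for p(x_i | x_{A_i}): x_i is the state of the grass and
# its only parent is the weather. All probabilities are illustrative.
cpt = {
    # parent assignment (table row) -> distribution over x_i (table columns)
    ("rain",): {"wet": 0.9, "dry": 0.1},
    ("no_rain",): {"wet": 0.2, "dry": 0.8},
}


def lookup(cpt, parent_assignment, value):
    """Return p(x_i = value | x_{A_i} = parent_assignment)."""
    return cpt[parent_assignment][value]


def rows_are_normalized(cpt, tol=1e-9):
    """Each row is a conditional distribution, so it must sum to 1."""
    return all(abs(sum(row.values()) - 1.0) < tol for row in cpt.values())


print(lookup(cpt, ("rain",), "wet"))
print(rows_are_normalized(cpt))
```

With several parents, the row key would simply be a longer tuple, one entry per parent variable.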

## Bayes' Theorem 
1. **Prior Probability $P(H)$**: This represents our initial belief about the probability of a hypothesis $H$ before we see any new data or evidence. It reflects what we know about the situation prior to the latest information.

2. **Likelihood $P(E|H)$**: This is the probability of observing the evidence $E$ given that the hypothesis $H$ is true. It measures how likely we are to observe the evidence if the hypothesis holds.

3. **Evidence $P(E)$**: Also known as the marginal likelihood, this is the probability of observing the evidence averaged over all possible hypotheses. It is calculated by summing or integrating $P(E|H)P(H)$ over all possible states of the world (all hypotheses).

4. **Posterior Probability $P(H|E)$**: This is what Bayesian inference aims to compute: the probability of the hypothesis $H$ after taking into account the new evidence $E$ and our prior belief. It represents our updated belief about the hypothesis in light of the new evidence.

Bayes' theorem provides the mathematical formula for computing the posterior probability:

$P(H|E) = \frac{P(E|H) \cdot P(H)}{P(E)}$

In words:

${\displaystyle {\text{posterior}}={\frac {{\text{likelihood}}\times {\text{prior}}}{\text{evidence}}}\,}$
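
To make the formula concrete, here is a small sketch for a binary hypothesis, where the evidence term is expanded as $P(E) = P(E|H)P(H) + P(E|\bar{H})P(\bar{H})$. The function name `posterior` and the numbers (a 1% prior, a 95% likelihood, a 5% false-positive rate) are assumptions chosen purely for illustration.

```python
def posterior(prior, likelihood, likelihood_given_not_h):
    """Return P(H|E) for a binary hypothesis using Bayes' theorem.

    prior                  -- P(H)
    likelihood             -- P(E|H)
    likelihood_given_not_h -- P(E|not H)
    """
    # Evidence P(E): sum over the two hypotheses H and not-H.
    evidence = likelihood * prior + likelihood_given_not_h * (1.0 - prior)
    return likelihood * prior / evidence


# Illustrative numbers only: a rare hypothesis and a fairly reliable signal.
p = posterior(prior=0.01, likelihood=0.95, likelihood_given_not_h=0.05)
print(round(p, 3))
```

With these numbers the posterior is roughly 0.16: even strong evidence leaves the hypothesis unlikely when the prior is small, because most of $P(E)$ comes from the much more common case where $H$ is false.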


## Conditionally Independent

$A$ and $B$ are conditionally independent given $C$, written symbolically as ${\displaystyle (A\perp \!\!\!\perp B|C)}$, if and only if

$P(A,B|C)=P(A|C)P(B|C)$

or, equivalently (whenever $P(B,C)>0$),

$P(A|B,C)=P(A|C)$
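
The factorization $P(A,B|C)=P(A|C)P(B|C)$ can be checked numerically. The sketch below builds a joint distribution over three binary variables as $P(C)P(A|C)P(B|C)$, so conditional independence holds by construction, and then verifies it from the joint table alone. All numbers and helper names (`bern`, `prob`, `cond`, and the two checks) are hypothetical.

```python
from itertools import product

# Hypothetical model: C influences both A and B. Numbers are illustrative.
p_c = {0: 0.5, 1: 0.5}           # P(C=c)
p_a_given_c = {0: 0.2, 1: 0.8}   # P(A=1 | C=c)
p_b_given_c = {0: 0.3, 1: 0.9}   # P(B=1 | C=c)


def bern(p_one, value):
    """Probability of `value` for a binary variable with P(1) = p_one."""
    return p_one if value == 1 else 1.0 - p_one


# Joint table P(A=a, B=b, C=c), keyed by (a, b, c).
joint = {
    (a, b, c): p_c[c] * bern(p_a_given_c[c], a) * bern(p_b_given_c[c], b)
    for a, b, c in product((0, 1), repeat=3)
}


def prob(**fixed):
    """Marginal probability of the given values, e.g. prob(a=1, c=0)."""
    return sum(
        p for (a, b, c), p in joint.items()
        if all({"a": a, "b": b, "c": c}[k] == v for k, v in fixed.items())
    )


def cond(c, **fixed):
    """Conditional probability of the given values of a and/or b, given C=c."""
    return prob(c=c, **fixed) / prob(c=c)


def is_cond_independent(tol=1e-12):
    """Check P(A,B|C) = P(A|C) P(B|C) for every assignment."""
    return all(
        abs(cond(c, a=a, b=b) - cond(c, a=a) * cond(c, b=b)) < tol
        for a, b, c in product((0, 1), repeat=3)
    )


def is_marginally_independent(tol=1e-12):
    """Check P(A,B) = P(A) P(B) for every assignment."""
    return all(
        abs(prob(a=a, b=b) - prob(a=a) * prob(b=b)) < tol
        for a, b in product((0, 1), repeat=2)
    )


print(is_cond_independent())
print(is_marginally_independent())
```

In this example the first check passes and the second fails: $A$ and $B$ are independent once $C$ is known, yet dependent when $C$ is unobserved, because both are driven by $C$.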


Conditional independence can be visualized with probability trees, Venn diagrams, or Bayesian networks. Here we consider a simple Venn diagram with two events $A$ and $B$, given that a third event $C$ has occurred.

1. **Event A**: The shaded area for $A$ represents $P(A | C)$.
2. **Event B**: The shaded area for $B$ represents $P(B | C)$.
3. **Event C**: The presence of $C$ as the bounding box indicates we are looking at probabilities conditional on $C$.

Assuming $A$ and $B$ are conditionally independent given $C$, the following holds:

$P(A \cap B | C) = P(A | C) \times P(B | C)$

In the diagram, this means that the overlap between $A$ and $B$ inside $C$ has an area equal to the product of the two individual conditional probabilities.

### Diagram

Imagine a bounding box for event $C$; inside this box, we have two overlapping regions, one for event $A$ and another for event $B$.

```
+---------------------------------+
|                C                |
|   +-------------+               |
|   |      A      |               |
|   |      +------+------+        |
|   |      | A∩B  |      |        |
|   +------+------+      |        |
|          |      B      |        |
|          +-------------+        |
+---------------------------------+
```

### Calculating Area

If we treat these shapes as geometric areas, we could say:

- Area of $C$ is $1$ (conditioning on $C$ rescales everything inside $C$ to total probability 1)
- Area of $A$ inside $C$ is $P(A | C)$
- Area of $B$ inside $C$ is $P(B | C)$
- Overlapping area of $A$ and $B$ inside $C$ is $P(A \cap B | C)$

Since $A$ and $B$ are conditionally independent given $C$:

$\text{Area of } A \cap B \text{ inside } C = (\text{Area of } A \text{ inside } C) \times (\text{Area of } B \text{ inside } C)$

That is, $P(A \cap B | C) = P(A | C) \times P(B | C)$: the area representing $A \cap B$ within the boundary of $C$ is the product of the conditional probabilities of $A$ and $B$ given $C$.


## Independent Event

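Two events $A$ and $B$ are said to be statistically independent if and only if

$P(A,B)=P(A)P(B)$

Consequently, when $P(B)>0$, conditioning on $B$ does not change the probability of $A$:

$P(A|B)=\frac{P(A,B)}{P(B)}=\frac{P(A)P(B)}{P(B)}=P(A)$

If $A$ and $B$ are independent, then $A$ and the complement $\bar{B}$ are also independent:

$P(A,\bar{B})=P(A)-P(A,B)=P(A)-P(A)P(B)=P(A)P(\bar{B})$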

If $X$ and $Y$ are independent random variables, then the expectation operator $\operatorname {E}$ has the property

${\displaystyle \operatorname {E} [XY]=\operatorname {E} [X]\operatorname {E} [Y]}$


and the covariance ${\displaystyle \operatorname {cov} [X,Y]}$ is zero, as follows from

${\displaystyle \operatorname {cov} [X,Y]=\operatorname {E} [XY]-\operatorname {E} [X]\operatorname {E} [Y].}$
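
Both properties can be checked on a small discrete example. The sketch below builds a joint pmf as the product of two marginals, so $X$ and $Y$ are independent by construction, and then computes the expectations and the covariance from the joint table. The pmfs and the helper `expect` are assumptions made for illustration.

```python
from itertools import product

# Hypothetical marginal pmfs for X and Y; values and weights are illustrative.
p_x = {0: 0.2, 1: 0.5, 2: 0.3}
p_y = {-1: 0.4, 3: 0.6}

# Independence by construction: P(X=x, Y=y) = P(X=x) P(Y=y).
joint = {(x, y): p_x[x] * p_y[y] for x, y in product(p_x, p_y)}


def expect(f):
    """Return E[f(X, Y)] under the joint pmf."""
    return sum(f(x, y) * p for (x, y), p in joint.items())


e_x = expect(lambda x, y: x)
e_y = expect(lambda x, y: y)
e_xy = expect(lambda x, y: x * y)

# cov[X, Y] = E[XY] - E[X] E[Y], which vanishes for independent X and Y.
cov = e_xy - e_x * e_y

print(abs(e_xy - e_x * e_y) < 1e-12)
print(abs(cov) < 1e-12)
```

The comparison uses a small tolerance because of floating-point rounding. Note that the implication runs in one direction only: zero covariance does not by itself imply independence.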



# Conditional Distribution of Y Given X
Refs: [1](https://online.stat.psu.edu/stat414/lesson/21/21.1)


