(uv:mt:rvs)=
# Random Variables

## Measurability

Next up is one of the most misunderstood (and most unnecessarily terrifying) concepts of traditional probability and statistics courses: the random variable. Let's start with a definition:

````{prf:definition} $(\mathcal F, \Sigma)$-Random Variable
:label: uv:mt:rvs:rv
Suppose that $(\Omega, \mathcal F)$ and $(S, \Sigma)$ are two measurable spaces. A function $X : \Omega \rightarrow S$ is said to be a measurable map from $(\Omega, \mathcal F)$ to $(S, \Sigma)$ if for all $B \in \Sigma$:
```{math}
X^{-1}(B) \equiv \{X \in B\} \equiv \{\omega : X(\omega) \in B\} \in \mathcal F
```
Written another way:
```{math}
X^{-1}(\Sigma) \equiv \{X^{-1}(B) : B \in \Sigma\} \equiv \left\{\{\omega : X(\omega) \in B\} : B \in \Sigma\right\}\subseteq \mathcal F
```
For such a measurable map $X$, $X$ is said to be a $(\mathcal F, \Sigma)$-valued random variable, or it is $(\mathcal F, \Sigma)$-measurable. In shorthand, we denote that $X \in m(\mathcal F, \Sigma)$.
````
That definition was quite large, and its interpretation is really at the heart of the difference between traditional probability and statistics and measure-theoretic probability and statistics. Let's break down all of the things that this definition is saying.

The first aspect to fixate on here is that for each $\omega \in \Omega$, $X$ maps $\omega$ to some other point $s \in S$.

All that the first line says is that, when we take some subset $B$ of the codomain $S$ that is in the $\sigma$-algebra $\Sigma$, the set of all $\omega$ where $X(\omega) \in B$ is in the $\sigma$-algebra $\mathcal F$ (this is made extremely clear by the right-most set). Stated mathematically, for any $B \in \Sigma$, the *preimage* of $B$ is in $\mathcal F$. The left two notations, $X^{-1}(B)$ and $\{X \in B\}$, are simply shorthands for the statement at the right. Random variables can, and will, become extremely confusing when people see the shorthands and then forget what they actually mean. The idea is that the preimage of each element of the $\sigma$-algebra on $S$ is in the $\sigma$-algebra on $\Omega$. In a figure, this looks like this:

```{figure} ./Images/measurable.png
---
width: 700px
name: uv:mt:rvs:measurable_fig
---
Here, the blue circle represents the event space $\Omega$, and the target space $S$ is represented by the red square. For some $B \in \Sigma$, represented by the dark red circles, there is a set of $\omega$s in $\Omega$ that map into $B$ under $X$. The collection of all such $\omega$s, here shown in dark blue circles, comprises a set which is in $\mathcal F$, the $\sigma$-algebra on $\Omega$.
```

The bottom line simply states this another way. This notation can get confusing, and we are not going to write it out so nicely every time, so take a long hard look at the previous definition, the equivalent notations, and learn to love them!
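To make the preimage condition concrete, here is a minimal sketch on a finite toy example. The spaces `omega`, `F`, `S`, `Sigma` and the maps `X` and `Y` are hypothetical choices made purely for illustration; the check simply applies the definition by testing every $B \in \Sigma$.

```python
def preimage(X, omega, B):
    """Return {w in omega : X(w) in B} as a frozenset."""
    return frozenset(w for w in omega if X[w] in B)


def is_measurable(X, omega, F, Sigma):
    """Check the definition directly: every B in Sigma has its preimage in F."""
    return all(preimage(X, omega, B) in F for B in Sigma)


# Hypothetical toy spaces: Omega has four outcomes, S has two points.
omega = frozenset({1, 2, 3, 4})
F = {frozenset(), frozenset({1, 2}), frozenset({3, 4}), omega}
S = frozenset({"a", "b"})
Sigma = {frozenset(), frozenset({"a"}), frozenset({"b"}), S}

# X is constant on the blocks {1, 2} and {3, 4} of F.
X = {1: "a", 2: "a", 3: "b", 4: "b"}
# Y splits the block {1, 2}, so the preimage of {"a"} is {1}, which is not in F.
Y = {1: "a", 2: "b", 3: "b", 4: "b"}

print(is_measurable(X, omega, F, Sigma))  # True
print(is_measurable(Y, omega, F, Sigma))  # False
```

Note that `Y` is a perfectly good function from $\Omega$ to $S$; it fails to be a random variable only because $\mathcal F$ is too coarse to contain the set $\{\omega : Y(\omega) = a\}$.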

````{prf:remark} Shorthand when codomain is a subset of the reals
Suppose that $(\Omega, \mathcal F)$ and $(S, \Sigma)$ are two measurable spaces, and $X \in m(\mathcal F, \Sigma)$. If $S \subseteq \mathbb R$ and $\Sigma = \mathcal B(S)$, then we will use the shorthand $X \in m\mathcal F$.
````

Usually, the codomain is going to be a subset of the real numbers, and the $\sigma$-algebra is just going to be the Borel $\sigma$-algebra. It gets cumbersome to always write all of this out, so we adopt the above shorthand fairly often as the book proceeds.

There are some important consequences here. 

````{prf:remark} Discrete event space
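Suppose that the event space $\Omega$ is discrete, and that $(\Omega, \mathcal F)$ is a measurable space where $\mathcal F$ is the power set of $\Omega$ (the usual choice for a discrete space). Then any $X : \Omega \rightarrow \mathbb R$ is $m\mathcal F$, since every subset of $\Omega$, and in particular every preimage, is in $\mathcal F$.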
````

An important example of a random variable is the indicator random variable:

````{prf:example} Indicator function
:label: uv:mt:rvs:indicator
Suppose that $(\Omega, \mathcal F)$ is a measurable space, and $A \in \mathcal F$. Then the function:
```{math}
    \mathbb 1_{\{A\}}(\omega) &= \begin{cases}
        1, & \omega \in A \\
        0, & \omega \not\in A
    \end{cases}
```
is $m\mathcal F$.
````
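To see why the indicator is measurable, it helps to write out its preimages explicitly. As a short worked check (using the Borel $\sigma$-algebra on $\mathbb R$ as the target, per the shorthand above), for any Borel set $B$ the preimage depends only on which of the two values $0$ and $1$ lie in $B$:

```{math}
    \{\mathbb 1_{\{A\}} \in B\} &= \begin{cases}
        \varnothing, & 0 \not\in B,\, 1 \not\in B \\
        A, & 0 \not\in B,\, 1 \in B \\
        A^c, & 0 \in B,\, 1 \not\in B \\
        \Omega, & 0 \in B,\, 1 \in B
    \end{cases}
```

Each of these four sets is in $\mathcal F$ because $A \in \mathcal F$ and $\mathcal F$ is a $\sigma$-algebra, so every preimage lands in $\mathcal F$.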

We also have an equivalent characterization of the definition for a random variable, that will be helpful for proofs:

````{prf:remark} Inverse map
:label: uv:mt:rvs:rv:equiv
Suppose that $(\Omega, \mathcal F)$ and $(S, \Sigma)$ are two measurable spaces, and $X : \Omega \rightarrow S$. An equivalent interpretation of the definition of a random variable is that $X \in m(\mathcal F, \Sigma)$ exactly when the preimage map sends sets in $\Sigma$ to sets in $\mathcal F$; that is, $X^{-1} : \Sigma \rightarrow \mathcal F$.
````

````{prf:remark}
You'll often see us go back and forth throughout this book using the words measurable and random variable. Probability theorists tend to like the term random variable, and mathematicians tend to like the word measurable. We will tend to stick to the probability theory language later on, but it is important to cement in your head that there is nothing "random" about these things really; random variables (measurable functions) follow a prescribed set of rules, and really thinking hard about these rules will make them feel a little less mysterious.
````

## Generators

Just like we could identify $\sigma$-algebras from generating sets, we can identify random variables from generating sets:

````{prf:theorem} Random variable induced by generating sets
:label: uv:mt:rvs:generator
Suppose that $(\Omega, \mathcal F)$ and $(S, \Sigma)$ are measurable spaces, that $X : \Omega \rightarrow S$, and that $\Sigma = \sigma(\mathcal A)$ for some family $\mathcal A$ of subsets $A \subseteq S$. Then if for all $A \in \mathcal A$:
```{math}
    X^{-1}(A) = \{\omega : X(\omega) \in A\} &\in \mathcal F \\
    \Rightarrow X^{-1}(\mathcal A) \subseteq \mathcal F
```
$X \in m(\mathcal F, \Sigma)$. 
````

````{prf:proof}
Let:
```{math}
\mathcal B &\triangleq \{B \in \Sigma : X^{-1}(B) \in \mathcal F\} \\
&\equiv \{B \in \Sigma : \{X \in B\} \in \mathcal F\} \\
&\equiv \{B \in \Sigma : \{\omega : X(\omega) \in B\} \in \mathcal F\}
```
be a family of subsets of $S$; that is, $\mathcal B$ collects the sets in $\Sigma$ whose preimage under $X$ lands in $\mathcal F$.

By construction, notice that $X^{-1} : \mathcal B \rightarrow \mathcal F$. 

To see that $\mathcal B$ is a $\sigma$-algebra on $S$:

1\. Contains $S$: Note that since $X : \Omega \rightarrow S$, for any $\omega \in \Omega$, $X(\omega) \in S$, so $\{X \in S\} \equiv \{\omega : X(\omega) \in S\} = \Omega$.

Then since $\Omega \in \mathcal F$ and $S \in \Sigma$, $S \in \mathcal B$.

2\. Closed under complements: Suppose that $B \in \mathcal B$, so $\{X \in B\}\equiv \{\omega : X(\omega) \in B\} \in \mathcal F$.

Further, note that $\{\omega : X(\omega) \in B^c\} \equiv \{X \in B^c\} = \{X \in B\}^c \equiv \{\omega : X(\omega) \in B\}^c$, which follows because $\{\omega : X(\omega) \in B\} = \Omega \setminus \{\omega : X(\omega) \in B^c\}$. 

Then since $\mathcal F$ is a $\sigma$-algebra where $\{\omega : X(\omega) \in B\} \in \mathcal F$, then $\{\omega : X(\omega) \in B\}^c \in \mathcal F$, since $\mathcal F$ is closed under complement.

Then since $B^c \in \Sigma$ as well ($\Sigma$ is a $\sigma$-algebra), $B^c \in \mathcal B$.

3\. Closed under countable unions: Suppose that $B_n \in \mathcal B$, where $n \in \mathbb N$.

Then for all $n \in \mathbb N$, $\{X \in B_n\} \equiv \{\omega : X(\omega) \in B_n\} \in \mathcal F$.

Then since $\mathcal F$ is closed under countable unions, $\bigcup_{n \in \mathbb N}\{X \in B_n\} \in \mathcal F$.

Finally, note that $\bigcup_{n \in \mathbb N}\{X \in B_n\} \equiv \bigcup_{n \in \mathbb N}\{\omega : X(\omega) \in B_n\}$ is identical to the set $\left\{\omega : X(\omega) \in \bigcup_{n \in \mathbb N}B_n\right\} \equiv \left\{X \in \bigcup_{n \in \mathbb N}B_n \right\}$, so $\left\{X \in \bigcup_{n \in \mathbb N}B_n \right\} \in \mathcal F$.

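Then since $\bigcup_{n \in \mathbb N}B_n \in \Sigma$ as well ($\Sigma$ is closed under countable unions), $\bigcup_{n \in \mathbb N}B_n \in \mathcal B$.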

Then by construction, $X \in m(\mathcal F, \mathcal B)$. 

Finally, we have to show that $X$ is $m(\mathcal F, \Sigma)$.

For this, it suffices to show that $\mathcal B \supseteq \Sigma$.

By hypothesis, $X^{-1}(A) \in \mathcal F$ for every $A \in \mathcal A$, so $\mathcal B \supseteq \mathcal A$, where $\Sigma = \sigma(\mathcal A)$.

Then since $\mathcal B$ is a $\sigma$-algebra, by {prf:ref}`uv:mt:prob_spaces:sig:preserve`, $\mathcal B \supseteq \Sigma = \sigma(\mathcal A)$.

Then $X^{-1} : \Sigma \rightarrow \mathcal F$.

Then $X \in m(\mathcal F, \Sigma)$, by {prf:ref}`uv:mt:rvs:rv:equiv`. 
````
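The theorem can be sanity-checked on a finite example, where generated $\sigma$-algebras can be enumerated by brute force. The sketch below is illustrative only: the helper `generate_sigma_algebra`, the spaces, and the map `X` are hypothetical choices, and on a finite set closure under complements and pairwise unions is enough to produce $\sigma(\mathcal A)$.

```python
def preimage(X, omega, B):
    """Return {w in omega : X(w) in B} as a frozenset."""
    return frozenset(w for w in omega if X[w] in B)


def generate_sigma_algebra(space, generators):
    """Close the generators under complements and unions until nothing new appears.

    On a finite space this fixed point is the sigma-algebra generated by `generators`.
    """
    space = frozenset(space)
    family = {frozenset(), space} | {frozenset(A) for A in generators}
    while True:
        new = {space - B for B in family}
        new |= {B | C for B in family for C in family}
        if new <= family:
            return family
        family |= new


# Hypothetical toy setup: F is generated by the blocks {1, 2} and {3, 4}.
omega = frozenset(range(1, 7))
F = generate_sigma_algebra(omega, [{1, 2}, {3, 4}])

# Target space S = {0, 1, 2}, with Sigma generated by the family A = {{0}, {1}}.
S = frozenset({0, 1, 2})
A = [frozenset({0}), frozenset({1})]
Sigma = generate_sigma_algebra(S, A)

# X sends 1, 2 -> 0; 3, 4 -> 1; 5, 6 -> 2.
X = {w: (w - 1) // 2 for w in omega}

# Step 1: check the hypothesis of the theorem on the generators only.
generators_ok = all(preimage(X, omega, a) in F for a in A)

# Step 2: confirm the conclusion by brute force over all of Sigma.
all_ok = all(preimage(X, omega, B) in F for B in Sigma)

print(generators_ok, all_ok)  # True True
```

Here the hypothesis requires checking only two preimages, while the brute-force confirmation runs over all eight sets in `Sigma`; for $\sigma$-algebras that cannot be enumerated, only the first kind of check is available, which is where the theorem earns its keep.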

Why is this result so important? As it turns out, establishing measurability of a function can be rather difficult. For instance, if the codomain is the real line $\mathbb R$ and the $\sigma$-algebra is $\mathcal R$, it might be really hard to establish that {prf:ref}`uv:mt:rvs:rv` holds for every possible element of $\mathcal R$ (this, of course, is because describing the elements of $\mathcal R$ explicitly is tedious, if not impossible). However, we can describe generators of $\mathcal R$, and then we can use {prf:ref}`uv:mt:rvs:generator` to evaluate whether a function is measurable (a random variable).
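As a concrete instance, assuming $\mathcal R$ denotes the Borel $\sigma$-algebra on $\mathbb R$, one standard generating family is the collection of closed left rays. Under that assumption, the generator theorem reduces the measurability check to a one-parameter family of sets:

```{math}
\mathcal R = \sigma\left(\left\{(-\infty, a] : a \in \mathbb R\right\}\right), \quad \text{so if } \{X \leq a\} \equiv \{\omega : X(\omega) \leq a\} \in \mathcal F \text{ for all } a \in \mathbb R, \text{ then } X \in m\mathcal F
```

In other words, rather than testing every Borel set, it is enough to test the sets $\{X \leq a\}$.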
