## Random Variables

According to the definition given in calculus textbooks, the quantity $y$ is called a function of the real number $x$ if to every $x$ there corresponds a value $y$. This definition can be extended to cases where the independent variable is not a real number. Thus, we call the distance a function of a pair of points; the perimeter of a triangle is a function defined on the set of triangles; a sequence $(a_n)$ is a function defined for all positive integers; the binomial coefficient ${x \choose k}$ is a function defined for pairs of numbers $(x,k)$ of which the second is a non-negative integer. In the same sense, we can say that the number $S_n$ of successes in $n$ Bernoulli trials is a function defined on the sample space; to each of the $2^n$ points in this space, there corresponds a number $S_n$.

---
**Definition (Random Variable).** A function defined on a sample space is called a random variable.

---

Typical random variables are the number of aces in a hand at bridge, the number of successes in $n$ Bernoulli trials, the waiting time for the $r$th success, etc. In each case, there is a unique rule which associates a number $X$ with any sample point $\omega$. The classical theory of probability was devoted mainly to a study of the gambler's gain, which is again a random variable; in fact, every random variable can be interpreted as the gain of a real or imaginary gambler in a suitable game. The position of a particle under diffusion and the energy and temperature of physical systems are random variables, but they are defined in non-discrete sample spaces, and their study is therefore deferred. In the case of a discrete sample space, we can actually tabulate any random variable $X$ by enumerating in some order all points of the space and associating with each the corresponding value of $X$.
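As a small illustration of such a tabulation, the sketch below enumerates the sample space of $n = 3$ Bernoulli trials and lists the value of $S_3$ at each point. The labels `'S'`/`'F'`, the choice $n = 3$, and the helper name `S` are assumptions made for this example only.

```julia
# Tabulate the random variable S_3 (number of successes) on the sample
# space of n = 3 Bernoulli trials. The labels 'S' (success) and 'F'
# (failure) are illustrative, not notation from the text.
n = 3

# All 2^n sample points, each a tuple such as ('S', 'F', 'S').
sample_space = vec(collect(Iterators.product(fill(('S', 'F'), n)...)))

# The random variable: a rule assigning a number to each sample point.
S(ω) = count(==('S'), ω)

# Print the table: sample point -> value of S_3.
for ω in sample_space
    println(join(ω), "  ->  ", S(ω))
end
```

Each of the $2^3 = 8$ rows pairs one sample point with the single number the rule assigns to it, which is all that the definition of a random variable requires.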

Let $X$ be a random variable and let $x_1,x_2,\ldots$ be the values which it assumes; in most of what follows the $x_j$ will be integers. The aggregate of all sample points on which $X$ assumes the fixed value $x_j$ forms the event $X = x_j$; its probability is denoted by $P\{X = x_j\}$.

The function

\begin{align*}
P\{X=x_j\} = f(x_j) \quad (j=1,2,\ldots) \tag{1}
\end{align*}

is called the probability mass function (PMF) of the random variable $X$. Clearly,

\begin{align*}
f(x_j) \geq 0, \quad \sum_j f(x_j) = 1 \tag{2}
\end{align*}
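To make the step from sample points to a PMF concrete, the following sketch collects the sample points of $n$ Bernoulli trials according to the value of $S_n$ and adds up their probabilities. The values $n = 4$ and $p = 0.3$, the encoding 1 = success and 0 = failure, and the names `point_prob` and `f` are assumptions chosen for the example.

```julia
# Build the PMF of S_n by aggregating sample points of n Bernoulli trials.
# Assumed for illustration: n = 4, p = 0.3, and 1 = success, 0 = failure.
n, p = 4, 0.3
q = 1 - p

# All 2^n sample points, each a tuple of 0s and 1s.
sample_space = vec(collect(Iterators.product(fill((0, 1), n)...)))

# Probability of one sample point: a factor p per success, q per failure.
point_prob(ω) = p^sum(ω) * q^(n - sum(ω))

# f[x] accumulates the probability of the event {S_n = x}.
f = Dict{Int,Float64}()
for ω in sample_space
    x = sum(ω)                          # value of the random variable at ω
    f[x] = get(f, x, 0.0) + point_prob(ω)
end

for x in sort(collect(keys(f)))
    println("f(", x, ") = ", round(f[x], digits=4))
end
println("sum = ", sum(values(f)))
```

Every entry of `f` is non-negative and the entries add up to 1 (up to floating-point rounding), as condition (2) requires.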

With this terminology we can say that in Bernoulli trials, the number of successes $S_n$ is a random variable with the probability mass function:

\begin{align*}
P\{S_n=k\} = {n \choose k}p^k q^{n-k} \quad (k=0,1,\ldots,n)
\end{align*}

whereas the number of trials up to and including the first success is a random variable with the PMF:

\begin{align*}
P\{X=k\} = q^{k-1}p \quad (k=1,2,\ldots)
\end{align*}
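As a quick check, both of these functions satisfy condition (2). Writing $q = 1-p$ and assuming $0 < p \leq 1$ for the second line, the binomial theorem and the geometric series give

\begin{align*}
\sum_{k=0}^{n} {n \choose k}p^k q^{n-k} &= (p+q)^n = 1, \\
\sum_{k=1}^{\infty} q^{k-1}p &= \frac{p}{1-q} = 1.
\end{align*}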

In [13]:
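# Binomial PMF: probability of exactly k successes in N Bernoulli trials

using Plots
using Distributions
plotlyjs()  # select the PlotlyJS backend for Plots

N = 20  # number of trials

# Probability of k successes in n trials with success probability p.
function binomial_pmf(k, n, p)
    return binomial(n, k) * p^k * (1 - p)^(n - k)
end

# Stem plot of the PMF for p = 0.5 over k = 0, 1, ..., N.
plot(0:N, [binomial_pmf(k, N, 0.5) for k in 0:N],
        line=:stem, marker=:circle, c=:blue,
        xlabel="k",
        ylabel="Probability",
        label="Binomial PMF")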

In [14]:
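# Geometric PMF: probability that the first success occurs on trial k

using Plots
using Distributions
plotlyjs()  # select the PlotlyJS backend for Plots

N = 20  # largest value of k shown; the PMF itself has infinite support

# Probability of k - 1 failures followed by a success, with success probability p.
function geometric_pmf(k, p)
    return (1 - p)^(k - 1) * p
end

# Stem plot of the PMF for p = 0.5 over k = 1, 2, ..., N.
plot(1:N, [geometric_pmf(k, 0.5) for k in 1:N],
        line=:stem, marker=:circle, c=:blue,
        xlabel="k",
        ylabel="Probability",
        label="Geometric PMF")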

Consider now two random variables $X$ and $Y$ defined on the same sample space, and denote the values which they assume by $x_1,x_2,\ldots$ and $y_1,y_2,\ldots$, respectively; let the corresponding probability mass functions be $\{f(x_j)\}$ and $\{g(y_k)\}$. The aggregate of the sample points on which the two conditions $X=x_j$ and $Y=y_k$ are both satisfied forms an event whose probability will be denoted by $P\{X=x_j, Y=y_k\}$. The function

\begin{align*}
P\{X=x_j,Y=y_k\} = f(x_j,y_k)
\end{align*}

is called the joint PMF of $X$ and $Y$. It is best exhibited in the form of a double-entry table. Clearly,