<h1 style="font-size: 1.6rem; font-weight: bold">Module 3 - Topic 3: Random Variables</h1>
<p style="margin-top: 5px; margin-bottom: 5px;">Monash University Australia</p>
<p style="margin-top: 5px; margin-bottom: 5px;">ITO 4001: Foundations of Computing</p>
<p style="margin-top: 5px; margin-bottom: 5px;">Jupyter Notebook by: Tristan Sim Yook Min</p>

---

### **Definition and Classification**

A random variable may be defined as a real-valued function over a random experiment's sample space $S$. 
- The function's domain is $S$
- The real numbers associated with the various possible outcomes of the random experiment constitute the range of the function

Random variables are classified into two main categories:

1. **Discrete Random Variable**:
   - Range consists of a finite number or countable infinitude of values
   
2. **Continuous Random Variable**:
   - Range consists of an uncountable infinitude of values

#### **Random Sampling**
In random sampling, every member of a population has the same chance of being selected for the sample.

<br> 

#### **Examples of Discrete Random Variables**

#### Example 1: Item Selection from a Manufactured Lot
- Let $X$ represent the status of an item drawn randomly
- $X=0$ represents drawing a non-defective item
- $X=1$ represents drawing a defective item
- Range of $X$ is $\{0,1\}$
- Therefore, $X$ is a discrete random variable

#### Example 2: First Failure of a Switch
- Let $X$ denote the number of the throw on which the first failure of a switch occurs
- Range of $X$ is $\{1,2,3,\ldots\}$, since the first failure can occur on any throw
- This range consists of a countable infinitude of values
- Therefore, $X$ is a discrete random variable

<br> 

#### **Example of a Continuous Random Variable: Time to Failure**
- Let $X$ denote the time to failure of a bus section in an electrostatic precipitator
- Range of $X$ consists of all real numbers greater than zero: $(0,\infty)$
- This is an uncountable infinitude of values
- Therefore, $X$ is a continuous random variable
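
The three examples above can be simulated to see what their ranges look like in practice. This is a minimal sketch using only the standard library. The defect probability, the per-throw failure probability, and the exponential model for time to failure are all assumptions made for illustration, not values from the notes.

```python
import random

rng = random.Random(0)  # fixed seed so the run is repeatable

def draw_item(p_defective=0.1):
    """Example 1: X = 1 for a defective item, 0 otherwise (p_defective is assumed)."""
    return 1 if rng.random() < p_defective else 0

def first_failure(p_fail=0.2):
    """Example 2: number of the throw on which the switch first fails (p_fail is assumed)."""
    n = 1
    while rng.random() >= p_fail:  # the switch survives this throw
        n += 1
    return n

def time_to_failure(rate=0.5):
    """Continuous example: time to failure, modelled here as exponential (an assumption)."""
    return rng.expovariate(rate)

items = [draw_item() for _ in range(10)]
failures = [first_failure() for _ in range(10)]
times = [time_to_failure() for _ in range(5)]

print(items)     # values in {0, 1}
print(failures)  # positive integers with no fixed upper bound
print(times)     # non-negative real numbers
```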



---

### **Expectation of a Random Variable**

The expected value or expectation of a random variable is the **average value** of the random variable. The expected value of a random variable $X$ is denoted by $E(X)$.

The expected value of a random variable can be interpreted as the long-run average of observations on the random variable. The procedure for calculating the expected value of a random variable depends on whether the random variable is discrete or continuous.

#### **Discrete Random Variables**

If $X$ is a **discrete random variable** with **probability distribution function (pdf)** specified by $f(x)$, then the expectation of a discrete random variable $X$ is:

$$E[X] = \sum_x x f(x)$$

or for a function $g(X)$ of the random variable:

$$E[g(X)] = \sum_x g(x) f(x)$$
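
As a quick check of the two sums above, the sketch below evaluates them exactly for a fair six-sided die. The die is a hypothetical example chosen for illustration, and `expectation` is a helper name introduced here.

```python
from fractions import Fraction

# f(x) for a fair six-sided die (hypothetical example)
f = {x: Fraction(1, 6) for x in range(1, 7)}

def expectation(f, g=lambda x: x):
    """Return E[g(X)] = sum over x of g(x) * f(x) for a discrete distribution f."""
    return sum(g(x) * p for x, p in f.items())

mean = expectation(f)                      # E[X]
mean_sq = expectation(f, lambda x: x * x)  # E[X^2], i.e. g(x) = x^2

print(mean, mean_sq)  # 7/2 91/6
```

Using `Fraction` keeps the arithmetic exact, so the results can be compared directly with a hand calculation.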

#### **Continuous Random Variables**

If $X$ is a **continuous random variable** with pdf specified by $f(x)$, then the expectation of the continuous random variable $X$ is:

$$E[X] = \int_{-\infty}^{\infty} x f(x) dx$$

or for a function $g(X)$ of the random variable:

$$E[g(X)] = \int_{-\infty}^{\infty} g(x) f(x) dx$$
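
The integrals above can be approximated numerically. The sketch below assumes an exponential density $f(x) = \lambda e^{-\lambda x}$ for $x \ge 0$ with $\lambda = 2$, for which $E[X] = 1/\lambda$ and $E[X^2] = 2/\lambda^2$ are known in closed form. The density, the truncation of the upper limit at 20, and the midpoint rule are all choices made for illustration.

```python
import math

RATE = 2.0  # assumed rate parameter (lambda) of the exponential density

def f(x):
    """Exponential pdf: RATE * exp(-RATE * x) for x >= 0, and 0 otherwise."""
    return RATE * math.exp(-RATE * x) if x >= 0 else 0.0

def expectation(g, lower=0.0, upper=20.0, steps=200_000):
    """Approximate E[g(X)], the integral of g(x) f(x) dx, with the midpoint rule.

    The infinite upper limit is truncated at `upper`, where f is negligible.
    """
    h = (upper - lower) / steps
    total = 0.0
    for i in range(steps):
        x = lower + (i + 0.5) * h  # midpoint of the i-th subinterval
        total += g(x) * f(x)
    return total * h

mean = expectation(lambda x: x)         # close to 1 / RATE = 0.5
mean_sq = expectation(lambda x: x * x)  # close to 2 / RATE**2 = 0.5

print(mean, mean_sq)
```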

#### **Important Properties**

Similar to the expectation of a random variable, the expectation of a function of a random variable $g(X)$ is a weighted average of the function over all possible values $X$ can take on, with each value being weighted according to the probability of observing it.

In general, the expected value of a function of a random variable is **not** the function evaluated at the expected value of the random variable:

$$E[g(X)] \neq g(E[X])$$

Treating the two as equal is a common mistake. Equality is guaranteed only when $g$ is linear, that is, $g(x) = ax + b$ for constants $a$ and $b$.
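
The inequality above can be seen on a small example. The sketch below uses a fair six-sided die (a hypothetical example) and compares a nonlinear function, squaring, with a linear one, $3x + 2$. The function names are introduced here for illustration.

```python
from fractions import Fraction

# Distribution of a fair six-sided die (hypothetical example)
pmf = {x: Fraction(1, 6) for x in range(1, 7)}

def expectation(pmf, g=lambda x: x):
    """E[g(X)] for a discrete distribution given as {value: probability}."""
    return sum(g(x) * p for x, p in pmf.items())

def square(x):
    return x * x       # nonlinear

def affine(x):
    return 3 * x + 2   # linear: a*x + b

mean = expectation(pmf)  # E[X] = 7/2

print(expectation(pmf, square), square(mean))  # 91/6 49/4  (different)
print(expectation(pmf, affine), affine(mean))  # 25/2 25/2  (equal)
```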


---

### **Linearity of Expectation**

#### **Expected Value of a Constant**

The expected value of a constant is the constant itself: 

$$E(c) = c$$

where $c$ is any constant. This can be verified by noting that:

$$E(c) = \int_{-\infty}^{\infty} c f(x) dx = c \int_{-\infty}^{\infty} f(x) dx$$

and by definition:

$$\int_{-\infty}^{\infty} f(x) dx = 1$$

#### **Expected Value of a Constant Times a Random Variable**

The expected value of a constant times a random variable is the constant times the expected value of the random variable:

$$E[cX] = c E[X]$$

where $c$ is any constant with respect to $X$.

This can also be verified by:

$$E[cX] = \int_{-\infty}^{\infty} cx f(x) dx = c \int_{-\infty}^{\infty} x f(x) dx = c E[X]$$

#### **Additivity of Expectation**

The expected value of a sum of two random variables is the sum of their expected values, whether or not the variables are independent:

$$E[X+Y] = E[X] + E[Y]$$

Similarly, for functions of a random variable:

$$E[f(X) + g(X)] = E[f(X)] + E[g(X)]$$
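
The scaling and additivity rules can be checked by enumerating a small joint distribution. The sketch below assumes two fair coin flips, with $X$ the first flip and $Y$ the total number of heads, so $X$ and $Y$ are deliberately dependent. The setup and helper names are hypothetical.

```python
from fractions import Fraction
from itertools import product

# Joint distribution of two fair coin flips (hypothetical example):
# each of the four outcomes (first, second) has probability 1/4.
outcomes = {flips: Fraction(1, 4) for flips in product((0, 1), repeat=2)}

def expectation(h):
    """E[h(outcome)] over the joint distribution above."""
    return sum(h(o) * p for o, p in outcomes.items())

def X(o):
    return o[0]         # result of the first flip

def Y(o):
    return o[0] + o[1]  # total number of heads, which depends on X

e_x = expectation(X)                         # 1/2
e_y = expectation(Y)                         # 1
e_sum = expectation(lambda o: X(o) + Y(o))   # E[X + Y]
e_scaled = expectation(lambda o: 5 * X(o))   # E[5X]

print(e_sum, e_x + e_y)   # 3/2 3/2
print(e_scaled, 5 * e_x)  # 5/2 5/2
```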

#### **Mean and Variance**

The expected value of a random variable $X$ is also called the **mean** of $X$ and is often designated by $\mu$.

The expected value of $(X - \mu)^2$ is called the **variance** of $X$. The variance is denoted as:

$$\text{Var}(X) = E[(X - \mu)^2]$$

The positive square root of the variance, $\sqrt{\text{Var}(X)}$, is called the **standard deviation**. The symbols $\sigma^2$ and $\sigma$ (sigma squared and sigma) represent the variance and standard deviation, respectively.

Both the **variance** and the **standard deviation** measure the spread or dispersion of the values of the random variable about its mean. The standard deviation is expressed in the same units as $X$, whereas the variance is expressed in the square of these units.


---

### **Variances**

The variance of $X$ can be calculated directly from the following definition:

$$\sigma^2 = E[(X - \mu)^2] = V[X]$$

However, it is usually easier to calculate with this equivalent formula:

$$V[X] = E[(X - E[X])^2] = E[X^2] - (E[X])^2$$

The definition extends to a function $f(X)$ of the random variable:

$$V[f(X)] = E[(f(X) - E[f(X)])^2] = E[f(X)^2] - (E[f(X)])^2$$

If the variance is small, then the realizations of $X$ will be tightly clustered around $E[X]$, and if the variance is large, there is more variability in values taken on by $X$. By the linearity of expectation, the variance has the following property:

$$V[cX] = c^2 V[X]$$

so that the variance of $X$ times a constant $c$ is equal to the variance of $X$ times the square of $c$.
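
The sketch below checks that the definition and the shortcut formula agree, and that scaling by $c$ multiplies the variance by $c^2$. It assumes a fair six-sided die and $c = 3$, both chosen only for illustration.

```python
import math
from fractions import Fraction

# Distribution of a fair six-sided die (hypothetical example)
pmf = {x: Fraction(1, 6) for x in range(1, 7)}

def expectation(pmf, g=lambda x: x):
    """E[g(X)] for a discrete distribution given as {value: probability}."""
    return sum(g(x) * p for x, p in pmf.items())

def variance_definition(pmf):
    """V[X] = E[(X - mu)^2]."""
    mu = expectation(pmf)
    return expectation(pmf, lambda x: (x - mu) ** 2)

def variance_shortcut(pmf):
    """V[X] = E[X^2] - (E[X])^2."""
    return expectation(pmf, lambda x: x * x) - expectation(pmf) ** 2

c = 3
scaled_pmf = {c * x: p for x, p in pmf.items()}  # distribution of cX

var_x = variance_definition(pmf)
print(var_x, variance_shortcut(pmf))            # 35/12 35/12
print(variance_definition(scaled_pmf))          # 105/4, which is c**2 * 35/12
print(math.sqrt(var_x))                         # standard deviation, in the units of X
```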

--- 

### **Covariances**

For two random variables $X$ and $Y$, the covariance $\text{Cov}(X,Y)$ is defined as:

$$\text{Cov}(X,Y) = E[(X - \mu_X)(Y - \mu_Y)]$$

where $\mu_X$ and $\mu_Y$ are the means of $X$ and $Y$, respectively. Expanding the product and applying the linearity of expectation gives:

$$\text{Cov}(X,Y) = E[(X - E[X])(Y - E[Y])] = E[XY] - E[X]E[Y]$$

From its definition, we see that covariance satisfies the following properties:

$$\text{Cov}(X,Y) = \text{Cov}(Y,X)$$

and

$$\text{Cov}(X,X) = \text{Var}(X)$$

Another property of covariance, which immediately follows from its definition, is that for any constant $a$:

$$\text{Cov}(aX,Y) = a\text{Cov}(X,Y)$$

If $X$ and $Y$ are independent random variables, then

$$\text{Cov}(X,Y) = 0$$

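Since $\text{Var}(X + Y) = \text{Var}(X) + \text{Var}(Y) + 2\,\text{Cov}(X,Y)$ in general, the covariance terms vanish for independent $X_1, \ldots, X_n$: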

$$\text{Var}\left(\sum_{i=1}^n X_i\right) = \sum_{i=1}^n \text{Var}(X_i)$$
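
The covariance properties above can be verified by direct enumeration. The sketch below assumes an independent fair coin $X$ (values 0 and 1) and fair 4-sided die $Y$ (values 1 to 4). The joint distribution and the helper names are hypothetical.

```python
from fractions import Fraction
from itertools import product

# Joint distribution of an independent fair coin and fair 4-sided die
# (hypothetical example): P(X = x, Y = y) = 1/2 * 1/4.
joint = {(x, y): Fraction(1, 2) * Fraction(1, 4)
         for x, y in product((0, 1), (1, 2, 3, 4))}

def expectation(h):
    """E[h(X, Y)] over the joint distribution above."""
    return sum(h(x, y) * p for (x, y), p in joint.items())

def cov(u, v):
    """Cov(U, V) = E[UV] - E[U]E[V] for functions u, v of (x, y)."""
    return expectation(lambda x, y: u(x, y) * v(x, y)) - expectation(u) * expectation(v)

def X(x, y):
    return x

def Y(x, y):
    return y

def total(x, y):
    return x + y

print(cov(X, Y))                     # 0, since X and Y are independent
print(cov(X, X), cov(Y, Y))          # 1/4 5/4, i.e. Var(X) and Var(Y)
print(cov(total, total))             # 3/2 = Var(X) + Var(Y)
print(cov(lambda x, y: 3 * x, X))    # 3/4 = 3 * Cov(X, X)
```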

---

### **Worksheet Examples**

**Q1) Random Variable Expectation: Let $X$ be the outcome of a fair coin flip and $Y$ be the outcome of a fair 4-sided die roll. What is $E[X + Y]$?**

- a) 3
- b) 0.5
- c) 0.125
- d) 2.5

<br>

**Answer: a** <br>
*Explanation: This problem involves applying the linearity of expectation.*

For a fair coin $X$, the possible values are taken to be 0 (tails) and 1 (heads).

For a fair 4-sided die $Y$, the possible values are 1, 2, 3, and 4.

Step 1: Find the expectation of $X$:

$$E[X] = 0 \times P(X=0) + 1 \times P(X=1)$$

$$E[X] = 0 \times \frac{1}{2} + 1 \times \frac{1}{2} = \frac{1}{2} = 0.5$$

Step 2: Find the expectation of $Y$:

$$E[Y] = 1 \times P(Y=1) + 2 \times P(Y=2) + 3 \times P(Y=3) + 4 \times P(Y=4)$$

$$E[Y] = 1 \times \frac{1}{4} + 2 \times \frac{1}{4} + 3 \times \frac{1}{4} + 4 \times \frac{1}{4}$$

$$E[Y] = \frac{1 + 2 + 3 + 4}{4} = \frac{10}{4} = 2.5$$

Step 3: Use the linearity of expectation:

$$E[X + Y] = E[X] + E[Y]$$

$$E[X + Y] = 0.5 + 2.5 = 3$$

Therefore, the expectation of $X + Y$ is 3.
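
The result can be cross-checked by enumerating all eight equally likely (coin, die) pairs, and approximately by simulation. This is a minimal sketch that assumes the coin is coded as 0 and 1, as in the explanation above. The seed and sample size are arbitrary choices.

```python
import random
from fractions import Fraction
from itertools import product

coin = (0, 1)        # tails = 0, heads = 1
die = (1, 2, 3, 4)   # faces of the 4-sided die

# Exact value: every (x, y) pair has probability 1/8.
p = Fraction(1, len(coin) * len(die))
expected_sum = sum((x + y) * p for x, y in product(coin, die))
print(expected_sum)  # 3

# Monte Carlo estimate: the long-run average of X + Y should be close to 3.
rng = random.Random(0)
n = 100_000
simulated = sum(rng.choice(coin) + rng.choice(die) for _ in range(n)) / n
print(simulated)
```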
