# Concept of a Random Variable in Probability

## **1. Definition of a Random Variable**
A **random variable** (denoted by $ X $) is a function that assigns a numerical value to each possible outcome of a random experiment.

Mathematically, a random variable is a function:

$$
X: \Omega \to \mathbb{R}
$$

where:
- $ \Omega $ is the sample space (all possible outcomes),
- $ \mathbb{R} $ is the set of real numbers,
- $ X(\omega) $ maps each outcome $ \omega $ to a number.

---

## **2. Example: Coin Toss**
Consider a fair coin toss where:
- Sample space: $ \Omega = \{\text{Heads}, \text{Tails}\} $.

We define a random variable $ X $ as:

$$
X(\text{Heads}) = 1, \quad X(\text{Tails}) = 0
$$

Here, $ X $ converts qualitative outcomes into **numerical values**, making them useful for mathematical analysis.
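
The mapping above can be sketched in Python as an ordinary function applied to simulated outcomes. This is only an illustration: the names `OMEGA`, `X`, and `rng`, the fixed seed, and the choice of 10 tosses are assumptions made for the example.

```python
import random

# Sample space for one coin toss (labels are illustrative).
OMEGA = ["Heads", "Tails"]

def X(outcome: str) -> int:
    """Random variable: map a qualitative outcome to a number."""
    return 1 if outcome == "Heads" else 0

# Simulate the experiment, then apply X to each outcome.
rng = random.Random(0)  # fixed seed so runs are repeatable
outcomes = [rng.choice(OMEGA) for _ in range(10)]
values = [X(w) for w in outcomes]

print(outcomes)
print(values)       # a list of 0s and 1s
print(sum(values))  # number of heads among the 10 tosses
```

Because the outcomes are now numbers, quantities such as the count or proportion of heads reduce to plain arithmetic on `values`.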

---

## **3. Types of Random Variables**
### **(A) Discrete Random Variable**
A **discrete random variable** takes a **countable** number of distinct values.  

#### **Example 1: Rolling a Die**
- $ X $ = Outcome of rolling a fair 6-sided die.
- Possible values: $ X \in \{1, 2, 3, 4, 5, 6\} $.

#### **Example 2: Number of Heads in 3 Coin Tosses**
- Possible values: $ X \in \{0, 1, 2, 3\} $.
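
As a minimal sketch, assuming fair coins so that all eight outcomes are equally likely, this random variable can be built by enumerating the sample space and counting heads in each outcome. The names `sample_space` and `num_heads` are illustrative.

```python
from collections import Counter
from fractions import Fraction
from itertools import product

# All 2**3 = 8 equally likely outcomes of three fair coin tosses.
sample_space = list(product("HT", repeat=3))

def num_heads(outcome):
    """Random variable X: count the heads in one outcome."""
    return outcome.count("H")

# Count how many outcomes map to each value of X, then divide by 8.
counts = Counter(num_heads(w) for w in sample_space)
pmf = {x: Fraction(c, len(sample_space)) for x, c in sorted(counts.items())}

for x, p in pmf.items():
    print(x, p)
```

The values 1 and 2 are the most likely because three of the eight outcomes map to each of them, while only one outcome maps to 0 and one to 3.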

### **(B) Continuous Random Variable**
A **continuous random variable** can take an **uncountable** number of values, typically from an interval in $ \mathbb{R} $.  

#### **Example 1: Height of People**
- Let $ X $ be the height of a randomly chosen person (e.g., 170.2 cm, 173.8 cm).
- $ X $ can take any real value in an interval (e.g., $ 150 \leq X \leq 200 $).

#### **Example 2: Time to Complete a Task**
- Let $ X $ be the time (in seconds) taken to complete a task.
- $ X $ can take any value in $ (0, \infty) $.

---

## **What is a Probability Distribution?**

A probability distribution is a mathematical function that describes the probabilities of different outcomes in a random experiment. It essentially provides a complete picture of the possible values a random variable can take on, along with their associated probabilities.   

Key Points

- Random Variable: A random variable is a variable whose value is a numerical outcome of a random phenomenon. It can be discrete (e.g., the number of heads in 3 coin flips) or continuous (e.g., a person's height).   
- Outcomes and Probabilities: For a discrete random variable, a probability distribution lists every possible value and assigns a probability to each one. For a continuous random variable, it assigns probabilities to intervals of values instead.
- Types of Distributions: There are many different types of probability distributions, each with its own unique characteristics and applications. Some common examples include the normal distribution, binomial distribution, and Poisson distribution. 

There are two main types of probability distributions:

- Discrete Probability Distribution → Defined for discrete random variables.
- Continuous Probability Distribution → Defined for continuous random variables.

## **4. Probability Distribution of a Random Variable**
A **probability distribution** describes how likely each value of a random variable is.

### **(A) Probability Mass Function (PMF) for Discrete RVs**
For a discrete random variable $ X $ that takes values $ x_1, x_2, \dots $ (finitely or countably many), the **probability mass function (PMF)**, denoted $ p(x) $, gives the probability of each possible value:
$$
p(x) = P(X = x)
$$
where $ P(X = x) $ is the probability that the random variable $ X $ takes the value $ x $.

Properties
- Non-Negativity: $ p(x) \geq 0 $ for all $ x $
- Normalization: $ \sum_x p(x) = 1 $, where the sum is taken over all possible values of $ X $
- Discrete: $ p(x) $ can be positive only at the countably many values that $ X $ can take; it is 0 everywhere else

#### **Example: Rolling a Fair Die**
$$
P(X = x) =
\begin{cases}
\frac{1}{6}, & x \in \{1, 2, 3, 4, 5, 6\} \\
0, & \text{otherwise}
\end{cases}
$$
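
The die PMF above can be written as a small function and checked against the non-negativity and normalization properties. This is a sketch using exact fractions; the names `die_pmf` and `p_even` are illustrative.

```python
from fractions import Fraction

def die_pmf(x):
    """PMF of a fair six-sided die: 1/6 on {1, ..., 6}, 0 otherwise."""
    return Fraction(1, 6) if x in {1, 2, 3, 4, 5, 6} else Fraction(0)

support = range(1, 7)

# Non-negativity and normalization.
assert all(die_pmf(x) >= 0 for x in support)
assert sum(die_pmf(x) for x in support) == 1

# Probability of an event: add the PMF over the values in the event.
p_even = sum(die_pmf(x) for x in (2, 4, 6))
print(p_even)  # 1/2
```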

---

### **(B) Probability Density Function (PDF) for Continuous RVs**
For a continuous random variable $ X $, the **probability density function (PDF)** $ f(x) $ describes how probability is distributed over the real line. The probability of any single value is 0, because the integral of $ f $ over a single point is zero. Probabilities are therefore computed over intervals:

$$
P(a \leq X \leq b) = \int_{a}^{b} f(x) \,dx
$$

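Key Properties of PDF: $ f(x) \geq 0 $ for all $ x $, and $ \int_{-\infty}^{\infty} f(x) \,dx = 1 $. Because $ f(x) $ is a density rather than a probability, it may exceed 1 at some points.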
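
To make the interval formula concrete, the sketch below approximates the integral numerically. The exponential density $ f(x) = e^{-x} $ for $ x \geq 0 $ is an assumption chosen only because its integral has a simple closed form to compare against. The helper `integrate` is a hand-written midpoint rule, not a library routine.

```python
import math

def pdf(x, rate=1.0):
    """Exponential density (illustrative choice): rate * exp(-rate * x) for x >= 0."""
    return rate * math.exp(-rate * x) if x >= 0 else 0.0

def integrate(f, a, b, n=100_000):
    """Midpoint-rule approximation of the integral of f over [a, b]."""
    h = (b - a) / n
    return h * sum(f(a + (i + 0.5) * h) for i in range(n))

p_interval = integrate(pdf, 1.0, 2.0)    # P(1 <= X <= 2)
exact = math.exp(-1.0) - math.exp(-2.0)  # closed form for comparison
total = integrate(pdf, 0.0, 50.0)        # the mass beyond 50 is negligible

print(p_interval, exact, total)
```

The first two printed numbers should agree to many decimal places, and the third should be very close to 1, which matches the normalization property.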

#### **Example: Normal Distribution**
Human height is often modeled with a **Normal distribution**, written $ X \sim \mathcal{N}(\mu, \sigma^2) $, whose PDF is:

$$
f(x) = \frac{1}{\sqrt{2 \pi \sigma^2}} e^{-\frac{(x - \mu)^2}{2 \sigma^2}}
$$

where:
- $ \mu $ is the mean height,
- $ \sigma^2 $ is the variance.
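
Python's standard library provides `statistics.NormalDist`, which can be used to evaluate this density and interval probabilities. The parameters below (mean 170 cm, standard deviation 10 cm) are hypothetical values picked for illustration, not figures from real data. Note that `NormalDist` takes the standard deviation $ \sigma $, not the variance $ \sigma^2 $.

```python
from statistics import NormalDist

# Hypothetical parameters: mean 170 cm, standard deviation 10 cm.
heights = NormalDist(mu=170.0, sigma=10.0)

# Value of the density at the mean (a density, not a probability).
density_at_mean = heights.pdf(170.0)

# P(160 <= X <= 180), computed as a difference of CDF values.
p_160_to_180 = heights.cdf(180.0) - heights.cdf(160.0)

print(density_at_mean)
print(p_160_to_180)  # roughly 0.68: within one standard deviation of the mean
```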

### **(C) Cumulative Distribution Function (CDF) for Continuous RVs**
For a continuous random variable $ X $, the **cumulative distribution function (CDF)** gives the probability that the random variable is less than or equal to a given value:
$$
F(x) = P(X \leq x) = \int_{-\infty}^x f(t) \,dt
$$

The PDF $ f(x) $ can be obtained from the CDF by differentiation, wherever $ F $ is differentiable:
$$
f(x) = \frac{dF(x)}{dx}
$$

Properties of CDF:

- Bounded: $ 0 \leq F(x) \leq 1 $ for all $ x $
- $ F(x) $ is monotonically nondecreasing, i.e. if $ x < y $ then $ F(x) \leq F(y) $
- $ F(x) \to 0 $ as $ x \to -\infty $ and $ F(x) \to 1 $ as $ x \to \infty $
- If $ X $ is discrete and takes integer values, the PMF and the CDF can be obtained from each other by summing or differencing:
$$
F_X(k) = \sum_{x=-\infty}^k p_X(x)
$$
$$
p_X(k) = F_X(k) - F_X(k-1) = P(X \leq k) - P(X \leq k-1)
$$
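
The summing and differencing relations can be checked on a fair die. This is a minimal sketch with exact fractions; the variable names are illustrative.

```python
from fractions import Fraction
from itertools import accumulate

values = [1, 2, 3, 4, 5, 6]
pmf = {k: Fraction(1, 6) for k in values}

# CDF by cumulative summation: F(k) = sum of p(x) for x <= k.
cdf = dict(zip(values, accumulate(pmf[k] for k in values)))

# PMF recovered by differencing: p(k) = F(k) - F(k - 1), with F(0) = 0 here.
recovered = {k: cdf[k] - cdf.get(k - 1, Fraction(0)) for k in values}

print(cdf[3])            # 1/2
print(recovered == pmf)  # True
```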

---

## **5. Expected Value (Mean) and Variance**
### **(A) Expected Value (Mean)**
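The **expected value** (or mean) of a random variable $ X $ is the probability-weighted average of the values it takes. It is also the long-run average of $ X $ over many independent repetitions of the experiment.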

For a **discrete** random variable:

$$
E[X] = \sum_{i} x_i P(X = x_i)
$$

For a **continuous** random variable:

$$
E[X] = \int_{-\infty}^{\infty} x f(x) \,dx
$$
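
For a fair die, the discrete formula can be evaluated exactly and compared with a simulated average. This is a sketch: the seed and the number of simulated rolls are arbitrary choices.

```python
from fractions import Fraction
import random

# Exact expected value: sum of x * P(X = x).
pmf = {x: Fraction(1, 6) for x in range(1, 7)}
expected = sum(x * p for x, p in pmf.items())
print(expected)  # 7/2

# Long-run average of many simulated rolls.
rng = random.Random(0)  # fixed seed so runs are repeatable
rolls = [rng.randint(1, 6) for _ in range(100_000)]
sample_mean = sum(rolls) / len(rolls)
print(sample_mean)  # close to 3.5
```

The expected value 3.5 is not itself a possible outcome of the die, which shows that $ E[X] $ is an average rather than a typical value.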

### **(B) Variance**
The **variance**, denoted $ \operatorname{Var}(X) $ or $ \sigma^2 $, measures how spread out the values of a random variable are around its mean. It is the expected squared deviation of $ X $ from its expected value. A higher variance means the values are more spread out, while a lower variance means they cluster closer to the mean.

$$
\operatorname{Var}(X) = E[(X - E[X])^2]
$$

$ (X - E[X])^2 $: Squaring the deviation serves two purposes:
- It makes every deviation non-negative. We're interested in the magnitude of the deviation, not its direction.
- It gives more weight to larger deviations. A larger deviation contributes more to the variance than a smaller deviation.

$ E[(X - E[X])^2] $: This is the expected value of the squared deviation, i.e. the average of the squared deviations, weighted by their probabilities (in the discrete case) or by the probability density function (in the continuous case).

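There's another formula for variance that is often more convenient for calculations. It follows from expanding the square and using the linearity of expectation:

$$
\operatorname{Var}(X) = E[X^2] - (E[X])^2
$$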
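
Both variance formulas can be compared on a fair die. This is a small sketch with exact fractions; the helper `expectation` is a hypothetical name introduced for the example.

```python
from fractions import Fraction

pmf = {x: Fraction(1, 6) for x in range(1, 7)}

def expectation(g):
    """E[g(X)] for the discrete PMF above."""
    return sum(g(x) * p for x, p in pmf.items())

mean = expectation(lambda x: x)

# Definition: expected squared deviation from the mean.
var_definition = expectation(lambda x: (x - mean) ** 2)

# Computational formula: E[X^2] - (E[X])^2.
var_shortcut = expectation(lambda x: x ** 2) - mean ** 2

print(mean, var_definition, var_shortcut)  # 7/2 35/12 35/12
```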

---

## **6. Summary Table: When to Use Each**
| **Feature**           | **Discrete Random Variable** | **Continuous Random Variable** |
|----------------------|---------------------------|---------------------------|
| **Definition**       | Takes countable values    | Takes uncountable values  |
| **Example**         | Number of heads in coin flips | Time taken to finish a race |
| **Described By**      | Probability Mass Function (PMF) | Probability Density Function (PDF) |
| **Probability Calculation** | $ P(X = x) = p(x) $ | $ P(a \leq X \leq b) = \int_a^b f(x) \,dx $ |
| **Expectation (Mean)** | $ E[X] = \sum_x x P(X = x) $ | $ E[X] = \int_{-\infty}^{\infty} x f(x) \,dx $ |