# Lecture Summary: Combining Random Variables

This lecture introduces combining random variables: adding or subtracting them to describe the total effect of several measurements or process steps. The running example is a company's payroll pipeline, where the time departments take to approve time cards and the time payroll takes to process them are treated as random variables \(X\) and \(Y\).

## Key Points
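- **Independence and Units**: The random variables must share the same units for a sum or difference to be meaningful, and they must be independent for the variance rule below to hold (the mean rule holds even without independence).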
- **Mean, Variance, and Standard Deviation**: The lecture explains how to compute these for the sum or difference of two random variables.
  - The **mean** of the sum or difference is the sum or difference of the means.
  - The **variance** of the sum *or* the difference is the sum of the variances: subtracting one variable from another still adds variability, so spread never cancels out.
  - The **standard deviation** is the square root of the combined variance; standard deviations themselves do not add.
- **Practical Application**: The mean and spread of a combined variable can be computed directly from each variable's summary statistics, without going back to the raw data.
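
Written out, the rules above take the following form. The symbols \(\mu\) (mean) and \(\sigma\) (standard deviation) are notation assumed here rather than taken from the lecture, and \(X\) and \(Y\) are assumed independent:

\[
\begin{aligned}
\mu_{X \pm Y} &= \mu_X \pm \mu_Y \\
\sigma^2_{X \pm Y} &= \sigma^2_X + \sigma^2_Y \\
\sigma_{X \pm Y} &= \sqrt{\sigma^2_X + \sigma^2_Y}
\end{aligned}
\]

Note the plus sign in the variance line for both the sum and the difference.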

This approach simplifies analyzing combined random variables, offering a method to quickly derive new insights using existing summary statistics.
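
A quick simulation can sanity-check these rules. The sketch below is illustrative only: the normal distributions, their parameters, and the helper `summarize` are assumptions made for the example, not values from the lecture. It draws two independent samples and compares the empirical statistics of \(X + Y\) and \(X - Y\) with what the rules predict.

```python
import numpy as np

# Hypothetical parameters chosen only for illustration (not from the lecture).
rng = np.random.default_rng(seed=0)
n = 200_000
x = rng.normal(loc=2.0, scale=0.5, size=n)  # approval time in hours
y = rng.normal(loc=4.0, scale=1.2, size=n)  # payroll time in hours, drawn independently of x


def summarize(label, values):
    """Print the empirical mean, variance, and standard deviation of a sample."""
    print(f"{label}: mean={values.mean():.3f}, var={values.var():.3f}, sd={values.std():.3f}")


summarize("X + Y", x + y)
summarize("X - Y", x - y)

# Prediction from the rules: the variances add for both the sum and the difference.
predicted_var = 0.5**2 + 1.2**2
print(f"Predicted: var={predicted_var:.3f}, sd={np.sqrt(predicted_var):.3f}")
```

Both empirical variances should land close to the predicted value, while the two means differ (sum of the means versus difference of the means).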


In [1]:
# Python Code Cell Example
import numpy as np

# Example data for random variables X (time for department approval) and Y (time for payroll processing)
X = np.array([1, 2, 2, 3]) # Hours
Y = np.array([2, 3, 5, 6]) # Hours

# Population mean, variance, and standard deviation of each variable
# (ddof=0 treats the four listed values as the whole distribution)
mean_X, mean_Y = np.mean(X), np.mean(Y)
variance_X, variance_Y = np.var(X, ddof=0), np.var(Y, ddof=0)
std_dev_X, std_dev_Y = np.sqrt(variance_X), np.sqrt(variance_Y)

# Combined statistics for the sum X + Y, assuming X and Y are independent
combined_mean = mean_X + mean_Y
combined_variance = variance_X + variance_Y  # variances add
combined_std_dev = np.sqrt(combined_variance)  # not std_dev_X + std_dev_Y

print(f"Combined Mean: {combined_mean} hours")
print(f"Combined Variance: {combined_variance} hours^2")
print(f"Combined Standard Deviation: {combined_std_dev:.3f} hours")


Combined Mean: 6.0 hours
Combined Variance: 3.0 hours^2
Combined Standard Deviation: 1.732 hours
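
The combined variance of 3.0 relies on the independence assumption. As a hedged illustration of why that matters, the sketch below treats the two arrays as paired observations (an assumption; the example does not say the values are paired) and computes the variance of the element-wise sum and difference directly. Because the paired values rise together, a covariance term appears and the result no longer matches the simple rule.

```python
import numpy as np

X = np.array([1, 2, 2, 3])  # hours
Y = np.array([2, 3, 5, 6])  # hours

# Assumption for illustration: X[i] and Y[i] belong to the same time card.
var_sum = np.var(X + Y)   # variance of the paired sums
var_diff = np.var(X - Y)  # variance of the paired differences

# Population covariance of the paired values.
cov_xy = np.mean((X - X.mean()) * (Y - Y.mean()))

print(f"Var(X) + Var(Y): {np.var(X) + np.var(Y):.1f}")  # 3.0
print(f"Var(X + Y), paired: {var_sum:.1f}")             # 5.0
print(f"Var(X - Y), paired: {var_diff:.1f}")            # 1.0
print(f"Cov(X, Y): {cov_xy:.1f}")                       # 1.0
```

Here the paired variances equal 3.0 plus or minus twice the covariance, so the rule that variances add is recovered only when the covariance is zero, as it is for independent variables.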
