# **Measures of Central Tendency**

Central tendency refers to statistical measures that **summarize an entire data distribution with a single representative value**.  
The three main measures are the **mean, median, and mode.**

---

## **1. Mean (Average)**

### **Formula:**
For a **population**,  

* Mean (mu) = Sum of all values divided by the total number of values in the population.

For a **sample**, 

* Mean (x-bar) = Sum of all sample values divided by the total number of sample values.
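
In symbols, the two definitions above can be written as follows. The notation is an assumption chosen to match common convention rather than taken from the text: $x_i$ stands for the individual values, $N$ for the population size, and $n$ for the sample size.

$$
\mu = \frac{1}{N}\sum_{i=1}^{N} x_i
\qquad\qquad
\bar{x} = \frac{1}{n}\sum_{i=1}^{n} x_i
$$

The arithmetic is the same in both cases; only the set of values being averaged differs.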

### **Example:**

* Ages = {1, 3, 4, 5}  
* Mean = (1 + 3 + 4 + 5) ÷ 4 = 13 ÷ 4 = 3.25
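
The same calculation can be sketched in Python. This is a minimal illustration using only the standard library; the variable names `manual_mean` and `library_mean` are chosen here for clarity and are not part of the original notes.

```python
from statistics import mean

ages = [1, 3, 4, 5]

# Mean by definition: sum of the values divided by how many there are
manual_mean = sum(ages) / len(ages)

# The standard library gives the same result
library_mean = mean(ages)

print(manual_mean)   # 3.25
print(library_mean)  # 3.25
```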

---

### **Effect of Outlier:**
Ages = {1, 3, 4, 5, 100}  
**Mean (x-bar)** = (1 + 3 + 4 + 5 + 100) ÷ 5 = 113 ÷ 5 = 22.6

> The single **outlier (100)** pulls the mean from 3.25 to 22.6, a value larger than four of the five ages, which makes the mean **less reliable** for skewed data.
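
A short sketch of the outlier effect, assuming the two age lists used above. The helper `arithmetic_mean` is a hypothetical name introduced only for this illustration.

```python
def arithmetic_mean(values):
    """Sum of the values divided by the number of values."""
    return sum(values) / len(values)

ages = [1, 3, 4, 5]
ages_with_outlier = ages + [100]

mean_before = arithmetic_mean(ages)
mean_after = arithmetic_mean(ages_with_outlier)

# How many observations lie below the new mean?
below_mean = sum(1 for age in ages_with_outlier if age < mean_after)

print(mean_before)  # 3.25
print(mean_after)   # 22.6
print(below_mean)   # 4
```

Four of the five values sit below the new mean, so it no longer describes a "typical" age in this dataset.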

---

## **2. Median**

The **median** is the **middle value** of the dataset when its values are arranged in order.

### **Steps to Find Median:**
1. Arrange the data in ascending order.  
2. If **n is odd**, the median is the middle number.  
3. If **n is even**, the median is the average of the two middle numbers.
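
The three steps above can be sketched as a small function. This is an illustrative implementation, not a replacement for library routines such as `statistics.median` or `numpy.median`; the function name `find_median` is an assumption made for this example.

```python
def find_median(values):
    """Return the median by following the three steps literally."""
    if not values:
        raise ValueError("median of empty data is undefined")

    # Step 1: arrange the data in ascending order
    ordered = sorted(values)
    n = len(ordered)
    mid = n // 2

    # Step 2: odd n, so the median is the single middle number
    if n % 2 == 1:
        return ordered[mid]

    # Step 3: even n, so the median is the average of the two middle numbers
    return (ordered[mid - 1] + ordered[mid]) / 2

print(find_median([4, 3, 1, 5, 100]))       # 4
print(find_median([1, 3, 4, 5, 100, 200]))  # 4.5
```

The two calls use the same datasets as the odd and even examples in this section.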

### **Example (Odd Case):**
Ages = {4, 3, 1, 5, 100}  
→ Sorted = {1, 3, 4, 5, 100}  
→ **Median = 4**

### **Example (Even Case):**
Ages = {1, 3, 4, 5, 100, 200}  
→ Sorted = {1, 3, 4, 5, 100, 200}  
→ **Median = (4 + 5) / 2 = 4.5**

---

## **3. Mode**

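The **mode** is the **most frequently occurring value** in a dataset. A dataset can have more than one mode, and when every value occurs exactly once there is no meaningful mode.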

### **Example:**
Ages = {1, 3, 3, 4, 5, 5, 5, 6}  
→ **Mode = 5** (5 occurs three times, more than any other value)
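
One way to find the mode is to count how often each value occurs. The sketch below uses `collections.Counter` and `statistics.multimode` (available from Python 3.8) from the standard library; the variable names are chosen for illustration only.

```python
from collections import Counter
from statistics import multimode

ages = [1, 3, 3, 4, 5, 5, 5, 6]

# Count occurrences of each value, then take the most frequent one
counts = Counter(ages)
mode_value, mode_count = counts.most_common(1)[0]
print(mode_value, mode_count)  # 5 3

# multimode returns every value tied for the highest frequency
modes = multimode(ages)
print(modes)  # [5]

# When all values are unique, every value is tied and no single mode stands out
all_unique_modes = multimode([1, 3, 4, 5, 100])
print(all_unique_modes)  # [1, 3, 4, 5, 100]
```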

---

### **Summary**
| Measure | Formula | Meaning | Sensitive to Outliers? |
|----------|----------|----------|------------------------|
| **Mean** | Sum of all values divided by the number of values | Average value | ✅ Yes |
| **Median** | Middle value of sorted data | Middle point | ❌ No |
| **Mode** | Most frequent value | Common value | ❌ No |
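
The last column of the table can be checked with a small experiment. The dataset below is hypothetical (it repeats one value so that a mode exists) and is not taken from the examples above; the dictionary `summary` is likewise introduced only for this sketch.

```python
from statistics import mean, median, mode

ages = [1, 3, 3, 4, 5]            # hypothetical data with a repeated value
ages_with_outlier = ages + [100]  # same data plus one extreme value

summary = {}
for label, data in [("without outlier", ages), ("with outlier", ages_with_outlier)]:
    summary[label] = {
        "mean": mean(data),
        "median": median(data),
        "mode": mode(data),
    }
    print(f"{label}: mean={summary[label]['mean']:.2f}, "
          f"median={summary[label]['median']}, mode={summary[label]['mode']}")
```

Under these assumptions the mean jumps from 3.20 to 19.33, while the median moves only from 3 to 3.5 and the mode stays at 3.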


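```python
# Example: Mean, Median, and Mode Calculation in Python

import numpy as np
from scipy import stats

# Data
ages = [1, 3, 4, 5, 100]

# Mean
mean = np.mean(ages)

# Median
median = np.median(ages)

# Mode (the keepdims argument requires SciPy >= 1.9).
# Every value in this dataset occurs exactly once, so there is no
# meaningful mode; stats.mode breaks the tie by returning the smallest value.
mode = stats.mode(ages, keepdims=True)[0][0]

print(f"Data: {ages}")
print(f"Mean: {mean:.2f}")
print(f"Median: {median}")
print(f"Mode: {mode}")
```

Output:

```
Data: [1, 3, 4, 5, 100]
Mean: 22.60
Median: 4.0
Mode: 1
```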
