# 🧩 Tabular Comparison of the Z-Score and IQR Methods

| **Aspect**                      | **Z-Score Method**                                         | **IQR (Interquartile Range) Method**                             |
| ------------------------------- | ---------------------------------------------------------- | ---------------------------------------------------------------- |
| **Full Form**                   | Standard Score                                             | Interquartile Range                                              |
| **Formula**                     | $$ Z = \frac{X - \mu}{\sigma} $$                           | IQR = Q3 − Q1                                                    |
| **Outlier Criterion**           | \|Z\| > threshold (usually 3)                              | Values < (Q1 − 1.5×IQR) or > (Q3 + 1.5×IQR)                      |
| **Basis**                       | Mean and Standard Deviation                                | Quartiles (Q1, Q3)                                               |
| **Assumption**                  | Data should follow an approximately **Normal Distribution** | Works for **any distribution** (especially skewed ones)          |
| **Sensitivity to Outliers**     | Highly sensitive (outliers affect mean & std)              | Robust (quartiles barely move when a few values are extreme)     |
| **Preferred Data Type**         | Continuous numerical data                                  | Continuous or ordinal data                                       |
| **When to Use**                 | When data is approximately **bell-shaped (Gaussian)**      | When data is **skewed** or **non-normal**                        |
| **Ease of Computation**         | Requires mean & std deviation                              | Requires quartiles (Q1 & Q3)                                     |
| **Example Thresholds**          | \|Z\| > 3                                                  | 1.5×IQR (or 3×IQR to flag only extreme outliers)                 |
| **Performance**                 | Fast and suitable for large, well-behaved datasets         | Slightly slower due to quantile calculation, but still efficient |
| **Effect of Extreme Outliers**  | Can inflate the mean and σ, so outliers may hide themselves | Largely unaffected, since Q1 and Q3 ignore the tails             |
| **Implementation**              | `scipy.stats.zscore()` or manual formula                   | `np.percentile()` or `df.quantile()`                             |
| **Visualization Compatibility** | Works well with histograms (normal curve)                  | Works well with boxplots (Q1, Q3, whiskers)                      |
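
The two outlier criteria in the table can be sketched in a few lines of Python. This is a minimal illustration rather than a reference implementation: it uses only the standard library so it runs without extra packages, and the function names `zscore_outliers` and `iqr_outliers`, the parameter `k`, and the sample data are all made up for this example. The population standard deviation is used here, which is also the default (`ddof=0`) of `scipy.stats.zscore()`.

```python
import statistics


def zscore_outliers(values, threshold=3.0):
    """Return the values whose |Z| = |x - mean| / std exceeds `threshold`."""
    mean = statistics.fmean(values)
    std = statistics.pstdev(values)  # population std (ddof=0)
    if std == 0:
        return []  # all values identical, nothing can be an outlier
    return [x for x in values if abs((x - mean) / std) > threshold]


def iqr_outliers(values, k=1.5):
    """Return the values outside [Q1 - k*IQR, Q3 + k*IQR]."""
    # method="inclusive" interpolates linearly between data points.
    q1, _, q3 = statistics.quantiles(values, n=4, method="inclusive")
    iqr = q3 - q1
    lower, upper = q1 - k * iqr, q3 + k * iqr
    return [x for x in values if x < lower or x > upper]


# Hypothetical sample: 20 well-behaved readings plus one extreme value.
sample = [48, 49, 50, 51, 52] * 4 + [120]

print(zscore_outliers(sample))  # [120]
print(iqr_outliers(sample))     # [120]
```

On a large, roughly symmetric sample like this one, both rules agree. Passing `k=3` to `iqr_outliers` widens the fences so that only the most extreme points are flagged.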


## ⚖️ Which One is Better?

#### ✅ If your data is normally distributed → Z-Score is better, because it measures deviations from the mean accurately and efficiently.

#### ✅ If your data is skewed or has many outliers → IQR is better, because it’s robust: the quartiles are barely affected by extreme values.
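
The robustness point is easiest to see on a small sample. The sketch below uses invented data (seven ordinary values and one obvious outlier) and only the standard library. The outlier inflates the mean and standard deviation so much that its own Z-score stays below 3, while the IQR fences still catch it. With the population standard deviation, no point in a sample of n values can have \|Z\| above √(n − 1), so for n = 8 the usual threshold of 3 can never be reached.

```python
import statistics

# Hypothetical small sample: seven ordinary values and one obvious outlier.
data = [10, 11, 12, 13, 14, 15, 16, 1000]

# Z-score view: the outlier drags the mean and std along with it.
mean = statistics.fmean(data)
std = statistics.pstdev(data)  # population std (ddof=0)
max_abs_z = max(abs((x - mean) / std) for x in data)

# IQR view: the quartiles are computed from the middle of the data.
q1, _, q3 = statistics.quantiles(data, n=4, method="inclusive")
iqr = q3 - q1
lower_fence = q1 - 1.5 * iqr
upper_fence = q3 + 1.5 * iqr
iqr_flagged = [x for x in data if x < lower_fence or x > upper_fence]

print(f"largest |Z| = {max_abs_z:.2f}")  # about 2.65, below the threshold of 3
print(f"IQR fences  = [{lower_fence}, {upper_fence}]")
print(f"IQR flagged = {iqr_flagged}")    # [1000]
```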

## 💬 In Practice:

| **Scenario**              | **Recommended Method**             |
| ------------------------- | ---------------------------------- |
| Bell-shaped (Normal) data | Z-Score                            |
| Skewed data               | IQR                                |
| Small datasets            | IQR (less sensitive to distortion) |
| Large, clean datasets     | Z-Score                            |
| Mixed distributions       | Try both and compare visually      |
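
For the "try both and compare" row, a numerical comparison can complement the visual one. The following is a rough sketch under stated assumptions: `compare_methods` is a hypothetical helper, the right-skewed sample is invented, and the thresholds are the defaults from the tables above. It reports which values are flagged by both rules and which by only one of them.

```python
import statistics


def compare_methods(values, z_threshold=3.0, k=1.5):
    """Split flagged values into: both rules, Z-score only, IQR only."""
    mean = statistics.fmean(values)
    std = statistics.pstdev(values)  # population std (ddof=0)
    z_flagged = set()
    if std > 0:
        z_flagged = {x for x in values if abs((x - mean) / std) > z_threshold}

    q1, _, q3 = statistics.quantiles(values, n=4, method="inclusive")
    iqr = q3 - q1
    iqr_flagged = {x for x in values if x < q1 - k * iqr or x > q3 + k * iqr}

    return {
        "both": sorted(z_flagged & iqr_flagged),
        "z_only": sorted(z_flagged - iqr_flagged),
        "iqr_only": sorted(iqr_flagged - z_flagged),
    }


# Hypothetical right-skewed sample with a long upper tail.
skewed = [1, 1, 2, 2, 2, 3, 3, 3, 4, 4, 5, 6, 8, 12, 20, 60]

print(compare_methods(skewed))
# {'both': [60], 'z_only': [], 'iqr_only': [20]}
```

Values that appear under only one key are the ones worth inspecting on a histogram or boxplot before deciding whether to drop them.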


## 🧠 Short Summary

| Z-Score                              | IQR                                  |
| ------------------------------------ | ------------------------------------ |
| Works best for **normal data**       | Works best for **skewed data**       |
| Based on **mean and std**            | Based on **quartiles (Q1, Q3)**      |
| **Sensitive** to extreme values      | **Resistant** to extreme values      |
| Formula-based threshold (\|Z\| > 3)  | Range-based threshold (1.5×IQR rule) |


## 🏁 Final Verdict

#### 👉 If you know your data is approximately normally distributed, use Z-Score: it’s mathematically elegant and fast to compute.
#### 👉 If you don’t know the distribution, or your data is skewed or messy, default to IQR: it’s safer and more robust.