### Core Concepts
| **Concept**         | **Definition**                                                                                                          |
| ------------------- | ----------------------------------------------------------------------------------------------------------------------- |
| **Random Sampling** | A sampling method where **each member** of the population has an **equal chance** of being selected                     |
| **Sample Bias**     | A **systematic error** that occurs when some members of the population are **over- or under-represented** in the sample |
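
The two definitions above can be made concrete with a small simulation. This is a minimal sketch on made-up data: the population, the `region` and `spend` columns, and the spending figures are all assumptions chosen for illustration. It compares how far a simple random sample and an urban-only sample land from the true population mean.

```python
import numpy as np
import pandas as pd

rng = np.random.default_rng(0)

# Hypothetical population of 10,000 customers (all values are made up).
n = 10_000
region = rng.choice(["urban", "rural"], size=n, p=[0.3, 0.7])
# Assumption: urban customers spend more on average than rural ones.
spend = np.where(region == "urban", rng.normal(120, 15, n), rng.normal(80, 15, n))
population = pd.DataFrame({"region": region, "spend": spend})

# Simple random sample: every row has the same chance of selection.
random_sample = population.sample(n=500, random_state=0)

# Biased sample: rural customers have no chance of selection.
urban_only = population[population["region"] == "urban"].sample(n=500, random_state=0)

true_mean = population["spend"].mean()
random_error = abs(random_sample["spend"].mean() - true_mean)
biased_error = abs(urban_only["spend"].mean() - true_mean)

print(f"population mean:     {true_mean:.1f}")
print(f"random-sample error: {random_error:.1f}")
print(f"urban-only error:    {biased_error:.1f}")
```

The random sample's error comes from chance alone, so it tends to shrink as the sample grows. The urban-only error is systematic, so a larger sample does not fix it.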


### Comparative Table
| **Aspect**             | **Random Sampling**                                                 | **Sample Bias**                                                  |
| ---------------------- | ------------------------------------------------------------------- | ---------------------------------------------------------------- |
| **Goal**               | Ensure fairness and representation                                  | Not a goal: a usually unintended flaw that distorts the sample   |
| **Effect on Accuracy** | Promotes **statistical validity**                                   | **Compromises reliability** and generalizability of results      |
| **Cause**              | Implemented intentionally using algorithms/random functions         | Caused by flawed design, convenience sampling, or human judgment |
| **Example**            | Randomly selecting 100 customers from a database of 10,000          | Only surveying urban users in a nationwide customer survey       |
| **Detection**          | Sample distributions match the population within sampling error     | Skewed metrics, abnormal variance, or disproportionate groups    |
| **Resolution**         | Seeded `random.sample()` or `np.random.choice(..., replace=False)`  | Re-design sampling strategy, stratified sampling, or weighting   |
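
The two remedies in the Resolution row, stratified sampling and weighting, might look like the sketch below. The `customers` table, its 300/700 urban/rural split, and the deliberately skewed 80/20 sample are hypothetical. The weighting shown is a simple population-share over sample-share ratio, which is one common approach and not the only one.

```python
import pandas as pd

# Hypothetical customer table: 300 urban and 700 rural customers.
customers = pd.DataFrame({
    "customer_id": range(1000),
    "region": ["urban"] * 300 + ["rural"] * 700,
})

# Proportionate stratified sample: draw 10% from each region separately.
stratified = customers.groupby("region").sample(frac=0.1, random_state=42)
stratified_counts = stratified["region"].value_counts()

# A deliberately skewed sample (80 urban, 20 rural) to illustrate weighting.
skewed = pd.concat([
    customers[customers["region"] == "urban"].head(80),
    customers[customers["region"] == "rural"].head(20),
])

# Weight per region = population share / sample share.
pop_share = customers["region"].value_counts(normalize=True)
sample_share = skewed["region"].value_counts(normalize=True)
skewed = skewed.assign(weight=skewed["region"].map(pop_share / sample_share))

# After weighting, the urban share lines up with the population's 30%.
urban_weight = skewed.loc[skewed["region"] == "urban", "weight"].sum()
weighted_urban = urban_weight / skewed["weight"].sum()
print(stratified_counts)
print(f"weighted urban share: {weighted_urban:.2f}")
```

Stratifying fixes the group proportions at design time, while weighting corrects them after the data has been collected. Weighting can only adjust for groups that actually appear in the sample.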


### Python Example

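```python
import pandas as pd

# Load the sales data from a CSV file.
df = pd.read_csv("shoe_sales.csv")

# Draw 10 rows without replacement; random_state fixes the seed for reproducibility.
sample = df.sample(n=10, random_state=42)
```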


### Interview Questions & Model Answers

| **Question**                                    | **Model Answer**                                                                    |
| ----------------------------------------------- | ----------------------------------------------------------------------------------- |
| What is random sampling?                        | It's a method that ensures every unit has an **equal chance** of being chosen.      |
| Why is sample bias problematic in analysis?     | It leads to **non-generalizable conclusions** and **misrepresents the population**. |
| How can you detect sample bias?                 | Compare sample distributions to the population or use exploratory data analysis.    |
| What methods can prevent sample bias?           | Use **random**, **stratified**, or **systematic** sampling techniques.              |
| Is random sampling always enough to avoid bias? | Not always. A flawed **sampling frame** or **nonresponse** can still bias it.       |
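
For the detection question above, comparing sample and population distributions can be sketched as follows. The helper name `compare_shares` and the toy urban/rural data are hypothetical. The check assumes population-level proportions are known, which is not always the case in practice.

```python
import pandas as pd


def compare_shares(population: pd.Series, sample: pd.Series) -> pd.DataFrame:
    """Return per-category population share, sample share, and their gap."""
    pop = population.value_counts(normalize=True).rename("population")
    smp = sample.value_counts(normalize=True).rename("sample")
    # Categories missing from one side get a share of 0.
    table = pd.concat([pop, smp], axis=1).fillna(0.0)
    table["gap"] = table["sample"] - table["population"]
    return table


# Toy data: the population is 30% urban, the sample is 80% urban.
population = pd.Series(["urban"] * 300 + ["rural"] * 700)
sample = pd.Series(["urban"] * 80 + ["rural"] * 20)

report = compare_shares(population, sample)
print(report)
```

A large gap for any category suggests over- or under-representation. A goodness-of-fit test could formalize the comparison, since small gaps are expected from sampling error alone.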
