# 1. Choosing the Right Type of Feature Scaling

## 1.1. Standardization (Z-score Normalization)

- When to Use:

    - When you want the features to have a mean of 0 and a standard deviation of 1.

    - When the model is sensitive to feature scale, for example because it relies on distances, gradient-based optimization, or regularization.

    - When features have different units or scales.

- Advantages:

    - Useful for scale-sensitive algorithms (e.g., regularized linear regression, logistic regression, SVM). Note that standardization only shifts and rescales a feature; it does not make the data normally distributed.

    - Handles features with different units.

- Limitations:

    - Sensitive to outliers, which can skew the mean and standard deviation.
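
As a minimal sketch of the idea, the snippet below standardizes a single feature with Python's standard library. The function name `standardize`, the sample data, and the choice to map a constant feature to zeros are assumptions made for illustration. It uses the population standard deviation, which is also what scikit-learn's `StandardScaler` uses.

```python
from statistics import fmean, pstdev


def standardize(values):
    """Return z-scores: (x - mean) / standard deviation."""
    mu = fmean(values)
    sigma = pstdev(values)  # population standard deviation (divides by n)
    if sigma == 0:
        # Assumption of this sketch: a constant feature is mapped to zeros.
        return [0.0 for _ in values]
    return [(x - mu) / sigma for x in values]


# Hypothetical feature values, e.g. heights in centimeters.
heights_cm = [150.0, 160.0, 170.0, 180.0, 190.0]
z_scores = standardize(heights_cm)

# The same feature with one extreme value appended. The outlier inflates the
# mean and standard deviation, so the original points end up squeezed together.
z_with_outlier = standardize(heights_cm + [1000.0])
```

After scaling, `z_scores` has a mean of 0 and a standard deviation of 1. In `z_with_outlier`, the five original values occupy a much narrower band, which illustrates the outlier sensitivity noted above.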

## 1.2. Min-Max Scaling

- When to Use:

    - When you want to scale features to a specific range, typically [0, 1].

    - When the algorithm is sensitive to the magnitude of its inputs (e.g., neural networks).

- Advantages:

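    - Is a linear transformation, so it preserves the shape of the distribution and the relative distances between data points.

    - Produces bounded values. Zero entries are preserved only when the feature's minimum is 0; otherwise the shift by the minimum turns zeros into nonzero values and sparsity is lost.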

- Limitations:

    - Sensitive to outliers, which can distort the range.
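
A minimal sketch of min-max scaling for one feature follows. The function name `min_max_scale`, the `feature_range` parameter, and the sample ages are hypothetical, chosen only to illustrate the formula `(x - min) / (max - min)`.

```python
def min_max_scale(values, feature_range=(0.0, 1.0)):
    """Linearly map values so that min -> low and max -> high."""
    low, high = feature_range
    v_min, v_max = min(values), max(values)
    span = v_max - v_min
    if span == 0:
        # Assumption of this sketch: a constant feature maps to the lower bound.
        return [low for _ in values]
    return [low + (x - v_min) / span * (high - low) for x in values]


# Hypothetical feature values, e.g. ages in years.
ages = [20.0, 30.0, 40.0, 60.0]
scaled = min_max_scale(ages)

# One extreme value stretches the range, so the original points are
# compressed toward the lower bound.
scaled_with_outlier = min_max_scale(ages + [200.0])
```

Here `scaled` spans the full [0, 1] range, while in `scaled_with_outlier` the four original ages all fall below 0.25 because the outlier defines the maximum.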

## 1.3. Robust Scaling

- When to Use:

    - When the data contains significant outliers or is not normally distributed.

- Advantages:

    - Uses the median and interquartile range, making it robust to outliers.

    - Suitable for datasets with skewed distributions.

- Limitations:

    - Less reliable when the dataset has few observations, because the median and interquartile range are then noisy estimates.
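
The sketch below shows one way to compute robust scaling, `(x - median) / IQR`, with the standard library. The function name `robust_scale` and the sample data are assumptions for illustration. Quartiles are computed with the `"inclusive"` method, which is one of several common quantile conventions, so other tools may give slightly different values.

```python
from statistics import median, quantiles


def robust_scale(values):
    """Center on the median and divide by the interquartile range (IQR)."""
    med = median(values)
    # quantiles(..., n=4) returns the three quartile cut points (Q1, Q2, Q3).
    q1, _, q3 = quantiles(values, n=4, method="inclusive")
    iqr = q3 - q1
    if iqr == 0:
        # Assumption of this sketch: a feature with zero IQR maps to zeros.
        return [0.0 for _ in values]
    return [(x - med) / iqr for x in values]


# Hypothetical feature with one large outlier.
data = [1.0, 2.0, 3.0, 4.0, 5.0, 100.0]
scaled = robust_scale(data)
```

The five typical values land within roughly one IQR of zero, and the outlier stays far away from them instead of compressing them. Robust scaling does not remove or clip outliers; it only keeps them from dominating the scaling statistics.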

# 2. Situations Where Scaling Might Not Be Necessary

- **Homogeneous Features**

    - If all features are on the same scale and have similar units, scaling might not significantly impact the model's performance.

- **Tree-Based Models**

    - Algorithms like decision trees, random forests, and gradient-boosted trees do not require feature scaling because they split on per-feature thresholds. Rescaling a feature changes the threshold values but not the resulting partitions of the data.

- **Linear Models without Regularization**

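    - In ordinary least squares regression (without regularization), scaling is not strictly necessary: rescaling a feature rescales its coefficient but leaves the predictions unchanged. Scaling can still make coefficients easier to compare and speed up convergence when the model is fit with gradient-based optimization.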

- **When Interpretability is Key**

    - If interpretability is more important than model performance, keeping features in their original units can make the results easier to understand and communicate.
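
To illustrate the point about linear models without regularization, the sketch below fits a one-feature least-squares line twice: once with the feature in square meters and once with the same feature in square centimeters. The helper `fit_line` and the data are hypothetical and serve only as an illustration.

```python
from statistics import fmean


def fit_line(xs, ys):
    """Closed-form least-squares fit of y = slope * x + intercept."""
    x_bar, y_bar = fmean(xs), fmean(ys)
    sxx = sum((x - x_bar) ** 2 for x in xs)
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = y_bar - slope * x_bar
    return slope, intercept


# Hypothetical data: floor area and price.
area_m2 = [50.0, 80.0, 120.0, 200.0]
price = [150.0, 210.0, 330.0, 520.0]

# The same feature in a different unit (a pure rescaling by 10,000).
area_cm2 = [a * 10_000 for a in area_m2]

slope_m, intercept_m = fit_line(area_m2, price)
slope_cm, intercept_cm = fit_line(area_cm2, price)

pred_m = [slope_m * x + intercept_m for x in area_m2]
pred_cm = [slope_cm * x + intercept_cm for x in area_cm2]
```

The two fits produce the same predictions up to floating-point error. Only the slope changes, by the inverse of the rescaling factor. With a regularization penalty on the coefficients this invariance would no longer hold, because the penalty depends on the coefficient's magnitude.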