
# Machine Learning
Machine Learning is a subset of AI that enables systems to **learn patterns from data** and make **predictions or decisions** without being explicitly programmed for each task.

- Feed historical data ➝ train an algorithm ➝ get a model ➝ make predictions or classifications on unseen data.
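
The workflow above can be sketched in a few lines of plain Python. This is a minimal illustration, not a production recipe: the data points are made up, and `train` and `predict` are hypothetical helper names that fit a straight line by ordinary least squares.

```python
# Historical data: (advertising spend, sales). The numbers are made up.
history = [(1.0, 3.1), (2.0, 4.9), (3.0, 7.2), (4.0, 8.8), (5.0, 11.0)]


def train(data):
    """Fit y = slope * x + intercept by ordinary least squares."""
    n = len(data)
    mean_x = sum(x for x, _ in data) / n
    mean_y = sum(y for _, y in data) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in data)
    sxx = sum((x - mean_x) ** 2 for x, _ in data)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept  # this pair of numbers is the "model"


def predict(model, x):
    """Apply the fitted line to a new input."""
    slope, intercept = model
    return slope * x + intercept


model = train(history)          # train an algorithm -> get a model
forecast = predict(model, 6.0)  # predict for an input the model has not seen
print(f"slope={model[0]:.2f}, intercept={model[1]:.2f}, forecast={forecast:.2f}")
```

Real projects would normally hold out part of the data to check how well the model generalizes before trusting its predictions.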


### Use-Cases:
- Prediction (e.g., sales forecast)
- Classification (e.g., spam detection)
- Clustering (e.g., customer segmentation)
- Recommendation systems
- Anomaly detection
- Optimization

---
## Categories of ML Models

### 1. **Supervised Learning**
- **Definition**: Learn from labeled data (input-output pairs)
- **Goal**: Predict the output for new/unseen inputs
- **Task types**: Classification (discrete labels) and regression (continuous values)

**Common Models**:
- Linear Regression
- Logistic Regression
- K-Nearest Neighbors (KNN)
- Decision Trees
- Random Forest
- Support Vector Machines (SVM)
- Gradient Boosting (XGBoost, LightGBM)

**Use-Cases**:
- Spam detection
- House price prediction
- Medical diagnosis
- Stock market prediction
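
As a small illustration of learning from labeled input-output pairs, here is a sketch of K-Nearest Neighbors written from scratch. The data set and the helper name `knn_predict` are assumptions made for this example; a library implementation would add distance weighting, tie handling, and efficient neighbor search.

```python
import math
from collections import Counter

# Labeled training data: (hours studied, hours slept) -> exam outcome. Made-up values.
training = [
    ((1.0, 4.0), "fail"),
    ((2.0, 5.0), "fail"),
    ((1.5, 6.0), "fail"),
    ((6.0, 7.0), "pass"),
    ((7.0, 8.0), "pass"),
    ((8.0, 6.5), "pass"),
]


def knn_predict(train, query, k=3):
    """Label a query point by majority vote among its k nearest neighbors."""
    by_distance = sorted(train, key=lambda pair: math.dist(pair[0], query))
    votes = Counter(label for _, label in by_distance[:k])
    return votes.most_common(1)[0][0]


print(knn_predict(training, (6.5, 7.5)))  # close to the "pass" examples
print(knn_predict(training, (2.0, 4.5)))  # close to the "fail" examples
```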


---
### 2. **Unsupervised Learning**
- **Definition**: Learn patterns from **unlabeled data**
- **Goal**: Discover hidden patterns or groupings

**Common Models**:
- K-Means Clustering
- Hierarchical Clustering
- DBSCAN
- Principal Component Analysis (PCA)

**Use-Cases**:
- Customer segmentation
- Anomaly detection
- Dimensionality reduction
- Topic modeling
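
To make "discovering groupings without labels" concrete, the following is a bare-bones sketch of K-Means (Lloyd's algorithm). The customer data and starting centroids are invented for illustration, and the function assumes at least one iteration; real implementations choose starting centroids more carefully and stop when assignments no longer change.

```python
import math


def kmeans(points, centroids, iterations=10):
    """Plain k-means with caller-supplied starting centroids."""
    for _ in range(iterations):
        # Assignment step: attach each point to its nearest centroid.
        clusters = [[] for _ in centroids]
        for p in points:
            nearest = min(range(len(centroids)), key=lambda i: math.dist(p, centroids[i]))
            clusters[nearest].append(p)
        # Update step: move each centroid to the mean of its cluster.
        # An empty cluster keeps its previous centroid.
        centroids = [
            tuple(sum(axis) / len(cluster) for axis in zip(*cluster)) if cluster else c
            for cluster, c in zip(clusters, centroids)
        ]
    return centroids, clusters


# Unlabeled data: (monthly spend, monthly visits) per customer. Made-up values.
customers = [(1.0, 1.0), (1.5, 2.0), (1.0, 1.5), (8.0, 8.0), (9.0, 9.0), (8.5, 9.5)]
centroids, clusters = kmeans(customers, centroids=[(0.0, 0.0), (10.0, 10.0)])
print(centroids)
print(clusters)
```

No labels are passed in anywhere; the two groups emerge purely from distances between points.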

---
### 3. **Semi-Supervised Learning**
- **Definition**: Learn from a small amount of labeled data combined with a larger amount of unlabeled data
- **When to use**: Labeling data is expensive or slow, but unlabeled data is plentiful

**Common Algorithms**: Self-training, label propagation  
**Use-Cases**:
- Fraud detection
- Web content classification
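
Self-training can be sketched as a loop: fit on the labeled data, pseudo-label the unlabeled points the model is confident about, and refit. The example below is a toy version under stated assumptions: the base classifier is a 1-D nearest-centroid model, confidence is measured by a distance margin, and all names (`fit_centroids`, `predict_with_margin`, `self_train`, `min_margin`) are hypothetical.

```python
def fit_centroids(labeled):
    """Tiny base classifier: one mean value per class (1-D nearest centroid)."""
    sums, counts = {}, {}
    for x, label in labeled:
        sums[label] = sums.get(label, 0.0) + x
        counts[label] = counts.get(label, 0) + 1
    return {label: sums[label] / counts[label] for label in sums}


def predict_with_margin(centroids, x):
    """Return (label, margin); a larger margin means a more confident guess."""
    ranked = sorted(centroids, key=lambda label: abs(x - centroids[label]))
    best, runner_up = ranked[0], ranked[1]  # assumes at least two classes
    margin = abs(x - centroids[runner_up]) - abs(x - centroids[best])
    return best, margin


def self_train(labeled, unlabeled, min_margin=2.0, rounds=5):
    labeled, unlabeled = list(labeled), list(unlabeled)
    for _ in range(rounds):
        centroids = fit_centroids(labeled)
        confident, remaining = [], []
        for x in unlabeled:
            label, margin = predict_with_margin(centroids, x)
            (confident if margin >= min_margin else remaining).append((x, label))
        if not confident:
            break  # nothing new to learn from
        labeled.extend(confident)  # adopt the confident pseudo-labels
        unlabeled = [x for x, _ in remaining]
    return fit_centroids(labeled), unlabeled


# Two labeled points, seven unlabeled ones. Made-up values.
final_centroids, still_unlabeled = self_train(
    labeled=[(1.0, "low"), (9.0, "high")],
    unlabeled=[0.5, 2.0, 3.5, 4.8, 6.5, 8.0, 9.5],
)
print(final_centroids, still_unlabeled)
```

Points near the decision boundary stay unlabeled rather than being forced into a class, which is the usual safeguard against reinforcing early mistakes.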

---

### 4. **Reinforcement Learning**
- **Definition**: An agent learns by interacting with an environment and receiving **rewards or penalties** for its actions
- **Goal**: Maximize cumulative reward

**Common Algorithms**:
- Q-Learning
- Deep Q-Networks (DQN)
- Policy Gradient Methods

**Use-Cases**:
- Game AI (AlphaGo)
- Robotics  
- Autonomous vehicles
- Trading bots
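
The reward-driven loop can be illustrated with tabular Q-Learning on a toy problem. Everything below is an assumption made for the sketch: a five-cell corridor environment, a reward of 1 for reaching the right-most cell, and the chosen values of `ALPHA`, `GAMMA`, and `EPSILON`.

```python
import random

N_STATES = 5          # corridor cells 0..4; reaching cell 4 ends the episode
ACTIONS = (-1, +1)    # index 0 = step left, index 1 = step right
ALPHA, GAMMA, EPSILON = 0.5, 0.9, 0.2  # learning rate, discount, exploration rate


def step(state, action_index):
    """Environment: move within the corridor; reward 1 only on reaching the goal."""
    next_state = min(max(state + ACTIONS[action_index], 0), N_STATES - 1)
    done = next_state == N_STATES - 1
    return next_state, (1.0 if done else 0.0), done


def train(episodes=500, seed=0):
    rng = random.Random(seed)
    q = [[0.0, 0.0] for _ in range(N_STATES)]
    for _ in range(episodes):
        state, done = 0, False
        while not done:
            # Epsilon-greedy: explore sometimes (and on ties), otherwise exploit.
            if rng.random() < EPSILON or q[state][0] == q[state][1]:
                action = rng.randrange(2)
            else:
                action = 0 if q[state][0] > q[state][1] else 1
            next_state, reward, done = step(state, action)
            # Q-learning update: move toward reward + discounted best future value.
            target = reward + (0.0 if done else GAMMA * max(q[next_state]))
            q[state][action] += ALPHA * (target - q[state][action])
            state = next_state
    return q


q_table = train()
# Greedy policy for the non-terminal cells 0..3.
policy = ["left" if left > right else "right" for left, right in q_table[:-1]]
print(policy)
```

The agent is never told that "right" is correct; it only sees rewards, and the learned Q-values end up favoring the action that maximizes cumulative discounted reward.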

---

## Types of Machine Learning Models

### 1. **Classification Models**  
**Goal**: Predict a category/label (Discrete output)  
**Examples**:
- Logistic Regression
- Decision Tree
- Random Forest
- SVM
- Naive Bayes
- KNN

**Use-Cases**:
- Email spam detection
- Image classification
- Disease diagnosis
- Customer churn prediction
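
As one way to see how a classifier produces a discrete label, here is a from-scratch sketch of Logistic Regression with a single feature, trained by batch gradient descent on log loss. The data, learning rate, and epoch count are assumptions chosen for the example.

```python
import math


def sigmoid(z):
    """Squash any real number into the (0, 1) range."""
    return 1.0 / (1.0 + math.exp(-z))


def train_logistic(xs, ys, lr=0.1, epochs=2000):
    """Fit p(y=1 | x) = sigmoid(w * x + b) by batch gradient descent on log loss."""
    w, b = 0.0, 0.0
    n = len(xs)
    for _ in range(epochs):
        errors = [sigmoid(w * x + b) - y for x, y in zip(xs, ys)]
        w -= lr * sum(e * x for e, x in zip(errors, xs)) / n
        b -= lr * sum(errors) / n
    return w, b


def log_loss(xs, ys, w, b):
    """Average negative log-likelihood of the labels under the model."""
    eps = 1e-12  # guard against log(0)
    total = 0.0
    for x, y in zip(xs, ys):
        p = min(max(sigmoid(w * x + b), eps), 1 - eps)
        total += -(y * math.log(p) + (1 - y) * math.log(1 - p))
    return total / len(xs)


# Feature: count of suspicious words in an email; label: 1 = spam. Made-up data.
suspicious_words = [0, 1, 2, 3, 6, 7, 8, 9]
is_spam = [0, 0, 0, 0, 1, 1, 1, 1]

w, b = train_logistic(suspicious_words, is_spam)
print(f"P(spam | 2 words) = {sigmoid(w * 2 + b):.2f}")
print(f"P(spam | 7 words) = {sigmoid(w * 7 + b):.2f}")
print(f"log loss = {log_loss(suspicious_words, is_spam, w, b):.3f}")
```

The model outputs a probability; the discrete label comes from applying a threshold (commonly 0.5) to that probability.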

---

### 2. **Regression Models**  
**Goal**: Predict a continuous numerical value  
**Examples**:
- Linear Regression
- Ridge/Lasso Regression
- Decision Tree Regressor
- Random Forest Regressor
- XGBoost Regressor

**Use-Cases**:
- House price prediction
- Sales forecasting
- Temperature prediction
- Stock price prediction
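
The difference between plain Linear Regression and Ridge Regression is easiest to see in one dimension, where both have a closed form. The sketch below assumes a single feature and an unpenalized intercept; `fit_ridge_1d` and `lam` are names chosen for this example.

```python
def fit_ridge_1d(xs, ys, lam=0.0):
    """Closed-form 1-D ridge regression; lam=0 reduces to ordinary least squares."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / (sxx + lam)            # the L2 penalty inflates the denominator
    intercept = mean_y - slope * mean_x  # the intercept is left unpenalized
    return slope, intercept


xs = [1.0, 2.0, 3.0, 4.0, 5.0]
ys = [2.0, 4.0, 6.0, 8.0, 10.0]

ols_slope, ols_intercept = fit_ridge_1d(xs, ys, lam=0.0)
ridge_slope, ridge_intercept = fit_ridge_1d(xs, ys, lam=10.0)
print(ols_slope, ols_intercept)
print(ridge_slope, ridge_intercept)
```

A larger `lam` shrinks the slope toward zero, trading a little bias for lower variance. Lasso uses an L1 penalty instead and has no equally simple closed form in general.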

---

### 3. **Clustering Models**  
**Goal**: Group similar data points (unsupervised)  
**Examples**:
- K-Means
- DBSCAN
- Agglomerative Clustering

**Use-Cases**:
- Customer segmentation
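- Market basket analysis (grouping similar baskets; finding item rules is usually done with association rule mining)
- Image compression (color quantization)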

---

### 4. **Dimensionality Reduction Models**  
**Goal**: Reduce the number of features while preserving as much useful structure (e.g., variance) as possible  
**Examples**:
- PCA (Principal Component Analysis)
- t-SNE
- LDA (Linear Discriminant Analysis; supervised, uses class labels)

**Use-Cases**:
- Data visualization
- Speeding up ML models
- Noise reduction
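
For intuition about PCA, the sketch below handles only the 2-D case, where the covariance matrix is 2x2 and its eigenvalues have a closed form. The function name `pca_2d` and the data are assumptions for this example; general-purpose implementations work in any dimension and typically rely on SVD.

```python
import math


def pca_2d(points):
    """Return (first principal direction, explained variance ratio) for 2-D points."""
    n = len(points)
    mean_x = sum(x for x, _ in points) / n
    mean_y = sum(y for _, y in points) / n
    # Entries of the 2x2 sample covariance matrix [[sxx, sxy], [sxy, syy]].
    sxx = sum((x - mean_x) ** 2 for x, _ in points) / (n - 1)
    syy = sum((y - mean_y) ** 2 for _, y in points) / (n - 1)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in points) / (n - 1)
    # Closed-form eigenvalues of a symmetric 2x2 matrix.
    half_trace = (sxx + syy) / 2
    spread = math.sqrt(((sxx - syy) / 2) ** 2 + sxy ** 2)
    largest, smallest = half_trace + spread, half_trace - spread
    # Eigenvector for the largest eigenvalue, normalized to unit length.
    if sxy != 0:
        direction = (largest - syy, sxy)
    else:
        direction = (1.0, 0.0) if sxx >= syy else (0.0, 1.0)
    norm = math.hypot(*direction)
    direction = (direction[0] / norm, direction[1] / norm)
    return direction, largest / (largest + smallest)


# Two strongly correlated features (roughly y = 2x). Made-up values.
data = [(1.0, 2.1), (2.0, 3.9), (3.0, 6.2), (4.0, 7.8), (5.0, 10.1)]
direction, explained = pca_2d(data)
print(direction, explained)
```

When one direction explains nearly all the variance, projecting onto it replaces two features with one while losing very little information.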

---

### 5. **Ensemble Models**  
**Goal**: Combine multiple models to improve performance  
**Examples**:
- Random Forest (Bagging)
- XGBoost, LightGBM (Boosting)
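- Voting Classifier (Voting: majority vote or averaged probabilities)
- Stacking Classifier (Stacking: a meta-model learns from the base models' predictions)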

**Use-Cases**:
- Kaggle competitions
- Tabular data tasks
- Fraud detection
- Loan default prediction
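
A hard-voting ensemble can be sketched without any library. The three prediction lists below are invented and deliberately idealized: each hypothetical model is wrong on different samples, which is the situation where combining models helps most.

```python
from collections import Counter

# True labels and the predictions of three imperfect (hypothetical) models.
y_true = [1, 0, 1, 1, 0, 0, 1, 0]
model_a = [1, 0, 1, 0, 0, 0, 1, 1]
model_b = [1, 1, 1, 1, 0, 0, 0, 0]
model_c = [0, 0, 1, 1, 0, 1, 1, 0]


def hard_vote(*prediction_lists):
    """Majority vote across models, sample by sample."""
    return [Counter(votes).most_common(1)[0][0] for votes in zip(*prediction_lists)]


def accuracy(y_pred, y_actual):
    """Fraction of predictions that match the true labels."""
    return sum(p == t for p, t in zip(y_pred, y_actual)) / len(y_actual)


ensemble = hard_vote(model_a, model_b, model_c)
for name, preds in [("a", model_a), ("b", model_b), ("c", model_c), ("vote", ensemble)]:
    print(name, accuracy(preds, y_true))
```

If the models made the same mistakes on the same samples, voting would not help, which is why ensembles benefit from diverse base learners.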

---

## 1. Classical ML Models (Must-Know for Interviews & Projects)

| Model                      | Type           | Primary Use-Cases                              | Asked in Interviews? | Core Math / Concepts          |
|----------------------------|----------------|------------------------------------------------|----------------------|-------------------------------|
| **Linear Regression**      | Supervised     | Predicting continuous values                   | ✅ Always             | Least squares, gradient descent |
| **Logistic Regression**    | Supervised     | Binary classification                          | ✅ Always             | Sigmoid, log loss             |
| **K-Nearest Neighbors**    | Supervised     | Classification, regression                     | 🔶 Frequently         | Euclidean distance            |
| **Naive Bayes**            | Supervised     | Text/NLP, spam filters                         | 🔶 Frequently         | Bayes' theorem                |
| **Decision Tree**          | Supervised     | Classification + Regression                    | ✅ Always             | Entropy, Gini index           |
| **Random Forest**          | Ensemble       | Better Decision Tree via Bagging               | ✅ Always             | Bootstrapping, bagging        |
| **Support Vector Machine** | Supervised     | High-dimensional classification                | 🔶 Often              | Vectors, hyperplanes, margin  |
| **K-Means Clustering**     | Unsupervised   | Grouping unlabeled data                        | 🔶 Often              | Centroids, distance metric    |
| **PCA**                    | Dim. Reduction | Feature reduction before ML                    | 🔶 Often              | Eigenvalues/vectors, SVD      |
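
The entropy and Gini index listed for Decision Trees are short formulas: Gini impurity is $1 - \sum_k p_k^2$ and entropy is $-\sum_k p_k \log_2 p_k$, where $p_k$ is the share of class $k$ in a node. The sketch below computes both for a made-up split; the helper names are assumptions for this example.

```python
import math
from collections import Counter


def gini(labels):
    """Gini impurity: 1 - sum(p_k^2). 0 means the node is pure."""
    n = len(labels)
    return 1.0 - sum((count / n) ** 2 for count in Counter(labels).values())


def entropy(labels):
    """Shannon entropy in bits: -sum(p_k * log2(p_k))."""
    n = len(labels)
    return -sum((c / n) * math.log2(c / n) for c in Counter(labels).values())


def weighted_impurity(left, right, measure):
    """Impurity after a split, weighting each child by its share of the samples."""
    n = len(left) + len(right)
    return len(left) / n * measure(left) + len(right) / n * measure(right)


parent = ["spam"] * 5 + ["ham"] * 5
left = ["spam"] * 4 + ["ham"]    # samples where the candidate condition holds
right = ["spam"] + ["ham"] * 4   # samples where it does not

print(gini(parent), entropy(parent))
print(weighted_impurity(left, right, gini))
```

A tree-growing algorithm evaluates many candidate splits this way and keeps the one that lowers impurity the most.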

---

## 2. Advanced ML Models (For Production & Competitive Projects)

| Model / Concept              | Type             | Primary Use-Cases                           | Asked in Interviews? | Core Math / Concepts             |
|------------------------------|------------------|---------------------------------------------|----------------------|----------------------------------|
| **XGBoost / LightGBM / CatBoost** | Ensemble Boosting | Tabular data, Kaggle, production ML         | ✅ Yes, often         | Gradient boosting concepts       |
| **Bagging / Boosting**       | Ensemble         | Combine weak learners for strong model      | ✅ Yes                | Bias-variance tradeoff           |
| **Stacking / Voting**        | Ensemble         | Combine predictions from multiple models    | 🔸 Sometimes          | Conceptual understanding         |
| **Regularization (L1, L2)**  | Optimization     | Prevent overfitting                         | ✅ Always             | Cost function with penalty terms |
| **GridSearchCV / RandomizedSearchCV** | Tuning   | Hyperparameter optimization                 | ✅ Yes                | Cross-validated grid vs. random search |
| **DBSCAN / Agglomerative**   | Clustering       | Density-based (noise-robust) and hierarchical clustering | 🔸 Sometimes | Density, distance-based clusters |
| **Isolation Forest**         | Anomaly Detection| Fraud detection, outlier detection          | 🔸 Sometimes          | Random partitioning              |
| **Polynomial Regression**    | Supervised       | Modeling nonlinear relationships            | 🔸 Sometimes          | Curve fitting, degree control    |
| **Lasso / Ridge / ElasticNet** | Regularized Regression | Shrinking coefficients             | ✅ Yes                | L1/L2 penalties                  |
| **Feature Selection Models** | Preprocessing    | Reducing unimportant features               | 🔸 Sometimes          | Variance threshold, Info Gain    |
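
Hyperparameter tuning with scikit-learn's `GridSearchCV` might look like the sketch below. It assumes scikit-learn is installed; the Iris data set, the Logistic Regression estimator, and the candidate values for `C` are choices made for illustration only.

```python
from sklearn.datasets import load_iris
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import GridSearchCV

X, y = load_iris(return_X_y=True)

# Candidate values for C, the inverse of the L2 regularization strength.
param_grid = {"C": [0.01, 0.1, 1.0, 10.0]}

search = GridSearchCV(
    LogisticRegression(max_iter=1000),
    param_grid,
    cv=5,                # 5-fold cross-validation for every candidate
    scoring="accuracy",
)
search.fit(X, y)

print(search.best_params_, round(search.best_score_, 3))
```

`RandomizedSearchCV` from the same module follows the same pattern but samples a fixed number of candidates instead of trying every combination, which tends to scale better when there are many hyperparameters.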

---

## 3. Other Useful Models and Concepts (Nice-to-Know)

| Model / Concept            | Category          | Use-Cases                                | Why Important?         | Core Idea                        |
|----------------------------|-------------------|------------------------------------------|------------------------|----------------------------------|
| **t-SNE / UMAP**           | Visualization     | Plotting high-dimensional data           | Great for presentations| Non-linear dimensionality reduction |
| **AutoML (e.g., TPOT)**    | Automation        | Auto model building/tuning               | Used in industry       | Evolutionary search              |
| **One-vs-Rest / One-vs-One** | Classification Strategy | Multi-class classification          | Interview design question | Divide multiclass into binary    |
| **K-Fold Cross Validation**| Evaluation        | Robust model validation                  | ✅ Always                | Rotate k train/validation splits, average the scores |
| **ROC/AUC, Precision-Recall** | Evaluation    | Model performance evaluation             | ✅ Always                | Trade-offs, thresholds           |
| **Learning Curves**        | Diagnostic Tool   | Detecting under/overfitting              | Interview discussions   | Visual tool                      |
| **Bias-Variance Tradeoff** | Theory            | Understanding model generalization       | Core concept            | Low bias + low variance ideal    |
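
K-Fold Cross Validation is mostly bookkeeping over indices, as the sketch below shows. It uses contiguous folds without shuffling and a deliberately trivial "predict the training mean" baseline; both are simplifying assumptions, and the function names are hypothetical.

```python
def k_fold_indices(n_samples, k):
    """Yield (train_indices, validation_indices) for k contiguous folds."""
    indices = list(range(n_samples))
    # Spread any remainder over the first folds so sizes differ by at most one.
    fold_sizes = [n_samples // k + (1 if i < n_samples % k else 0) for i in range(k)]
    start = 0
    for size in fold_sizes:
        validation = indices[start:start + size]
        train = indices[:start] + indices[start + size:]
        yield train, validation
        start += size


def mean_model_cv(values, k):
    """Cross-validated mean absolute error of a 'predict the training mean' baseline."""
    fold_errors = []
    for train, validation in k_fold_indices(len(values), k):
        prediction = sum(values[i] for i in train) / len(train)
        error = sum(abs(values[i] - prediction) for i in validation) / len(validation)
        fold_errors.append(error)
    return sum(fold_errors) / k


folds = list(k_fold_indices(10, 3))
for train, validation in folds:
    print("train:", train, "validation:", validation)

cv_error = mean_model_cv([2, 4, 6, 8, 10, 12], k=3)
print(cv_error)
```

Every sample is used for validation exactly once, so the averaged score depends less on one lucky or unlucky split. In practice the data is usually shuffled (or stratified by class) before folding.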

---

## What to Use When?

| Task Type                      | Suggested Model(s)                             |
|--------------------------------|------------------------------------------------|
| Predict a continuous value     | Linear Regression, XGBoost                     |
| Binary classification          | Logistic Regression, SVM, Random Forest        |
| Group customers by behavior    | K-Means, DBSCAN                                |
| High-dimensional data          | PCA (feature reduction), t-SNE (visualization) |
| Real-time fraud detection      | Isolation Forest, XGBoost                      |
| Tabular data + competition     | XGBoost, LightGBM                              |
| Text classification (NLP)      | Naive Bayes, Logistic Regression               |
| Hyperparameter tuning          | GridSearchCV, RandomizedSearchCV               |
| Anomaly or rare-case detection | Isolation Forest, One-Class SVM                |

---