---

# **Module 2: Supervised Learning**

Supervised learning algorithms are grouped into two main categories:

* **Classification** – Predicting **discrete labels or categories**
* **Regression** – Predicting **continuous numerical values**

---

# **Classification in Machine Learning**

Classification is a supervised learning technique used to **predict labels or categories** based on input data.
The model assigns each data point to a predefined class (e.g., spam vs. non-spam, diseased vs. healthy).

For example, a model trained on labeled dog vs. cat images can classify **new** images as either dogs or cats using features like **color**, **texture**, or **shape**.

---


# **Types of Classification**

Classification tasks differ based on the number of categories and how labels are assigned.

---

## **1. Binary Classification**

The simplest type: each data point is assigned to one of **two categories**.
Example: Email → Spam or Not Spam

The model uses features like keywords or sender information to decide between the two options.
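
As a minimal sketch of the idea, the snippet below scores an email with a logistic function and applies a threshold. The feature names, weights, and bias are hypothetical values chosen for illustration; a real model would learn them from labeled data.

```python
import math

# Hypothetical, hand-picked weights for illustration only.
# A trained model would learn these values from labeled emails.
WEIGHTS = {"contains_free": 2.0, "contains_winner": 1.5, "known_sender": -3.0}
BIAS = -1.0


def spam_probability(features: dict) -> float:
    """Squash a weighted sum of features into a probability between 0 and 1."""
    score = BIAS + sum(WEIGHTS[name] * value for name, value in features.items())
    return 1.0 / (1.0 + math.exp(-score))


def classify_email(features: dict, threshold: float = 0.5) -> str:
    """Return exactly one of the two possible labels."""
    return "Spam" if spam_probability(features) >= threshold else "Not Spam"


suspicious = {"contains_free": 1, "contains_winner": 1, "known_sender": 0}
trusted = {"contains_free": 1, "contains_winner": 0, "known_sender": 1}

print(classify_email(suspicious))  # Spam
print(classify_email(trusted))     # Not Spam
```

Raising the threshold makes the filter more conservative about flagging mail as spam, and lowering it does the opposite.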

---

## **2. Multiclass Classification**

The model classifies data into **more than two categories**.
Example: Image classifier → Cat, Dog, Bird

Here, each input belongs to exactly one class.
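
One common way to pick a single class out of several is to turn per-class scores into probabilities with a softmax and take the largest. The sketch below assumes the raw scores are already available; the numbers are made up for illustration.

```python
import math

CLASSES = ["Cat", "Dog", "Bird"]


def softmax(scores):
    """Convert raw scores into probabilities that sum to 1."""
    peak = max(scores)  # subtracting the max keeps exp() numerically stable
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def predict_class(scores):
    """Return the single most probable class and the full distribution."""
    probs = softmax(scores)
    best = max(range(len(probs)), key=lambda i: probs[i])
    return CLASSES[best], probs


# Hypothetical raw scores for one image, in the order of CLASSES.
label, probs = predict_class([1.2, 3.4, 0.3])
print(label)  # Dog
```

Because the probabilities compete for a total of 1, a higher score for one class necessarily lowers the probability of the others.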

---

## **3. Multi-Label Classification**

In multi-label classification, each input can belong to **multiple categories** at once.
Example: A movie tagged as **Action** *and* **Comedy**.

This is different from multiclass classification, where each sample belongs to only one class.

*(Less essential for beginners, but important in domains such as document tagging and image annotation.)*
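
The difference from multiclass prediction can be sketched in a few lines: each label gets its own independent yes/no decision, so any number of labels can be selected. The scores below are hypothetical.

```python
import math


def sigmoid(score: float) -> float:
    """Map a raw score to an independent probability between 0 and 1."""
    return 1.0 / (1.0 + math.exp(-score))


def predict_tags(scores: dict, threshold: float = 0.5) -> list:
    """Keep every tag whose own probability clears the threshold."""
    return sorted(tag for tag, s in scores.items() if sigmoid(s) >= threshold)


# Hypothetical per-genre scores for one movie.
movie_scores = {"Action": 2.1, "Comedy": 0.8, "Horror": -1.7}
print(predict_tags(movie_scores))  # ['Action', 'Comedy']
```

Unlike a softmax, the per-label probabilities here do not need to sum to 1, and the result may contain zero, one, or several tags.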

---

# **How Classification Works**

Classification is performed through these steps:

### **1. Data Collection**

Gather labeled examples where each input has a corresponding class.

### **2. Feature Extraction**

Identify important attributes such as color, texture, or shape that distinguish between classes.

### **3. Model Training**

The algorithm learns how features map to correct labels by identifying patterns in the data.

### **4. Model Evaluation**

Test the trained model on held-out data that it did not see during training, to measure accuracy and check how well it generalizes.

### **5. Prediction**

The model predicts the class of new data based on learned patterns.

### **6. Iterative Improvement**

If performance is poor, tune hyperparameters, engineer better features, or try a different algorithm, then re-evaluate.
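
The steps above can be sketched end to end in plain Python. This is only an illustration under stated assumptions: the dataset is a hypothetical toy set, the features are assumed to be already extracted, and a nearest-centroid rule stands in for whatever algorithm would be used in practice.

```python
import math
from collections import defaultdict

# Steps 1-2. Data collection and feature extraction (hypothetical toy data):
# each sample is ((weight_kg, ear_length_cm), label).
data = [
    ((4.0, 6.5), "cat"), ((3.5, 6.0), "cat"), ((5.0, 7.0), "cat"), ((4.5, 6.8), "cat"),
    ((20.0, 11.0), "dog"), ((25.0, 12.5), "dog"), ((18.0, 10.0), "dog"), ((30.0, 13.0), "dog"),
]

# Hold out one sample per class so evaluation uses unseen data.
train = data[:3] + data[4:7]
test = [data[3], data[7]]


# Step 3. Model training: store the mean feature vector (centroid) of each class.
def fit_centroids(samples):
    grouped = defaultdict(list)
    for features, label in samples:
        grouped[label].append(features)
    return {
        label: tuple(sum(column) / len(rows) for column in zip(*rows))
        for label, rows in grouped.items()
    }


def predict(centroids, features):
    """Assign the class whose centroid is closest to the input."""
    return min(centroids, key=lambda label: math.dist(centroids[label], features))


centroids = fit_centroids(train)

# Step 4. Model evaluation: accuracy on the held-out samples.
correct = sum(predict(centroids, features) == label for features, label in test)
accuracy = correct / len(test)
print(f"accuracy = {accuracy:.2f}")

# Step 5. Prediction on a new, unlabeled input.
new_label = predict(centroids, (22.0, 11.5))
print(new_label)  # dog

# Step 6. Iterative improvement: if accuracy were low, one would revisit
# the features, the algorithm, or its settings and evaluate again.
```

With real data the split is usually randomized and much larger; the fixed split here just keeps the example deterministic.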

---

# **Classification Algorithms**

Logistic regression is one of the most fundamental classification algorithms, so it is a good place to start.
Beyond it, machine learning offers many other classifiers, which fall into linear and non-linear groups:

---

## **1. Linear Classifiers**

Linear classifiers separate classes with a **linear decision boundary**: a straight line in two dimensions, or a hyperplane when there are more features.

Examples:

* Logistic Regression
* Linear Support Vector Machine (kernel = 'linear')
* Single-Layer Perceptron
* Stochastic Gradient Descent (SGD) Classifier (a linear model fitted with SGD)
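
To make the idea of a linear boundary concrete, here is a minimal single-layer perceptron sketch in plain Python. The training points are hypothetical and deliberately linearly separable; the learned boundary is the set of points where `w[0]*x1 + w[1]*x2 + b = 0`.

```python
def train_perceptron(samples, epochs=20, learning_rate=1.0):
    """Learn weights w and bias b for 2-D inputs with labels in {-1, +1}."""
    w = [0.0, 0.0]
    b = 0.0
    for _ in range(epochs):
        errors = 0
        for (x1, x2), y in samples:
            activation = w[0] * x1 + w[1] * x2 + b
            if y * activation <= 0:  # misclassified, or exactly on the boundary
                w[0] += learning_rate * y * x1
                w[1] += learning_rate * y * x2
                b += learning_rate * y
                errors += 1
        if errors == 0:  # every training sample is on the correct side
            break
    return w, b


def predict(w, b, point):
    """Classify by which side of the line w.x + b = 0 the point falls on."""
    return 1 if w[0] * point[0] + w[1] * point[1] + b > 0 else -1


# Hypothetical, linearly separable toy data.
samples = [
    ((2.0, 2.0), 1), ((3.0, 3.0), 1), ((3.0, 1.0), 1),
    ((-1.0, -1.0), -1), ((-2.0, 0.0), -1), ((0.0, -2.0), -1),
]

w, b = train_perceptron(samples)
print(predict(w, b, (4.0, 4.0)))    # 1
print(predict(w, b, (-3.0, -3.0)))  # -1
```

The perceptron is guaranteed to find a separating line only when one exists; on data that is not linearly separable it stops at the epoch limit without converging.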

---

## **2. Non-Linear Classifiers**

Non-linear classifiers can capture **complex relationships** between input features and target classes.

Examples:

* K-Nearest Neighbors (KNN)
* Kernel SVM
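* Naive Bayes (for example the Gaussian variant; some variants, such as Multinomial Naive Bayes, are effectively linear)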
* Decision Tree Classification
* Ensemble Classifiers:
  * Random Forest
  * AdaBoost
  * Bagging Classifier
  * Voting Classifier
  * Extra Trees Classifier
* Multi-layer Neural Networks
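
A small K-Nearest Neighbors sketch shows what a non-linear classifier can do. The hypothetical data below follows an XOR-style layout: opposite corners share a class, so no single straight line separates the two classes, yet voting among nearby points handles it.

```python
import math
from collections import Counter


def knn_predict(train, query, k=3):
    """Label a query point by majority vote among its k nearest training points."""
    nearest = sorted(train, key=lambda sample: math.dist(sample[0], query))[:k]
    votes = Counter(label for _, label in nearest)
    return votes.most_common(1)[0][0]


# Hypothetical XOR-style toy data: class A sits in two opposite corners,
# class B in the other two.
train = [
    ((0.0, 0.0), "A"), ((0.1, 0.1), "A"), ((0.0, 0.2), "A"),
    ((1.0, 1.0), "A"), ((0.9, 1.0), "A"), ((1.0, 0.8), "A"),
    ((0.0, 1.0), "B"), ((0.1, 0.9), "B"), ((0.2, 1.0), "B"),
    ((1.0, 0.0), "B"), ((0.9, 0.1), "B"), ((1.0, 0.2), "B"),
]

print(knn_predict(train, (0.05, 0.05)))  # A
print(knn_predict(train, (0.95, 0.95)))  # A
print(knn_predict(train, (0.05, 0.95)))  # B
print(knn_predict(train, (0.95, 0.05)))  # B
```

KNN has no training step beyond storing the data, so the cost is paid at prediction time, when distances to the stored points are computed.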

---

# **Real-Life Examples of Machine Learning Classification**

Classification models are widely used across many industries:

### **1. Email Spam Filtering**

Identifies emails as **spam** or **not spam** based on text patterns.

### **2. Credit Risk Assessment**

Predicts whether a loan applicant is **likely to default**, using features such as credit history and income.

### **3. Medical Diagnosis**

Classifies medical data to identify diseases like cancer or diabetes.

### **4. Image Classification**

Used in:

* Facial recognition
* Autonomous driving
* Medical imaging

### **5. Sentiment Analysis**

Determines whether text expresses **positive, negative, or neutral** sentiment.

### **6. Fraud Detection**

Detects fraudulent activities by analyzing patterns in transactions.

### **7. Recommendation Systems**

Predicts which items a user is likely to engage with based on past behavior (e.g., Netflix movie suggestions, Amazon product recommendations).

---

