# Introduction to Unsupervised Machine Learning

#### Supervised Machine Learning

In **supervised learning**, we have a dataset with:

* **Input features:** things we know, like age or years of experience
* **Output feature:** the thing we want to predict (also called the label or target), like salary

The goal is to **predict the output** using the inputs.
There are two main types of supervised learning:

1. **Regression:** Predict numbers (e.g., salary)
2. **Classification:** Predict categories (e.g., yes/no, red/blue)
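
To make the regression case concrete, here is a minimal sketch that fits a straight line to a tiny dataset using ordinary least squares. The numbers and the names `experience`, `salary`, `fit_line`, and `predict_salary` are made up for illustration; they are not from a real dataset or library.

```python
# Hypothetical toy data: years of experience -> salary (in thousands).
experience = [1, 2, 3, 4, 5]
salary = [35, 40, 45, 50, 55]


def fit_line(xs, ys):
    """Ordinary least squares with one input feature.

    Returns (slope, intercept) of the best-fitting straight line.
    """
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov_xy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    slope = cov_xy / var_x
    intercept = mean_y - slope * mean_x
    return slope, intercept


slope, intercept = fit_line(experience, salary)


def predict_salary(years):
    """Regression: the prediction is a number, not a category."""
    return slope * years + intercept


print(predict_salary(6))  # 60.0
```

The key point is that the known salaries (the output feature) guide the fit. That is what makes this supervised.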

Examples of supervised learning algorithms are **Linear Regression, Logistic Regression, Decision Trees, Random Forest, and XGBoost**.

---

# Unsupervised Machine Learning

In **unsupervised learning**, there is **no output to predict**. Instead, we try to **find patterns or group similar data together**.

Grouping similar data points together is called **clustering**, and it is one of the most common unsupervised tasks.

**Example:**
Imagine a dataset with **age, years of experience, and salary**.

* We are **not trying to predict anything**.
* Instead, we can **group people with similar age, experience, and salary**.
* These groups are called **clusters**.

---

#### Real-World Example: Customer Segmentation

Clustering is useful for **grouping customers**.

* Suppose you know customers’ **salary** and **spending score**.
* You can make clusters like:

  * Customers who buy regularly → offer a **15% discount**
  * Customers who buy occasionally → offer a **20% discount**

This helps businesses **target the right customers** without any pre-labeled data.
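
One way to build such customer groups is K-Means, one of the algorithms listed in these notes. The sketch below is a plain-Python version under stated assumptions: the customer numbers are invented, `k_means` is a hypothetical helper (not a library function), and the starting centers are simply the first `k` points so the result is reproducible. Real implementations usually pick starting centers randomly or with a smarter scheme.

```python
import math

# Hypothetical customers: (salary in thousands, spending score from 1 to 100).
customers = [(30, 80), (90, 20), (35, 75), (85, 25), (32, 85), (95, 15)]


def k_means(points, k, max_iters=100):
    """Plain K-Means. Returns (labels, centroids).

    Assumption: the first k points are used as starting centroids.
    """
    centroids = [tuple(p) for p in points[:k]]
    labels = None
    for _ in range(max_iters):
        # Step 1: assign every point to its nearest centroid.
        new_labels = [
            min(range(k), key=lambda c: math.dist(p, centroids[c]))
            for p in points
        ]
        if new_labels == labels:
            break  # assignments stopped changing, so we are done
        labels = new_labels
        # Step 2: move each centroid to the average of its points.
        for c in range(k):
            members = [p for p, lab in zip(points, labels) if lab == c]
            if members:
                centroids[c] = tuple(sum(v) / len(members) for v in zip(*members))
    return labels, centroids


labels, centroids = k_means(customers, k=2)
print(labels)  # [0, 1, 0, 1, 0, 1]
```

Here cluster 0 is the lower-salary, high-spending group and cluster 1 is the higher-salary, low-spending group. Nobody told the code which customer belongs where. In practice you would likely reach for a library such as scikit-learn (`sklearn.cluster.KMeans`) rather than writing this by hand.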

---

#### Common Unsupervised Learning Algorithms

Some popular clustering algorithms are:

* **K-Means Clustering**
* **Hierarchical Clustering**
* **DBSCAN Clustering**

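We can also use the **silhouette score** to check how good the clusters are. It ranges from -1 to 1, and values near 1 mean each point sits close to its own cluster and far from the other clusters.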
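
As a rough sketch of how silhouette scoring works: for each point, let `a` be its average distance to the other points in its own cluster and `b` its average distance to the points of the nearest other cluster. The point's score is `(b - a) / max(a, b)`, and the overall score is the average over all points. The function below is a hand-written illustration with invented data, not a library API (scikit-learn offers `sklearn.metrics.silhouette_score` for real use).

```python
import math


def silhouette_score(points, labels):
    """Mean silhouette value over all points (Euclidean distance)."""
    clusters = {}
    for p, lab in zip(points, labels):
        clusters.setdefault(lab, []).append(p)
    if len(clusters) < 2:
        raise ValueError("silhouette needs at least two clusters")

    scores = []
    for p, lab in zip(points, labels):
        own = clusters[lab]
        if len(own) == 1:
            scores.append(0.0)  # convention: a lone point scores 0
            continue
        # a: average distance to the other members of the same cluster
        # (the distance from p to itself is 0, so it does not affect the sum).
        a = sum(math.dist(p, q) for q in own) / (len(own) - 1)
        # b: average distance to the closest other cluster.
        b = min(
            sum(math.dist(p, q) for q in other) / len(other)
            for other_lab, other in clusters.items()
            if other_lab != lab
        )
        scores.append(0.0 if max(a, b) == 0 else (b - a) / max(a, b))
    return sum(scores) / len(scores)


# Hypothetical data: two tight, well-separated pairs of points.
points = [(0, 0), (0, 1), (10, 10), (10, 11)]
good_score = silhouette_score(points, [0, 0, 1, 1])  # sensible grouping, close to 1
bad_score = silhouette_score(points, [0, 1, 0, 1])   # mixed-up grouping, negative
print(good_score, bad_score)
```

A common use is to run a clustering algorithm with several values of K and keep the one with the highest score.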

---

### Key Points

* **Supervised learning:** Predicts a known output.
* **Unsupervised learning:** Finds patterns without an output.
* **Clustering:** Groups similar data points.
* **Algorithms to learn:** K-Means, Hierarchical Clustering, and DBSCAN, plus silhouette scoring to evaluate the clusters (it is a metric, not a clustering algorithm).

## **Unsupervised Learning**

* **What it is:** You have data, but **no answers or labels**. The goal is to **find patterns, groups, or important features** in the data.
* **Example:** You own a shop. You don’t tell the computer which customers are “VIP” or “regular,” but it can **group customers with similar buying habits**.

---

### **1. Clustering**

**Goal:** Put similar items together. Think of it as **grouping friends who like the same things**.

* **K-Means:**

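  * Pick a number of groups (K) and K starting centers (centroids).
  * The computer assigns each item to the nearest centroid.
  * Each centroid moves to the average of its items, and the two steps repeat until the assignments stop changing.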
  * **Example:** 3 groups of fruits: sweet, sour, medium.

* **Hierarchical Clustering:**

  * Start with each item alone.
  * Merge the closest ones step by step.
  * Makes a **tree (dendrogram)** showing relationships.
  * **Example:** Apple and Orange merge first, then Lemon joins later.

* **DBSCAN:**

  * Looks for areas where items are **dense** (many close together).
  * Items that are not in or next to any dense area are labeled **outliers** (noise).
  * **Example:** Most people live in 2 cities → 2 clusters. Someone alone in a village → outlier.
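
The city-and-village idea can be sketched in plain Python. This is a simplified illustration, not a library implementation: `dbscan` is a hand-written helper, the coordinates are invented, and the parameter names `eps` (how close counts as "nearby") and `min_samples` (how many nearby points make an area dense, counting the point itself) are chosen to mirror scikit-learn's `DBSCAN`.

```python
import math

NOISE = -1  # label used for outliers


def dbscan(points, eps, min_samples):
    """Simplified DBSCAN. Returns one label per point (-1 means outlier)."""
    labels = [None] * len(points)  # None means "not visited yet"

    def neighbors(i):
        # All points within eps of point i (including point i itself).
        return [j for j, q in enumerate(points) if math.dist(points[i], q) <= eps]

    cluster_id = -1
    for i in range(len(points)):
        if labels[i] is not None:
            continue
        seeds = neighbors(i)
        if len(seeds) < min_samples:
            labels[i] = NOISE  # not dense here (may still join a cluster later)
            continue
        cluster_id += 1
        labels[i] = cluster_id
        queue = list(seeds)
        while queue:
            j = queue.pop()
            if labels[j] == NOISE:
                labels[j] = cluster_id  # border point: near a dense area
            if labels[j] is not None:
                continue
            labels[j] = cluster_id
            j_neighbors = neighbors(j)
            if len(j_neighbors) >= min_samples:
                queue.extend(j_neighbors)  # j is dense too, keep expanding
    return labels


# Hypothetical map positions: two "cities" and one person in a "village".
people = [
    (0, 0), (0, 1), (1, 0), (1, 1),          # city A
    (10, 10), (10, 11), (11, 10), (11, 11),  # city B
    (5, 30),                                 # village
]
city_labels = dbscan(people, eps=1.5, min_samples=3)
print(city_labels)  # [0, 0, 0, 0, 1, 1, 1, 1, -1]
```

Notice that, unlike K-Means, the number of clusters was never given. It falls out of where the data is dense.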

---

### **2. Dimensionality Reduction**

**Goal:** Simplify data while keeping the important parts. Think of it as **summarizing a big book into a short story**.

* **PCA (Principal Component Analysis):**

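  * Combines the original features into a smaller number of “super-features” (principal components), each a weighted mix of the originals.
  * Keeps as much of the variation in the data as possible.
  * **Example:** You have the weight, height, and age of people → PCA might combine weight and height into a single “body size” factor.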

* **Kernel PCA:**

  * Like PCA but can handle **curved/complex relationships**.

* **t-SNE:**

  * Great for **visualizing** high-dimensional data in 2D or 3D.
  * Keeps similar items close together visually.
  * **Example:** Plot thousands of handwritten digits in 2D so clusters of 0s, 1s, 2s appear clearly.
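
To show what a PCA "super-feature" looks like, here is a minimal sketch that finds only the first principal component. It centers the data, builds the covariance matrix, and uses power iteration (repeatedly multiplying a vector by the matrix and rescaling it) to find the direction of greatest variation. The data and the helper `first_principal_component` are hypothetical. The sketch also skips feature scaling, which is usually done first when features use different units, and it assumes the data is not constant. A library such as scikit-learn (`sklearn.decomposition.PCA`) is the usual choice in practice.

```python
import math


def first_principal_component(rows, iters=200):
    """Return (direction, scores) for the first principal component.

    direction: unit vector with one weight per original feature.
    scores: the new single "super-feature" value for each row.
    """
    n, d = len(rows), len(rows[0])
    means = [sum(r[j] for r in rows) / n for j in range(d)]
    centered = [[r[j] - means[j] for j in range(d)] for r in rows]

    # Sample covariance matrix (d x d).
    cov = [
        [sum(c[a] * c[b] for c in centered) / (n - 1) for b in range(d)]
        for a in range(d)
    ]

    # Power iteration: converges to the direction of largest variance.
    v = [1.0] * d
    for _ in range(iters):
        w = [sum(cov[a][b] * v[b] for b in range(d)) for a in range(d)]
        norm = math.sqrt(sum(x * x for x in w))
        v = [x / norm for x in w]

    scores = [sum(c[j] * v[j] for j in range(d)) for c in centered]
    return v, scores


# Hypothetical people: (weight in kg, height in cm, age in years).
people = [(60, 160, 25), (72, 175, 32), (80, 180, 41), (55, 158, 22), (90, 185, 50)]
direction, scores = first_principal_component(people)
print(direction)  # weights that mix the three features into one
print(scores)     # one number per person instead of three
```

Each person is now described by one number instead of three, which is the "summarizing a big book" idea in practice.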

---

✅ **Summary:**

* **Clustering →** group similar things
* **Dimensionality reduction →** simplify data while keeping patterns
* **DBSCAN →** find dense groups and outliers
* **PCA/t-SNE →** make data smaller or visualizable