
---

## 🧩 a. **Semantic Segmentation**

> Goal: Classify **each pixel** in an image into a **category** (e.g., road, car, person), with **no distinction** between object instances.

### 📘 1. **FCN (Fully Convolutional Networks)**

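* 🔄 Converts the fully connected layers of a classification CNN into **convolutional layers**, so the network accepts inputs of any size and outputs a spatial score map.
* 🧱 Uses **upsampling** to turn the coarse score map into pixel-level predictions.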
* ✅ One of the first end-to-end deep learning models for semantic segmentation.

> **Example**: All pixels of a cat are labeled “cat” — no matter if there are 1 or 5 cats.
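
The upsample-then-classify step can be sketched in plain Python. This is only an illustration: the score values are made up, and nearest-neighbor upsampling stands in for the learned upsampling a real FCN would use.

```python
def upsample_nearest(score_map, factor):
    """Repeat each value `factor` times along both axes (nearest-neighbor)."""
    out = []
    for row in score_map:
        wide = [v for v in row for _ in range(factor)]
        out.extend([list(wide) for _ in range(factor)])
    return out


def argmax_labels(class_scores):
    """class_scores: list of C score maps (each H x W). Returns an H x W label map."""
    h, w = len(class_scores[0]), len(class_scores[0][0])
    return [
        [max(range(len(class_scores)), key=lambda c: class_scores[c][y][x])
         for x in range(w)]
        for y in range(h)
    ]


# Hypothetical coarse 2x2 score maps for two classes: 0 = background, 1 = cat
coarse = [
    [[0.9, 0.2], [0.8, 0.1]],  # background scores
    [[0.1, 0.8], [0.2, 0.9]],  # cat scores
]
full_res = [upsample_nearest(m, 2) for m in coarse]  # 2x2 -> 4x4 per class
labels = argmax_labels(full_res)                     # one class id per pixel
for row in labels:
    print(row)
```

Every pixel in the right half ends up labeled `1` (cat), whether those pixels belong to one cat or several.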

---

### 🧬 2. **U-Net**

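* ➕ Shaped like a **“U”**: a contracting (encoding) path followed by an expanding (decoding) path.
* 🔄 **Skip connections** copy encoder feature maps to the decoder at the same resolution, which helps preserve spatial detail.
* 🧪 Originally designed for **biomedical images**, now used widely.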

> **Example**: Used to segment tumors in MRI scans.
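
A skip connection amounts to concatenating encoder and decoder features along the channel axis. The sketch below assumes both feature maps already share the same spatial size after upsampling (the original U-Net crops the encoder features instead, because it uses unpadded convolutions). The function names and tiny feature maps are hypothetical.

```python
def upsample2x(channel):
    """Nearest-neighbor 2x upsampling of one H x W channel."""
    out = []
    for row in channel:
        wide = [v for v in row for _ in range(2)]
        out.append(wide)
        out.append(list(wide))
    return out


def skip_concat(encoder_feats, decoder_feats):
    """Upsample the decoder channels and stack them after the encoder channels."""
    upsampled = [upsample2x(ch) for ch in decoder_feats]
    enc_shape = (len(encoder_feats[0]), len(encoder_feats[0][0]))
    dec_shape = (len(upsampled[0]), len(upsampled[0][0]))
    if enc_shape != dec_shape:
        raise ValueError(f"spatial sizes differ: {enc_shape} vs {dec_shape}")
    return encoder_feats + upsampled  # concatenation along the channel axis


# Hypothetical features: 2 encoder channels at 4x4, 3 decoder channels at 2x2
encoder_feats = [[[c] * 4 for _ in range(4)] for c in (1, 2)]
decoder_feats = [[[c] * 2 for _ in range(2)] for c in (7, 8, 9)]

merged = skip_concat(encoder_feats, decoder_feats)
print(len(merged), len(merged[0]), len(merged[0][0]))  # channels, height, width
```

The merged tensor has 5 channels at 4x4, so the next decoder convolution sees both the fine encoder detail and the coarser decoder context.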

---

### 📊 3. **SegNet**

* 🧱 Encoder-decoder architecture, like U-Net.
* ⚙️ Reuses the **max-pooling indices** from the encoder to upsample in the decoder.
* ⚡ Storing only the indices, rather than full encoder feature maps, keeps inference memory-efficient.

> **Example**: Lane detection in self-driving cars.
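
The pooling-index idea can be illustrated with a minimal 2x2 max-pool and its matching unpool. This is a sketch on a single hypothetical channel, not SegNet's actual implementation.

```python
def max_pool_2x2_with_indices(x):
    """2x2 max pooling (stride 2) that also records where each maximum came from."""
    pooled, indices = [], []
    for i in range(0, len(x), 2):
        prow, irow = [], []
        for j in range(0, len(x[0]), 2):
            window = [(x[i + di][j + dj], (i + di, j + dj))
                      for di in (0, 1) for dj in (0, 1)]
            value, pos = max(window, key=lambda t: t[0])
            prow.append(value)
            irow.append(pos)
        pooled.append(prow)
        indices.append(irow)
    return pooled, indices


def max_unpool_2x2(pooled, indices, h, w):
    """Place each pooled value back at its recorded position; the rest stays zero."""
    out = [[0] * w for _ in range(h)]
    for prow, irow in zip(pooled, indices):
        for value, (i, j) in zip(prow, irow):
            out[i][j] = value
    return out


x = [[1, 3, 2, 0],
     [4, 2, 1, 5],
     [0, 1, 7, 2],
     [6, 0, 3, 1]]
pooled, idx = max_pool_2x2_with_indices(x)
restored = max_unpool_2x2(pooled, idx, 4, 4)
for row in restored:
    print(row)
```

The unpooled map is sparse: each maximum returns to its original location, and the decoder's convolutions are left to fill in the zeros.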

---

### 🌐 4. **DeepLab (v1 → v3+)**

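* 🎯 Uses **dilated/atrous convolutions** to enlarge the field of view without adding parameters or reducing resolution.
* 💬 v2 introduced **ASPP** (Atrous Spatial Pyramid Pooling), v3 refined it, and v3+ added an **encoder-decoder** design.
* 🔥 Strong results on standard benchmarks such as PASCAL VOC and Cityscapes.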

> **Example**: Segmenting roads, trees, and people in high-res images.
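
To see how the dilation rate widens the field of view, here is a minimal 1D sketch (real models use 2D convolutions; the signal and kernel are made up). A kernel of size k with rate r spans k + (k - 1)(r - 1) input positions while keeping the same k weights. Running several rates in parallel, as in the last line, is the basic idea behind ASPP.

```python
def effective_kernel_size(k, rate):
    """Input positions spanned by a size-k kernel with the given dilation rate."""
    return k + (k - 1) * (rate - 1)


def atrous_conv1d(signal, kernel, rate):
    """Valid 1D convolution (cross-correlation) with gaps of `rate` between taps."""
    span = effective_kernel_size(len(kernel), rate)
    return [
        sum(kernel[t] * signal[i + t * rate] for t in range(len(kernel)))
        for i in range(len(signal) - span + 1)
    ]


signal = [1, 2, 3, 4, 5, 6, 7]
kernel = [1, 0, -1]  # same three weights at every rate

# Parallel branches with different rates, in the spirit of ASPP
branches = {rate: atrous_conv1d(signal, kernel, rate) for rate in (1, 2, 3)}
for rate, out in branches.items():
    print(rate, effective_kernel_size(len(kernel), rate), out)
```

With rate 1 the kernel compares values 2 positions apart, with rate 3 it compares values 6 positions apart, yet the number of weights never changes.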

---

## 🧩 b. **Instance Segmentation**

> Goal: Classify each pixel **and** separate **each object** instance.

### 🎭 1. **Mask R-CNN**

* 📦 Extends **Faster R-CNN** with a parallel branch that predicts a mask for each region of interest.
* 🎯 Gives **bounding box + class + mask** per object.
* ✅ One of the most widely used instance segmentation models.

> **Example**: Separates and labels each person in a crowd.
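
Mask R-CNN predicts a small fixed-size mask per detected box (for example 28×28), which is then resized to the box and thresholded. The sketch below shows that paste step with nearest-neighbor resizing and a 0.5 threshold; the function name, the 2x2 mask, and the box are hypothetical, and production code typically uses bilinear interpolation.

```python
def paste_mask(mask_probs, box, image_h, image_w, threshold=0.5):
    """Resize a low-resolution mask to `box` and binarize it in image coordinates.

    box = (x0, y0, x1, y1) with x1 and y1 exclusive.
    """
    x0, y0, x1, y1 = box
    m_h, m_w = len(mask_probs), len(mask_probs[0])
    full = [[0] * image_w for _ in range(image_h)]
    for y in range(max(y0, 0), min(y1, image_h)):
        for x in range(max(x0, 0), min(x1, image_w)):
            # Nearest-neighbor lookup into the low-resolution mask
            my = (y - y0) * m_h // (y1 - y0)
            mx = (x - x0) * m_w // (x1 - x0)
            if mask_probs[my][mx] >= threshold:
                full[y][x] = 1
    return full


# One hypothetical detection: box, class name, and a 2x2 mask of probabilities
detection = {
    "box": (1, 1, 5, 5),
    "label": "person",
    "mask_probs": [[0.9, 0.2], [0.7, 0.6]],
}
instance_mask = paste_mask(detection["mask_probs"], detection["box"], 6, 6)
for row in instance_mask:
    print(row)
```

Each detection gets its own binary mask, which is what lets two overlapping people be separated.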

---

### 🎯 2. **SOLO / SOLOv2 (Segmenting Objects by Locations)**

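* 🔲 Divides the image into an S×S grid; the cell containing an object's center is responsible for that object.
* ⚡ Box-free: **directly predicts masks** without first detecting bounding boxes.
* 🚀 Simpler pipeline than Mask R-CNN; SOLOv2 reports competitive speed and accuracy.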

> **Example**: Fast instance segmentation in real-time videos.
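
The grid assignment can be sketched as follows. This simplified version uses only the mask's center of mass to pick the responsible cell (SOLO also uses a small region around the center), and maps cell (i, j) to mask channel i·S + j. The function names and the toy mask are hypothetical.

```python
def center_of_mass(mask):
    """Mean (row, col) of all foreground pixels in a binary mask."""
    points = [(y, x) for y, row in enumerate(mask) for x, v in enumerate(row) if v]
    cy = sum(y for y, _ in points) / len(points)
    cx = sum(x for _, x in points) / len(points)
    return cy, cx


def responsible_cell(mask, grid_size):
    """Return (row, col, channel) of the grid cell that owns this object."""
    h, w = len(mask), len(mask[0])
    cy, cx = center_of_mass(mask)
    row = min(int(cy / h * grid_size), grid_size - 1)
    col = min(int(cx / w * grid_size), grid_size - 1)
    return row, col, row * grid_size + col


# Hypothetical 8x8 image with one object in the bottom-left quadrant
mask = [[1 if (y >= 4 and x < 4) else 0 for x in range(8)] for y in range(8)]
row, col, channel = responsible_cell(mask, grid_size=4)
print(row, col, channel)
```

With a 4x4 grid, this object lands in cell (2, 0), so its mask would be predicted by output channel 8 out of 16.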

---

## 🧩 c. **Panoptic Segmentation**

> Goal: Combine **semantic + instance segmentation**
> (e.g., label each countable **thing**, such as a person or car, separately, and amorphous **stuff**, such as sky or road, jointly)

### 🌍 1. **UPSNet (Unified Panoptic Segmentation Network)**

* 🎯 Combines semantic and instance segmentation heads on a shared backbone.
* 🧠 Includes a parameter-free **“panoptic head”** that merges their predictions.
* ⚖️ Trained end-to-end for the whole task.

> **Example**: Sky = one class; each person = separate mask.
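
The merged output format can be illustrated with a simple heuristic merge. This is not UPSNet's panoptic head: it is a hand-written rule that places instance masks in order of confidence and fills the rest with stuff classes. The class IDs, the `class_id * 1000 + instance_id` encoding, and all names are assumptions made for the example.

```python
VOID = 0  # assumed id for pixels that receive no label


def merge_panoptic(semantic, instances, stuff_classes, label_divisor=1000):
    """Merge a semantic map and instance masks into one panoptic id map."""
    h, w = len(semantic), len(semantic[0])
    panoptic = [[VOID] * w for _ in range(h)]
    next_id = {}  # per-class instance counter
    # 1) Things: higher-scoring instances claim pixels first
    for inst in sorted(instances, key=lambda d: d["score"], reverse=True):
        cls = inst["class_id"]
        next_id[cls] = next_id.get(cls, 0) + 1
        seg_id = cls * label_divisor + next_id[cls]
        for y in range(h):
            for x in range(w):
                if inst["mask"][y][x] and panoptic[y][x] == VOID:
                    panoptic[y][x] = seg_id
    # 2) Stuff: fill remaining pixels from the semantic prediction
    for y in range(h):
        for x in range(w):
            if panoptic[y][x] == VOID and semantic[y][x] in stuff_classes:
                panoptic[y][x] = semantic[y][x] * label_divisor
    return panoptic


# Hypothetical classes: 1 = sky (stuff), 2 = road (stuff), 3 = person (thing)
semantic = [[1, 1, 1, 1],
            [3, 3, 3, 3],
            [2, 2, 2, 2]]
instances = [
    {"class_id": 3, "score": 0.9, "mask": [[0] * 4, [1, 1, 0, 0], [0] * 4]},
    {"class_id": 3, "score": 0.8, "mask": [[0] * 4, [0, 1, 1, 1], [0] * 4]},
]
panoptic = merge_panoptic(semantic, instances, stuff_classes={1, 2})
for row in panoptic:
    print(row)
```

All sky pixels share the id 1000, while the two people get distinct ids 3001 and 3002; the overlapping pixel goes to the higher-scoring instance.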

---

### 📦 2. **DetectoRS**

* 📏 Combines **Recursive Feature Pyramid** with **Switchable Atrous Convolution**.
* 🔍 Primarily an object detection and instance segmentation model that also reports strong panoptic results.
* ✅ Advanced backbone + multi-task prediction.

> **Example**: Reported state-of-the-art COCO results for detection, instance segmentation, and panoptic segmentation at its release.

---

## 🧠 Summary Table

| Task                  | Model                       | Key Feature                        | Example Use Case                   |
| --------------------- | --------------------------- | ---------------------------------- | ---------------------------------- |
| Semantic Segmentation | FCN, U-Net, SegNet, DeepLab | Pixel-wise classification          | Medical imaging, satellite imagery |
| Instance Segmentation | Mask R-CNN, SOLOv2          | Pixel-wise + per-object separation | Self-driving cars, surveillance    |
| Panoptic Segmentation | UPSNet, DetectoRS           | Combines semantic + instance       | Robotics, urban scene parsing      |

---

