```{contents}
```
## Semantic Search

**Semantic search** retrieves information based on **meaning and intent**, not just keyword overlap.
It is powered by **embeddings** and **vector similarity**, enabling modern AI systems to find relevant content even when the exact words do not match.

---

### **1. Core Intuition**

Traditional search:

> Match words.

Semantic search:

> Match **meanings**.

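If a query and a document express the same idea in different words, semantic search can still match them. For example, the query "how do I reset my password" can retrieve a passage about "recovering account access" even though the two share no keywords.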

---

### **2. How Semantic Search Works**

#### **High-Level Workflow**

```
Documents → Embedding Model → Vector Database
Query → Embedding Model → Vector
Vector Similarity Search → Top-k Documents → Results
```

---

### **3. Embedding & Retrieval Pipeline**

#### **Indexing Phase**

1. Collect documents
2. Clean and chunk text
3. Generate embeddings
4. Store vectors + metadata in vector DB
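
Step 2 and the metadata part of step 4 can be sketched in a few lines. This is a minimal illustration, not a recommended configuration: the word-based splitting, the `chunk_size` and `overlap` defaults, and the `chunk_text` / `make_records` names are all assumptions made for the example. The embedding step is left out here.

```python
def chunk_text(text: str, chunk_size: int = 200, overlap: int = 40) -> list[str]:
    """Split text into chunks of at most `chunk_size` words.

    Consecutive chunks share `overlap` words so that a sentence cut at a
    chunk boundary still appears whole in at least one chunk.
    """
    if not 0 <= overlap < chunk_size:
        raise ValueError("overlap must satisfy 0 <= overlap < chunk_size")
    words = text.split()
    step = chunk_size - overlap
    chunks = []
    for start in range(0, len(words), step):
        chunks.append(" ".join(words[start:start + chunk_size]))
        if start + chunk_size >= len(words):
            break  # the current chunk already reached the end of the text
    return chunks


def make_records(doc_id: str, text: str, **chunk_kwargs) -> list[dict]:
    """Attach metadata to each chunk; a vector would be added per record later."""
    return [
        {"doc_id": doc_id, "chunk_id": i, "text": chunk}
        for i, chunk in enumerate(chunk_text(text, **chunk_kwargs))
    ]


if __name__ == "__main__":
    sample = " ".join(f"w{i}" for i in range(12))
    for record in make_records("faq", sample, chunk_size=5, overlap=2):
        print(record)
```

Keeping `doc_id` and `chunk_id` next to each chunk is what lets a search hit be traced back to its source document.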

#### **Query Phase**

1. Convert query to embedding
2. Perform similarity search
3. Rank results
4. Return best matches
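
The two phases above can be sketched end to end with an in-memory index. Everything here is an assumption made for illustration: `toy_embed` is a hashed bag-of-words stand-in that only captures word overlap, not meaning, so a real system would replace it with a trained embedding model, and the list-based index would be replaced by a vector database.

```python
import math
import zlib

DIM = 64  # toy vector size; real embedding models use hundreds of dimensions


def toy_embed(text: str) -> list[float]:
    """Stand-in for an embedding model: hashed word counts, L2-normalized.

    NOTE: this is lexical, not semantic. It only keeps the example runnable.
    """
    vec = [0.0] * DIM
    for word in text.lower().split():
        vec[zlib.crc32(word.encode("utf-8")) % DIM] += 1.0
    norm = math.sqrt(sum(x * x for x in vec))
    return [x / norm for x in vec] if norm else vec


def build_index(docs: list[str]) -> list[tuple[str, list[float]]]:
    """Indexing phase: embed every document and keep (text, vector) pairs."""
    return [(doc, toy_embed(doc)) for doc in docs]


def search(index, query: str, k: int = 2) -> list[tuple[float, str]]:
    """Query phase: embed the query, score every vector, return the top k."""
    q = toy_embed(query)
    scored = [(sum(a * b for a, b in zip(q, vec)), doc) for doc, vec in index]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return scored[:k]


if __name__ == "__main__":
    index = build_index([
        "reset your password from the account page",
        "shipping takes three to five days",
        "refunds are issued within a week",
    ])
    for score, doc in search(index, "how do I reset my password"):
        print(f"{score:.3f}  {doc}")
```

Because the vectors are unit-normalized, the dot product used for scoring equals cosine similarity. The exhaustive scan is fine for a handful of documents; larger collections typically rely on an approximate nearest-neighbor index instead.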

---

### **4. Similarity Metrics**

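| Metric             | Use Case                                                                  |
| ------------------ | ------------------------------------------------------------------------- |
| Cosine similarity  | Most common; compares direction and ignores vector length                 |
| Dot product        | Fast to score (e.g. on GPU); equals cosine similarity on unit vectors     |
| Euclidean distance | Geometry-based; straight-line distance, where lower means more similar    |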
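
The three metrics in the table can be written out directly. This is a plain-Python sketch for clarity (the function names are chosen for this example); production systems usually compute these in bulk with a numerical library or inside the vector database.

```python
import math


def dot_product(a: list[float], b: list[float]) -> float:
    """Sum of element-wise products; larger means more similar."""
    return sum(x * y for x, y in zip(a, b))


def cosine_similarity(a: list[float], b: list[float]) -> float:
    """Dot product divided by both vector lengths; ranges from -1 to 1."""
    norm_a = math.sqrt(dot_product(a, a))
    norm_b = math.sqrt(dot_product(b, b))
    if norm_a == 0.0 or norm_b == 0.0:
        raise ValueError("cosine similarity is undefined for a zero vector")
    return dot_product(a, b) / (norm_a * norm_b)


def euclidean_distance(a: list[float], b: list[float]) -> float:
    """Straight-line distance; smaller means more similar."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


if __name__ == "__main__":
    a = [1.0, 2.0, 2.0]
    b = [2.0, 4.0, 4.0]  # same direction as a, twice the length
    print(cosine_similarity(a, b))   # 1.0
    print(dot_product(a, b))         # 18.0
    print(euclidean_distance(a, b))  # 3.0
```

The example shows why the choice matters: `a` and `b` point in the same direction, so cosine similarity treats them as identical, while Euclidean distance still reports a gap of 3 because of the difference in length. When all vectors are normalized to unit length, the three metrics produce the same ranking.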

---

### **5. Why Semantic Search Is Superior**

| Capability                                           | Keyword Search | Semantic Search |
| ---------------------------------------------------- | -------------- | --------------- |
| Understands synonyms                                 | ❌              | ✅               |
| Understands intent                                   | ❌              | ✅               |
| Handles paraphrases                                  | ❌              | ✅               |
| Works across languages (with a multilingual model)   | ❌              | ✅               |

---

### **6. Applications**

| Domain            | Use Case               |
| ----------------- | ---------------------- |
| Enterprise Search | Knowledge assistants   |
| Customer Support  | FAQ & ticket retrieval |
| E-commerce        | Product search         |
| Legal             | Case law discovery     |
| Healthcare        | Clinical research      |
| Education         | Learning resources     |

---

### **7. Semantic Search in RAG Systems**

Semantic search retrieves **contextual knowledge** that is fed into an LLM for grounded generation.

```
User Query → Semantic Search → Relevant Docs → LLM → Answer
```
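
The flow above can be sketched as a small function that wires retrieval into generation. This is an illustrative outline only: `retrieve` and `generate` are hypothetical callables standing in for a semantic search backend and an LLM client, and the prompt wording is just one possible template.

```python
from typing import Callable


def build_prompt(query: str, docs: list[str]) -> str:
    """Place the retrieved passages ahead of the question as numbered context."""
    context = "\n\n".join(f"[{i}] {doc}" for i, doc in enumerate(docs, start=1))
    return (
        "Answer the question using only the context below.\n\n"
        f"Context:\n{context}\n\n"
        f"Question: {query}\nAnswer:"
    )


def rag_answer(
    query: str,
    retrieve: Callable[[str, int], list[str]],  # hypothetical: (query, k) -> passages
    generate: Callable[[str], str],             # hypothetical: prompt -> completion
    k: int = 3,
) -> str:
    """User Query -> Semantic Search -> Relevant Docs -> LLM -> Answer."""
    docs = retrieve(query, k)
    return generate(build_prompt(query, docs))


if __name__ == "__main__":
    # Stubs so the sketch runs without a vector database or an LLM.
    def fake_retrieve(query: str, k: int) -> list[str]:
        return ["Paris is the capital of France."][:k]

    def fake_generate(prompt: str) -> str:
        return f"(model output for a prompt of {len(prompt)} characters)"

    print(rag_answer("What is the capital of France?", fake_retrieve, fake_generate))
```

Passing the retriever and generator in as arguments keeps the orchestration independent of any particular vector store or model provider.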

---

### **8. Hybrid Search**

Modern systems combine:

* Keyword (BM25)
* Semantic (embeddings)

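Keyword matching catches exact terms such as product codes, names, and rare jargon, while embeddings catch paraphrases and related concepts. Combining the two result lists typically improves both recall and precision over either method alone.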
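
One common way to merge the keyword and semantic result lists is reciprocal rank fusion (RRF). The text above does not prescribe a fusion method, so treat this as one option among several; the document IDs are made up, and `k = 60` is the constant conventionally used with RRF.

```python
def reciprocal_rank_fusion(rankings: list[list[str]], k: int = 60) -> list[str]:
    """Fuse several ranked lists of document IDs into a single ranking.

    Each document scores 1 / (k + rank) for every list it appears in,
    so documents ranked well by multiple retrievers rise to the top.
    """
    scores: dict[str, float] = {}
    for ranking in rankings:
        for rank, doc_id in enumerate(ranking, start=1):
            scores[doc_id] = scores.get(doc_id, 0.0) + 1.0 / (k + rank)
    return sorted(scores, key=lambda doc_id: scores[doc_id], reverse=True)


if __name__ == "__main__":
    keyword_hits = ["d1", "d2", "d3"]   # e.g. ranked by BM25
    semantic_hits = ["d3", "d1", "d4"]  # e.g. ranked by embedding similarity
    print(reciprocal_rank_fusion([keyword_hits, semantic_hits]))
    # ['d1', 'd3', 'd2', 'd4']
```

RRF uses only rank positions, which sidesteps the problem that BM25 scores and cosine similarities live on different scales and cannot be added directly.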

---

### **9. System Design Considerations**

| Factor          | Impact            |
| --------------- | ----------------- |
| Chunk size      | Retrieval quality |
| Embedding model | Semantic accuracy |
| Index structure | Latency           |
| Re-ranking      | Final quality     |

---

### **10. Summary**

| Concept              | Description                                  |
| -------------------- | -------------------------------------------- |
| Semantic search      | Meaning-based retrieval                      |
| Core engine          | Embeddings + similarity                      |
| Primary benefit      | Intent understanding                         |
| Production relevance | Retrieval layer for RAG and search products  |
