## Zero-Shot and Few-Shot Learning

**Zero-shot learning** and **few-shot learning** are techniques that let a model, especially a large language model (LLM), perform a new task with **no task-specific examples** or with **only a handful of examples**, and without any task-specific training.

They are foundational to the flexibility of modern generative AI systems.

---

### **1. Core Intuition**

Traditional ML:

> Train a model for each task with labeled data.

Modern LLMs:

> Use a single pretrained model and describe the task in the prompt.

The model **infers the task** from the prompt.

---

### **2. Zero-Shot Learning**

#### **Definition**

The model performs a task **without any worked examples in the prompt**, relying only on the instructions.

#### **Example**

```text
Classify the sentiment of the following sentence as Positive or Negative:
"I love this product."
```
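A zero-shot prompt like the one above is just an instruction plus the input, so it can be assembled with plain string formatting. The following is a minimal sketch; the function name `build_zero_shot_prompt` and its parameters are hypothetical and chosen only for illustration, and no model is called.

```python
def build_zero_shot_prompt(text: str, labels: list[str]) -> str:
    """Return an instruction-only prompt; no worked examples are included."""
    label_list = " or ".join(labels)
    return (
        f"Classify the sentiment of the following sentence as {label_list}:\n"
        f'"{text}"'
    )


# Hypothetical usage: the resulting string is what would be sent to a model.
zero_shot_prompt = build_zero_shot_prompt(
    "I love this product.", ["Positive", "Negative"]
)
print(zero_shot_prompt)
```

Because the task is described entirely in the instruction, switching to a different task only means changing the instruction text.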

#### **How It Works**

During pretraining, the model learns general language and task patterns, so at inference time it can map an unfamiliar instruction onto behavior it has already learned.

#### **When to Use**

* Simple, well-defined tasks
* When no labeled examples are available
* Rapid prototyping

---

### **3. Few-Shot Learning**

#### **Definition**

The model is shown **a small number of examples** in the prompt to guide behavior.

#### **Example**

```text
Translate English to French:

English: Hello
French: Bonjour

English: Good morning
French: Bonjour

English: Thank you
French:
```
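The prompt above follows a regular pattern: an instruction, a block per example, and a final block with the answer left blank. A minimal sketch of building it from a list of pairs is shown below; `build_few_shot_prompt` and its parameter names are hypothetical, and the code only constructs the string without calling a model.

```python
def build_few_shot_prompt(
    instruction: str,
    examples: list[tuple[str, str]],
    query: str,
    source_label: str = "English",
    target_label: str = "French",
) -> str:
    """Assemble instruction, worked examples, and an unanswered query."""
    lines = [instruction, ""]
    for source, target in examples:
        lines.append(f"{source_label}: {source}")
        lines.append(f"{target_label}: {target}")
        lines.append("")  # blank line between example blocks
    # The final block leaves the target empty for the model to complete.
    lines.append(f"{source_label}: {query}")
    lines.append(f"{target_label}:")
    return "\n".join(lines)


few_shot_prompt = build_few_shot_prompt(
    "Translate English to French:",
    [("Hello", "Bonjour"), ("Good morning", "Bonjour")],
    "Thank you",
)
print(few_shot_prompt)
```

Keeping every example in exactly the same layout as the final query is the design choice that matters here: the model is expected to continue the pattern it has just seen.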

#### **Why It Helps**

Examples clarify:

* Task format
* Output style
* Edge cases

#### **When to Use**

* Complex or ambiguous tasks
* Domain-specific outputs
* Structured generation

---

### **4. Comparison**

| Aspect                 | Zero-Shot                    | Few-Shot                            |
| ---------------------- | ---------------------------- | ----------------------------------- |
| Examples in prompt     | None                         | A few (typically 2–10)              |
| Prompt length          | Short                        | Longer                              |
| Reliability            | Often lower on complex tasks | Usually higher, given good examples |
| Token cost per request | Lower                        | Higher                              |
| Use-case complexity    | Simple tasks                 | Complex tasks                       |
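The prompt-length and cost rows can be made concrete with a rough size comparison. The sketch below counts whitespace-separated words as a crude stand-in for tokens; this is an assumption for illustration only, since real tokenizers split text differently, and `rough_token_count` is a hypothetical helper.

```python
def rough_token_count(prompt: str) -> int:
    """Crude size estimate: whitespace-separated words, not real tokens."""
    return len(prompt.split())


zero_shot = "Translate English to French:\n\nEnglish: Thank you\nFrench:"

few_shot = (
    "Translate English to French:\n\n"
    "English: Hello\nFrench: Bonjour\n\n"
    "English: Good morning\nFrench: Bonjour\n\n"
    "English: Thank you\nFrench:"
)

zero_size = rough_token_count(zero_shot)
few_size = rough_token_count(few_shot)
print(f"zero-shot: {zero_size} words, few-shot: {few_size} words")
```

Even with only two short examples, the few-shot prompt is roughly twice the size of the zero-shot one, and that extra input is paid for on every request.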

---

### **5. Relationship to In-Context Learning**

Both are forms of **in-context learning**:
the model adapts behavior **within the prompt itself**, without changing model parameters.
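One way to see this is that zero-shot is simply the few-shot case with zero examples: the same call path is used, and nothing about the model changes between tasks. The sketch below assumes a hypothetical `Completer` type (a function from prompt to completion) and uses a stub in place of a real model; none of these names come from a real library.

```python
from typing import Callable, Sequence

# Hypothetical interface: any function that maps a prompt to a completion.
Completer = Callable[[str], str]


def run_in_context(
    complete: Completer,
    instruction: str,
    query: str,
    examples: Sequence[tuple[str, str]] = (),
) -> str:
    """Run a task purely through the prompt; no parameters are updated."""
    parts = [instruction]
    for example_input, example_output in examples:
        parts.append(f"Input: {example_input}\nOutput: {example_output}")
    parts.append(f"Input: {query}\nOutput:")
    return complete("\n\n".join(parts))


def stub_model(prompt: str) -> str:
    """Stand-in for a real model: reports how many worked examples it saw."""
    shots = prompt.count("Output:") - 1  # the last "Output:" is the query
    return f"{shots}-shot prompt"


instruction = "Classify the sentiment as Positive or Negative."
print(run_in_context(stub_model, instruction, "I love it"))
print(run_in_context(
    stub_model,
    instruction,
    "I love it",
    examples=[("Great value", "Positive"), ("Broke in a day", "Negative")],
))
```

The only thing that differs between the two calls is the `examples` argument; the model function itself is untouched.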

---

### **6. Why This Matters**

These techniques enable:

* Task generalization
* Rapid development
* Minimal labeled data dependency
* One-model-for-many-tasks systems

---

### **7. Limitations**

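* Prompt length constraints: every example consumes space in the model's context window
* Sensitivity to example quality: poor or unrepresentative examples can mislead the model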
* Not a replacement for fine-tuning in high-stakes systems
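
The prompt-length constraint is often handled by capping how many examples go into the prompt. The sketch below is one simple approach, not a standard algorithm: it keeps examples in order until a word budget is used up. The helper `fit_examples` and the word-based budget are assumptions for illustration; a real system would count tokens with the model's own tokenizer.

```python
def fit_examples(
    examples: list[tuple[str, str]], budget_words: int
) -> list[tuple[str, str]]:
    """Keep leading examples whose combined word count fits the budget."""
    kept: list[tuple[str, str]] = []
    used = 0
    for source, target in examples:
        cost = len(source.split()) + len(target.split())
        if used + cost > budget_words:
            break  # stop at the first example that would exceed the budget
        kept.append((source, target))
        used += cost
    return kept


candidates = [
    ("Hello", "Bonjour"),
    ("Good morning", "Bonjour"),
    ("Thank you very much", "Merci beaucoup"),
]
selected = fit_examples(candidates, budget_words=5)
print(selected)
```

Since examples are kept in order, the most informative ones should be placed first in the candidate list.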

---

### **8. Summary**

| Concept        | Description                                      |
| -------------- | ------------------------------------------------ |
| Zero-shot      | Perform a task from instructions alone           |
| Few-shot       | Perform a task guided by a few in-prompt examples |
| Key capability | In-context adaptation                            |
| Impact         | Reduces the need for task-specific retraining    |
