
---

# ⭐ **Module 1: LLM Foundations**

---

# **1.1 LLM Basics**

## **🔹 What Are Tokens?**

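**Token = a small unit of text (a word, part of a word, or a punctuation mark) that the model reads.**

Examples (exact counts depend on the model's tokenizer):

* “Apple” → typically 1 token
* “Intelligence” → 1 or 2 tokens, depending on the tokenizer
* “I love India” → typically 3 tokens
* “नमस्ते” → often several tokens, because non-Latin scripts tend to be split into smaller pieces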

Why tokens matter:

* Token count affects **cost**
* Token count affects **speed**
* Token limit defines how much context the model can "see"
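
To make the link between token count and cost concrete, here is a minimal sketch. It assumes the common rule of thumb of roughly 4 characters per token for English text, and the price passed in is a made-up placeholder; a real tokenizer and a provider's actual price list will give different numbers.

```python
import math

# Rule-of-thumb assumption: ~4 characters per token for English text.
CHARS_PER_TOKEN = 4

def estimate_tokens(text: str) -> int:
    """Rough token estimate; a real tokenizer will give different counts."""
    return math.ceil(len(text) / CHARS_PER_TOKEN)

def estimate_cost(text: str, usd_per_million_tokens: float) -> float:
    """Estimated input cost in USD at a hypothetical price."""
    return estimate_tokens(text) * usd_per_million_tokens / 1_000_000

prompt = "Summarize the attached report in three bullet points."
print(estimate_tokens(prompt))
print(estimate_cost(prompt, usd_per_million_tokens=2.50))  # placeholder price
```

For exact counts, use the tokenizer that belongs to the model you are calling.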

---

## **🔹 What Is a Context Window?**

**Context window = the maximum number of tokens the model can process at one time.**

Think of it like the model’s “working memory.”

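Examples (limits change between model versions):

* GPT-4o: 128K tokens
* Llama 3: 8K tokens (Llama 3.1: 128K tokens)
* Gemini 1.5: 1M tokens, up to 2M for Gemini 1.5 Pro (among the largest at the time of writing)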

Why context windows matter:

* Determines how long a prompt can be
* Determines how many pages of a PDF the model can read
* Limits how much an agent can remember in one task
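
One common way to live within a context window is to drop the oldest messages once the budget is used up. The sketch below illustrates that idea only; `fit_to_window` and `word_count` are hypothetical helpers, and counting one token per word is a stand-in for a real tokenizer.

```python
from typing import Callable

def fit_to_window(messages: list[str], max_tokens: int,
                  count_tokens: Callable[[str], int]) -> list[str]:
    """Keep the most recent messages whose total token count fits the budget."""
    kept: list[str] = []
    used = 0
    for message in reversed(messages):  # walk from newest to oldest
        cost = count_tokens(message)
        if used + cost > max_tokens:
            break
        kept.append(message)
        used += cost
    kept.reverse()  # restore chronological order
    return kept

# Stand-in counter: one token per whitespace-separated word (an assumption).
def word_count(text: str) -> int:
    return len(text.split())

history = ["hello there", "please summarize this long report", "thanks"]
print(fit_to_window(history, max_tokens=6, count_tokens=word_count))
```

With a budget of 6, the oldest message is dropped and the two most recent ones are kept.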

---

## **🔹 What Are Embeddings?**

**Embedding = a numerical vector representation of text, images, or audio.**

More simply:

> **Embedding = how AI “understands” meaning.**

Example:

```
Embedding("dog")  → similar numbers → Embedding("pet")
Embedding("investment")  → similar numbers → Embedding("finance")
```

Embeddings turn text into **numbers** so that computers can compute:

* Similarity
* Context
* Meaning
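
Similarity between two embeddings is usually measured with cosine similarity. The sketch below uses tiny hand-made 3-dimensional vectors, invented purely for illustration; real embeddings come from an embedding model and have hundreds or thousands of dimensions.

```python
import math

def cosine_similarity(a: list[float], b: list[float]) -> float:
    """Cosine of the angle between two vectors: 1.0 means same direction."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)

# Toy vectors, invented for illustration (not output of a real model).
toy = {
    "dog": [0.9, 0.8, 0.1],
    "pet": [0.8, 0.9, 0.2],
    "finance": [0.1, 0.2, 0.9],
}
print(cosine_similarity(toy["dog"], toy["pet"]))      # close to 1: related
print(cosine_similarity(toy["dog"], toy["finance"]))  # much lower: unrelated
```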

### Why embeddings are important in agent systems:

Agents need embeddings for tasks like:

* RAG (Retrieval Augmented Generation)
* Semantic search
* Long-term memory
* Categorization
* Document similarity
* Multimodal memory (images + text)
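
Most of the tasks above reduce to the same operation: find the stored vectors closest to a query vector. Here is a minimal sketch of that retrieval step, as used in semantic search and RAG. The document ids and vectors are hypothetical, and a production system would typically use a vector database instead of a plain dictionary.

```python
import math

def normalize(v: list[float]) -> list[float]:
    """Scale a vector to unit length so a dot product equals cosine similarity."""
    length = math.sqrt(sum(x * x for x in v))
    return [x / length for x in v]

def top_k(query: list[float], index: dict[str, list[float]], k: int) -> list[str]:
    """Return the k document ids whose vectors point closest to the query."""
    q = normalize(query)
    scored = []
    for doc_id, vec in index.items():
        d = normalize(vec)
        scored.append((sum(a * b for a, b in zip(q, d)), doc_id))
    scored.sort(reverse=True)  # highest similarity first
    return [doc_id for _, doc_id in scored[:k]]

# Hypothetical document vectors; a real system gets them from an embedding model.
index = {
    "refund_policy": [0.9, 0.1, 0.0],
    "shipping_times": [0.2, 0.9, 0.1],
    "privacy_notice": [0.0, 0.2, 0.9],
}
query_vector = [0.8, 0.3, 0.0]  # stands in for an embedded user question
print(top_k(query_vector, index, k=2))
```

In a RAG pipeline, the text of the returned documents would then be placed into the prompt.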

### Key idea:

**LLMs generate text; embeddings let a system *measure relationships between texts*.**

---

## **🔹 Model Selection**

### **What is a model?**

**A model = a neural network trained to understand and generate language.**

We choose models based on:

* Speed
* Cost
* Reasoning power
* Context window
* Multimodal abilities (images, audio, video)
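
These criteria can be turned into a simple selection rule: filter on hard requirements, then pick the cheapest model that remains. The sketch below is one possible approach, not a standard method; the catalog names and numbers are invented placeholders, not real models or prices.

```python
from dataclasses import dataclass

@dataclass
class ModelSpec:
    name: str
    context_tokens: int
    usd_per_million_tokens: float
    multimodal: bool

def pick_model(candidates, min_context, need_multimodal):
    """Return the cheapest candidate meeting the hard requirements, or None."""
    eligible = [m for m in candidates
                if m.context_tokens >= min_context
                and (m.multimodal or not need_multimodal)]
    return min(eligible, key=lambda m: m.usd_per_million_tokens, default=None)

# Placeholder catalog: names and numbers are invented, not real pricing.
catalog = [
    ModelSpec("small-fast", 8_000, 0.20, False),
    ModelSpec("mid-vision", 128_000, 2.00, True),
    ModelSpec("large-context", 1_000_000, 5.00, True),
]
choice = pick_model(catalog, min_context=100_000, need_multimodal=True)
print(choice.name if choice else "no model fits")
```

Softer criteria such as reasoning quality are harder to express as a number and are usually judged by testing each model on your own tasks.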

### Model types:

#### GPT-4o

* Strong reasoning
* Strong tool calling
* Good for agents

#### Claude 3.5

* Excellent reasoning & analysis
* Strong emphasis on safety
* Good for long documents

#### Gemini 1.5

* Strong multimodal support (text, images, audio, video)
* Well suited to document-heavy agents
* Massive context window

#### Llama 3

* Open weights (released under Meta's community license)
* Good for self-hosting
* Can run on your own hardware without a cloud API

---

# **1.2 Prompt Engineering**

## **🔹 What Is a Prompt?**

**Prompt = the input instructions given to an LLM.**

Types:

* System prompt
* User prompt
* Tool instructions
* Output instructions

For agents, we write prompts that instruct the model to:

* Choose tools
* Output JSON
* Follow rules
* Maintain structure

---

## **🔹 System Prompts for Agents**

Definition:

**System prompt = the “role + rules” that control agent behavior.**

Example:

```
You are an AI agent.
Use tools when necessary.
Always respond in valid JSON.
Never hallucinate tool names.
```

---

## **🔹 Tool Calling-Friendly Prompting**

Definition:

**Tool calling = the model selecting a tool and returning arguments for that tool in structured JSON.**

A good tool-calling prompt:

* Defines **all tools**
* Specifies **when** each tool should be used
* Requires **valid JSON**
* Discourages hallucinated tool names and arguments
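
A prompt with these four properties can be generated from a list of tool definitions, so the prompt and the real tools never drift apart. This is a minimal sketch: the `TOOLS` entries and their `use_when` field are hypothetical, and many providers offer a native tool-calling API that can replace a hand-built prompt like this one.

```python
import json

# Hypothetical tool definitions; names and fields are illustrative only.
TOOLS = [
    {"name": "web_search", "use_when": "the answer needs current information",
     "parameters": {"query": "string"}},
    {"name": "calculator", "use_when": "the task involves arithmetic",
     "parameters": {"expression": "string"}},
]

def build_tool_prompt(tools: list[dict]) -> str:
    """Build a system prompt that lists every tool, when to use it, and the output format."""
    lines = ["You are an AI agent. You may use ONLY these tools:"]
    for tool in tools:
        lines.append(f"- {tool['name']}: use when {tool['use_when']}. "
                     f"Parameters: {json.dumps(tool['parameters'])}")
    lines.append('Respond with valid JSON: {"action": "use_tool", '
                 '"tool": <name>, "parameters": {...}}')
    lines.append("Never use a tool name that is not listed above.")
    return "\n".join(lines)

print(build_tool_prompt(TOOLS))
```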

---

## **🔹 JSON Output**

Definition:
**JSON = a structured data format that machines can easily parse.**

Agents need JSON for:

* Routing tools
* Returning structured data
* Making agent behavior more predictable

Example:

```
{
  "action": "use_tool",
  "tool": "web_search",
  "parameters": {
    "query": "weather in Mumbai"
  }
}
```
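
On the agent side, output like this has to be parsed, validated, and routed to the right function. The sketch below shows one way to do that for the message shape in the example above; `web_search` is a stub, and `TOOL_REGISTRY` and `route` are hypothetical names.

```python
import json

def web_search(query: str) -> str:
    # Stub standing in for a real search tool.
    return f"results for: {query}"

TOOL_REGISTRY = {"web_search": web_search}

def route(model_output: str) -> str:
    """Parse the model's JSON and call the named tool, rejecting bad output."""
    try:
        message = json.loads(model_output)
    except json.JSONDecodeError as err:
        raise ValueError(f"model output is not valid JSON: {err}") from err
    if not isinstance(message, dict) or message.get("action") != "use_tool":
        raise ValueError("expected an object with action 'use_tool'")
    tool = TOOL_REGISTRY.get(message.get("tool"))
    if tool is None:  # the model named a tool that does not exist
        raise ValueError(f"unknown tool: {message.get('tool')!r}")
    return tool(**message.get("parameters", {}))

raw = '{"action": "use_tool", "tool": "web_search", "parameters": {"query": "weather in Mumbai"}}'
print(route(raw))
```

Rejecting unknown tool names here is the code-level counterpart to telling the model never to invent them.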

---

# **1.3 Reasoning Strategies**

## **🔹 What Is Reasoning?**

**Reasoning = how the model thinks through a problem before producing the answer.**

Agents rely heavily on reasoning for:

* Planning tasks
* Fixing errors
* Choosing the correct tool
* Breaking down complex tasks

---

## **🔹 Chain-of-Thought (CoT)**

**CoT = the model works through intermediate reasoning steps before giving its final answer.**

Example internal reasoning:

```
Step 1: Understand question
Step 2: Identify needed tool
Step 3: Call tool or answer
```

This reasoning is usually hidden from the user and used internally to produce more accurate results.
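
One simple way to keep the reasoning internal is to ask the model for a clearly marked final line and show only that line to the user. The sketch below assumes a `Final answer:` marker, which is a convention chosen for this example, and uses a hand-written string in place of a real model response.

```python
COT_SUFFIX = (
    "Think through the problem step by step, "
    "then give the result on a last line starting with 'Final answer:'."
)

def split_reasoning(model_output: str) -> tuple[str, str]:
    """Separate the reasoning steps from the final answer line."""
    marker = "Final answer:"
    reasoning, found, answer = model_output.rpartition(marker)
    if not found:
        return "", model_output.strip()  # no marker: treat everything as the answer
    return reasoning.strip(), answer.strip()

# Hand-written stand-in for a model response.
sample = (
    "Step 1: The user asks for 15% of 80.\n"
    "Step 2: 0.15 * 80 = 12.\n"
    "Final answer: 12"
)
reasoning, answer = split_reasoning(sample)
print(answer)  # only this part is shown to the user
```

`COT_SUFFIX` would be appended to the prompt; the reasoning part can be logged for debugging.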

---

## **🔹 Tree-of-Thought (ToT)**

**ToT = the model explores multiple reasoning paths and selects the best one.**

Useful for:

* Planning agents
* Coding agents
* Research agents
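
The core loop of ToT can be sketched as a beam search: expand each partial path into several candidates, score them, and keep only the best few. This is a simplified illustration, not a full ToT implementation. In a real agent, `expand` and `score` would be model calls; here they are deterministic stand-ins on a toy spelling problem.

```python
from typing import Callable

def tree_of_thought(root: str,
                    expand: Callable[[str], list[str]],
                    score: Callable[[str], float],
                    beam_width: int, depth: int) -> str:
    """Level-by-level search that keeps only the best `beam_width` paths."""
    frontier = [root]
    for _ in range(depth):
        candidates = [child for path in frontier for child in expand(path)]
        if not candidates:
            break
        candidates.sort(key=score, reverse=True)
        frontier = candidates[:beam_width]  # prune weaker branches
    return max(frontier, key=score)

# Toy problem: build a 3-letter string that matches TARGET.
TARGET = "cat"

def expand(path: str) -> list[str]:
    """Propose next steps: append each candidate letter."""
    return [path + letter for letter in "act"]

def score(path: str) -> float:
    """Count positions that already match the target."""
    return sum(1 for a, b in zip(path, TARGET) if a == b)

print(tree_of_thought("", expand, score, beam_width=2, depth=3))
```

A wider beam explores more alternatives at the cost of more model calls.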

---

## **🔹 Self-Reflection Loops**

**Self-reflection = the model evaluates its own previous steps and improves.**

Example workflow:

```
1. Take action
2. Check result
3. Reflect on mistakes
4. Try again with corrections
```

Used in:

* Code fixer agents
* Research loops
* Multi-step planning systems
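
The four-step workflow above maps onto a small retry loop. In this sketch, `act` and `check` are toy stand-ins: a real code fixer agent would call a model in `act` and run tests or a linter in `check`. The names and the `max_rounds` limit are assumptions made for the example.

```python
from typing import Callable, Optional

def reflect_loop(act: Callable[[Optional[str]], str],
                 check: Callable[[str], Optional[str]],
                 max_rounds: int = 3) -> str:
    """Act, check the result, and retry with feedback until it passes."""
    feedback: Optional[str] = None
    result = ""
    for _ in range(max_rounds):
        result = act(feedback)    # 1. take action (using earlier feedback, if any)
        feedback = check(result)  # 2. check result
        if feedback is None:      # no problems found
            return result
        # 3-4. the feedback carries the reflection into the next attempt
    return result  # best effort after max_rounds

# Toy stand-in for a model: the first attempt has a bug, the retry fixes it.
def act(feedback: Optional[str]) -> str:
    return "print('hello')" if feedback else "print('hello'"

# Toy validator: reports a problem if the code does not compile.
def check(code: str) -> Optional[str]:
    try:
        compile(code, "<attempt>", "exec")
        return None
    except SyntaxError as err:
        return f"syntax error: {err.msg}"

print(reflect_loop(act, check))
```

The round limit matters: without it, an agent that never passes the check would loop forever.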

---

# ⭐ **Module 1 Summary (With Definitions)**

Students now understand:

✔ What tokens are
✔ What context windows are
✔ What embeddings are
✔ How to choose the right model
✔ How prompts guide agent behavior
✔ Why JSON and tool schemas matter
✔ How models think using CoT, ToT & reflection
