LLM stacks \ 4.5 External data (deep dive)

25.0717 (0628) Lab notes (Gdrive), Git

4.5-1 RAG
4.5-2 Tools
4.5-3 MCP

25.0628

External data is shown on the right in the diagram below.

See docx #411.

Notes

Screenshot of docx #411 (shown below):

Green = working demo (in (#411](https://drive.google.com/drive/folders/1-Adawag9uA8_bq-hDF-nOuPYaRLz1eEO))
blue = todo
grey = maybe not do

BYTEBYTEGO

RAG (Retrieval Augmented Generation) is a method that combines information retrieval with large language models to generate answers. Here’s how RAG works on a high level:

1 The model retrieves relevant data from data sources and then extracts it to a vector database from the pre-indexed model.

2 Augment the prompts by retrieving information and merging it with the query prompt.

3 A Large Language Model (like GPT, Claude, or Gemini) understands the combined query and generates the final response.

A traditional RAG has a simple retrieval, limited adaptability, and relies on static knowledge, making it less flexible for dynamic and real-time information.

Agentic RAG improves on this by introducing AI agents that can make decisions, select tools, and even refine queries for more accurate and flexible responses. Here’s how Agentic RAG works on a high level:

1 The user query is directed to an AI Agent for processing.

2 The agent uses short-term and long-term memory to track query context. It also formulates a retrieval strategy and selects appropriate tools for the job.

3 The data fetching process can use tools such as vector search, multiple agents, and MCP servers to gather relevant data from the knowledge base.

4 The agent then combines retrieved data with a query and system prompt. It passes this data to the LLM.

5 LLM processes the optimized input to answer the user’s query.

Over to you: What else will you add to better understand RAG vs Agentic RAG?

IBM RAG videos Keen

DEMO. ANALYZE. DOCUMENT. REPEAT.
26.0502 About the author, ZiptieAI.com, Substack, Lab notes (Gdrive), Git, Youtube, Deployments

Wiki home (ZiptieAI.com)

AI concepts 26.0409

Phase 4 Agentic AI

Phase 3 Robotic intelligence (2026)

Phase 2 LLM stacks (2024-2025)

Phase 1 AI drones (2023-2024)

About the author, Lab notes (Gdrive), Git, Substack, ZiptieAI.com, videos, GI

LLM stacks \ 4.5 External data (deep dive)

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!