-
Notifications
You must be signed in to change notification settings - Fork 1
LLM stacks \ 4.5 External data (deep dive)
25.0717 (0628) Lab notes (Gdrive), Git
25.0628
External data is shown on the right in the diagram below.
See docx #411.

Notes
Screenshot of docx #411 (shown below):
- Green = working demo (in (#411](https://drive.google.com/drive/folders/1-Adawag9uA8_bq-hDF-nOuPYaRLz1eEO))
- blue = todo
- grey = maybe not do
BYTEBYTEGO

RAG (Retrieval Augmented Generation) is a method that combines information retrieval with large language models to generate answers. Here’s how RAG works on a high level:
1 The model retrieves relevant data from data sources and then extracts it to a vector database from the pre-indexed model.
2 Augment the prompts by retrieving information and merging it with the query prompt.
3 A Large Language Model (like GPT, Claude, or Gemini) understands the combined query and generates the final response.
A traditional RAG has a simple retrieval, limited adaptability, and relies on static knowledge, making it less flexible for dynamic and real-time information.
Agentic RAG improves on this by introducing AI agents that can make decisions, select tools, and even refine queries for more accurate and flexible responses. Here’s how Agentic RAG works on a high level:
1 The user query is directed to an AI Agent for processing.
2 The agent uses short-term and long-term memory to track query context. It also formulates a retrieval strategy and selects appropriate tools for the job.
3 The data fetching process can use tools such as vector search, multiple agents, and MCP servers to gather relevant data from the knowledge base.
4 The agent then combines retrieved data with a query and system prompt. It passes this data to the LLM.
5 LLM processes the optimized input to answer the user’s query.
Over to you: What else will you add to better understand RAG vs Agentic RAG?
IBM RAG videos Keen
- https://www.youtube.com/watch?v=iVUMuC7OzUI&t=23s (what is ai search)
- https://www.youtube.com/watch?v=HdafI0t3sEY
- https://www.youtube.com/watch?v=zYGDpG-pTho&t=217s

DEMO. ANALYZE. DOCUMENT. REPEAT.
26.0502 About the author, ZiptieAI.com, Substack, Lab notes (Gdrive), Git, Youtube, Deployments
AI concepts 26.0409
Phase 4 Agentic AI
- 4-1 Concepts 26.0410
- 4-2 Demos 26.0425
- 4-3 Industry examples
Phase 3 Robotic intelligence (2026)
Phase 2 LLM stacks (2024-2025)
Phase 1 AI drones (2023-2024)
About the author, Lab notes (Gdrive), Git, Substack, ZiptieAI.com, videos, GI