-
Notifications
You must be signed in to change notification settings - Fork 0
Home
- Prompt Engineering
- RAG (Retrieval Augmented Generation)
- Embeddings + Vector
- DB Function Calling / Tools
🧠
Writing smart input (prompt) to get correct output from LLM
❌ Bad Prompt: Tell me about milk
✅ Good Prompt: You are a shop assistant.
Extract product name and quantity: Input: "2 milk and 1 bread"
Output JSON:
👉 Output becomes structured:
{ "milk": 2, "bread": 1 }
🧠
Retrieval-Augmented Generation (RAG) is an AI architecture that enhances large language models (LLMs) by allowing them to retrieve relevant external information before generating a response.
Instead of relying only on pre-trained knowledge, RAG enables models to access up-to-date, domain-specific, and private data sources, making responses more accurate and context-aware.
- Customer support chatbots
- Healthcare report summarization
- Legal and compliance systems
- Financial analysis tools
- Enterprise knowledge search systems
Traditional LLMs like GPT-style models generate responses based only on training data. This creates limitations:
- Knowledge cutoff (no real-time updates)
- Hallucinations (false or made-up answers)
- No access to private company data
- Lack of personalization/context
- Fetching real-time relevant data
- Grounding answers in actual documents
- Reducing hallucinations
- Keeping knowledge updated without retraining
Imagine two students preparing for an exam:
- Reads books once
- Answers from memory only
- Cannot verify facts
- Reads books
- Can open books during exam
- Verifies answers in real-time
Student 2 performs better because they can retrieve information when needed.
-
Reduces Hallucination
LLMs generate more factual and grounded responses. -
Keeps Knowledge Updated
Works with real-time and dynamic data sources.
-
Cost Efficient
Avoids expensive retraining or fine-tuning of models.
-
Data Privacy
Sensitive enterprise data stays within controlled systems.
-
Context Awareness
Personalized responses using user-specific data.
Example:
Airline chatbot knows your booking details (PNR, flight time, delay status)