-
Notifications
You must be signed in to change notification settings - Fork 0
Home
Amresh Verma edited this page May 28, 2026
·
52 revisions
- Prompt Engineering
- RAG (Retrieval Augmented Generation)
- Embeddings + Vector
- DB Function Calling / Tools
🧠
Writing smart input (prompt) to get correct output from LLM
❌ Bad Prompt: Tell me about milk
✅ Good Prompt: You are a shop assistant.
Extract product name and quantity: Input: "2 milk and 1 bread"
Output JSON:
👉 Output becomes structured:
{ "milk": 2, "bread": 1 }
🧠
Retrieval-Augmented Generation (RAG) is an AI architecture that enhances large language models (LLMs) by allowing them to retrieve relevant external information before generating a response.
Instead of relying only on pre-trained knowledge, RAG enables models to access up-to-date, domain-specific, and private data sources, making responses more accurate and context-aware.
- Customer support chatbots
- Healthcare report summarization
- Legal and compliance systems
- Financial analysis tools
- Enterprise knowledge search systems
Traditional LLMs like GPT-style models generate responses based only on training data. This creates limitations:
- Knowledge cutoff (no real-time updates)
- Hallucinations (false or made-up answers)
- No access to private company data
- Lack of personalization/context
- Fetching real-time relevant data
- Grounding answers in actual documents
- Reducing hallucinations
- Keeping knowledge updated without retraining