# Interview Preparation

## Key Interview Questions & Answers

### Architecture Questions

**Q: "Walk me through your RAG architecture end-to-end."**

A: "Our system has three main stages:
1. **Ingestion**: Documents are split with semantic chunking, then embedded using [model]. The chunks are stored in a vector database, and a BM25 index is built over the same chunks.
2. **Retrieval**: On query, we do hybrid search combining BM25 (keyword) and semantic search, fuse results with RRF, then rerank with a cross-encoder.
3. **Generation**: Retrieved chunks are formatted into context, passed to the LLM with the query, and the response is generated with citations."
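
To make the fusion step in the retrieval stage concrete, here is a minimal sketch of Reciprocal Rank Fusion (RRF). The function name, the document IDs, and the constant `k=60` (a commonly used default) are illustrative assumptions, not details of the actual system.

```python
from collections import defaultdict


def reciprocal_rank_fusion(ranked_lists, k=60):
    """Fuse several ranked lists of document IDs into a single ranking.

    Each document scores 1 / (k + rank) per list it appears in, so only
    rank positions matter, not the raw BM25 or cosine scores.
    """
    scores = defaultdict(float)
    for ranking in ranked_lists:
        for rank, doc_id in enumerate(ranking, start=1):
            scores[doc_id] += 1.0 / (k + rank)
    # Highest fused score first; ties are broken by ID for determinism.
    return sorted(scores, key=lambda doc_id: (-scores[doc_id], doc_id))


# Hypothetical results from the two retrievers (best hit first).
bm25_hits = ["doc_a", "doc_b", "doc_c"]
semantic_hits = ["doc_b", "doc_d", "doc_a"]

fused = reciprocal_rank_fusion([bm25_hits, semantic_hits])
print(fused)  # ['doc_b', 'doc_a', 'doc_d', 'doc_c']
```

Documents that both retrievers rank highly (`doc_b`, `doc_a`) rise to the top, which is the behavior worth describing when asked why RRF is used instead of mixing raw scores.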

**Q: "What makes your system 'advanced' compared to basic RAG?"**

A: "Three key improvements:
1. **Hybrid search** catches both exact matches (BM25) and semantic similarity
2. **Reranking** improves precision by scoring each query-chunk pair jointly with a cross-encoder
3. **Agentic capability** lets the LLM decide when to use RAG vs. direct answering"
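
The reranking idea can be sketched as a second scoring pass over the candidates. In the sketch below, `overlap_score` is a toy stand-in for a cross-encoder (it only counts shared words); a real system would plug a model's query-chunk score into `score_fn`. All names and sample texts are hypothetical.

```python
def overlap_score(query, chunk):
    """Toy stand-in for a cross-encoder: fraction of query words in the chunk."""
    query_words = set(query.lower().split())
    chunk_words = set(chunk.lower().split())
    if not query_words:
        return 0.0
    return len(query_words & chunk_words) / len(query_words)


def rerank(query, chunks, score_fn=overlap_score, top_n=3):
    """Score every (query, chunk) pair and keep the top_n chunks."""
    scored = [(score_fn(query, chunk), chunk) for chunk in chunks]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [chunk for _, chunk in scored[:top_n]]


candidates = [
    "Annual plans can be refunded within 30 days.",
    "Our refund policy covers annual plans only.",
    "Contact support for billing questions.",
]
top_chunks = rerank("refund policy for annual plans", candidates, top_n=2)
print(top_chunks)
```

Because the scorer sees the query and the chunk together, it can reorder candidates that the first-stage retrievers ranked only by independent signals.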

### Technical Deep-dives

**Q: "Why did you choose your chunking strategy?"**

A: "We use [semantic/recursive] chunking because:
- It preserves semantic coherence within chunks
- It handles variable document structures
- A chunk size of ~500 tokens balances context and precision"
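
As a rough illustration of the recursive flavor of chunking, the sketch below packs whole paragraphs into a chunk when they fit and falls back to an overlapping sliding window for oversized paragraphs. It is a simplification under stated assumptions: tokens are approximated by whitespace-separated words, and `chunk_text`, `max_tokens`, and `overlap` are hypothetical names, not the project's actual implementation.

```python
def chunk_text(text, max_tokens=500, overlap=50):
    """Split text into chunks, preferring paragraph boundaries."""
    if not 0 <= overlap < max_tokens:
        raise ValueError("overlap must be in [0, max_tokens)")

    chunks, current = [], []
    for paragraph in text.split("\n\n"):
        words = paragraph.split()
        if not words:
            continue
        # Keep packing paragraphs while the chunk stays within budget.
        if len(current) + len(words) <= max_tokens:
            current.extend(words)
            continue
        if current:
            chunks.append(" ".join(current))
            current = []
        if len(words) <= max_tokens:
            current = words
            continue
        # Oversized paragraph: fall back to an overlapping sliding window.
        step = max_tokens - overlap
        for start in range(0, len(words), step):
            chunks.append(" ".join(words[start:start + max_tokens]))
            if start + max_tokens >= len(words):
                break
    if current:
        chunks.append(" ".join(current))
    return chunks


sample = "a b c\n\nd e\n\n" + " ".join(str(i) for i in range(10))
sample_chunks = chunk_text(sample, max_tokens=5, overlap=1)
print(sample_chunks)  # ['a b c d e', '0 1 2 3 4', '4 5 6 7 8', '8 9']
```

The overlap keeps a little shared context between neighboring windows, which is one way to explain the trade-off between chunk size and retrieval precision.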

**Q: "How do you handle the 'lost in the middle' problem?"**

A: "LLMs often miss information in the middle of long contexts. We:
- Limit context to most relevant chunks
- Order chunks strategically (most relevant at the start and end of the context, weakest in the middle)
- Use reranking to ensure top chunks are most relevant"
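
One possible way to implement the ordering idea is to alternate ranked chunks between the front and the back of the context. This is a sketch of that heuristic only; the function name and the `max_chunks` limit are assumptions for illustration.

```python
def order_for_long_context(chunks_by_relevance, max_chunks=6):
    """Place the strongest chunks at the edges of the context.

    Input is ordered best-first. Even-indexed chunks fill the front,
    odd-indexed chunks fill the back in reverse, so the weakest chunks
    end up in the middle.
    """
    top = chunks_by_relevance[:max_chunks]  # limit context to the best chunks
    front, back = [], []
    for index, chunk in enumerate(top):
        if index % 2 == 0:
            front.append(chunk)
        else:
            back.append(chunk)
    return front + back[::-1]


ranked = ["c1", "c2", "c3", "c4", "c5"]  # c1 is the most relevant
ordered = order_for_long_context(ranked)
print(ordered)  # ['c1', 'c3', 'c5', 'c4', 'c2']
```

Here the best chunk opens the context, the second best closes it, and the least relevant chunk sits in the middle, where the model is most likely to overlook it.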

**Q: "What is an AI Agent in your system?"**

A: "An agent is LLM + Tools + Decision capability. Our agent has access to:
- RAG search tool for document queries
- [Other tools if applicable]

The agent decides when to call each tool based on the query."
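
The "LLM + Tools + Decision" framing can be sketched as a small loop around a tool registry. In this sketch the decision step is a keyword heuristic (`stub_decide`) standing in for the LLM, and `rag_search` returns a placeholder string; both are hypothetical and exist only to show the control flow.

```python
def rag_search(query):
    """Hypothetical RAG tool; a real one would run the retrieval pipeline."""
    return f"[retrieved context for: {query}]"


# Tool registry: maps a tool name to a callable.
TOOLS = {"rag_search": rag_search}


def stub_decide(query, observations):
    """Stand-in for the LLM's decision step (a heuristic, not a model)."""
    if observations:
        return {"type": "final", "answer": f"Answer based on {observations[-1]}"}
    if "document" in query.lower():
        return {"type": "tool", "name": "rag_search", "input": query}
    return {"type": "final", "answer": "Answered directly without retrieval."}


def run_agent(query, decide=stub_decide, max_steps=3):
    """Loop: decide, optionally call a tool, observe, repeat."""
    observations = []
    for _ in range(max_steps):
        decision = decide(query, observations)
        if decision["type"] == "final":
            return decision["answer"]
        tool = TOOLS[decision["name"]]
        observations.append(tool(decision["input"]))
    return "Stopped: step limit reached."


print(run_agent("What does the document say about refunds?"))
print(run_agent("What is 2 + 2?"))
```

In a real system the `decide` callable would wrap a model call with tool definitions, but the surrounding loop (decide, act, observe, stop on a final answer or a step limit) stays the same.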

### Evaluation Questions

**Q: "How do you evaluate your RAG system?"**

A: "We evaluate both retrieval and generation:
- **Retrieval**: Recall@k, Precision@k, MRR
- **Generation**: LLM-as-a-judge for faithfulness, relevance, helpfulness
- **Debugging**: If answers are bad, we first check retrieval, then generation"
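
The retrieval metrics are simple enough to compute by hand, which helps when asked to define them. Below is a minimal sketch; the function names and the sample document IDs are illustrative assumptions.

```python
def recall_at_k(retrieved, relevant, k):
    """Share of all relevant documents that appear in the top k results."""
    relevant = set(relevant)
    if not relevant:
        return 0.0
    return len(set(retrieved[:k]) & relevant) / len(relevant)


def precision_at_k(retrieved, relevant, k):
    """Share of the top k results that are relevant."""
    if k <= 0:
        return 0.0
    relevant = set(relevant)
    hits = sum(1 for doc_id in retrieved[:k] if doc_id in relevant)
    return hits / k


def mean_reciprocal_rank(runs):
    """Average of 1 / rank of the first relevant hit, over all queries.

    `runs` is a list of (retrieved, relevant) pairs, one per query.
    A query with no relevant hit contributes 0.
    """
    if not runs:
        return 0.0
    total = 0.0
    for retrieved, relevant in runs:
        relevant = set(relevant)
        for rank, doc_id in enumerate(retrieved, start=1):
            if doc_id in relevant:
                total += 1.0 / rank
                break
    return total / len(runs)


# Hypothetical labeled queries: (retrieved IDs best-first, relevant IDs).
query_1 = (["d3", "d1", "d7", "d2"], {"d1", "d2"})
query_2 = (["d5", "d9"], {"d5"})

print(recall_at_k(*query_1, k=3))                # 0.5
print(precision_at_k(*query_1, k=3))             # 0.333...
print(mean_reciprocal_rank([query_1, query_2]))  # 0.75
```

For the first query only one of the two relevant documents is in the top 3, so recall is 0.5 and precision is 1/3; its first relevant hit is at rank 2, which averages with the rank-1 hit of the second query to an MRR of 0.75.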

## Resume Bullet Templates

```text
• Designed and implemented Advanced RAG system with hybrid search 
  (BM25 + semantic), achieving [X]% improvement in retrieval accuracy

• Built agentic AI system using [OpenAI/Anthropic] tool use, enabling 
  dynamic tool selection based on query analysis

• Implemented evaluation pipeline using LLM-as-a-judge framework, 
  measuring faithfulness, relevance, and helpfulness metrics

• Optimized chunking strategy using [semantic/recursive] approach, 
  improving context relevance by [X]%

• Integrated cross-encoder reranking to improve precision of retrieved 
  documents from [X]% to [Y]%
```

## Concepts to Know Cold

| Concept | One-liner |
|---------|-----------|
| **RAG** | Retrieve relevant context, then generate with it |
| **Chunking** | Break documents into searchable pieces |
| **Embeddings** | Dense vector representations of text |
| **BM25** | Keyword-based sparse retrieval |
| **Semantic Search** | Meaning-based dense retrieval |
| **RRF** | Reciprocal Rank Fusion: merges ranked lists using rank positions, not raw scores |
| **Reranking** | Second-pass scoring for precision |
| **Agent** | LLM + Tools + Decision capability |
| **ReAct** | Thought → Action → Observation loop |
| **MCP** | Model Context Protocol: a standard protocol for exposing tools to LLMs |
| **SFT** | Supervised fine-tuning on labeled examples |
| **DPO** | Direct Preference Optimization: learning from preference pairs |
| **LoRA** | Low-Rank Adaptation: parameter-efficient fine-tuning |
| **LLM-as-Judge** | Using LLM to evaluate outputs |



## The Project Positioning

> **"Advanced RAG Agent Chat System"** is a production-ready AI project combining:
> - Hybrid search (BM25 + semantic)
> - Reranking for precision
> - Agentic tool use
> - Evaluation capabilities

## The Full Pipeline

```text
┌──────────────────────────────────────────────────────────────────────┐
│                    COMPLETE RAG AGENT SYSTEM                         │
├──────────────────────────────────────────────────────────────────────┤
│                                                                      │
│  ┌──────────┐                                                        │
│  │   User   │                                                        │
│  │  Query   │                                                        │
│  └────┬─────┘                                                        │
│       │                                                              │
│       ▼                                                              │
│  ┌────────────────────────────────────────────────────────────┐      │
│  │                         AGENT                              │      │
│  │  ┌─────────────┐   ┌─────────────┐   ┌─────────────┐       │      │
│  │  │   Reason    │──▶│   Decide    │──▶│    Act      │       │      │
│  │  │  (Thought)  │   │(Tool Select)│   │ (Execute)   │       │      │
│  │  └─────────────┘   └─────────────┘   └──────┬──────┘       │      │
│  └─────────────────────────────────────────────┼──────────────┘      │
│                                                │                     │
│       ┌────────────────────────────────────────┼──────────────┐      │
│       │                                        ▼              │      │
│       │  ┌────────────────────────────────────────────────┐   │      │
│       │  │              ADVANCED RAG PIPELINE             │   │      │
│       │  │                                                │   │      │
│       │  │  Query ──▶ Hybrid Search ──▶ RRF ──▶ Rerank    │   │      │
│       │  │              │                        │        │   │      │
│       │  │         ┌────┴────┐                   │        │   │      │
│       │  │         │         │                   │        │   │      │
│       │  │       BM25    Semantic                │        │   │      │
│       │  │         │         │                   │        │   │      │
│       │  │         └────┬────┘                   │        │   │      │
│       │  │              ▼                        ▼        │   │      │
│       │  │        Vector DB              Cross-Encoder    │   │      │
│       │  └────────────────────────────────────────────────┘   │      │
│       │                          TOOLS                        │      │
│       └───────────────────────────────────────────────────────┘      │
│                                                                      │
│       ┌───────────────────────────────────────────────────────┐      │
│       │                     GENERATION                        │      │
│       │  Context + Query ──▶ LLM ──▶ Response with Citations  │      │
│       └───────────────────────────────────────────────────────┘      │
│                                                                      │
│       ┌───────────────────────────────────────────────────────┐      │
│       │                     EVALUATION                        │      │
│       │  • Retrieval: Recall@k, Precision@k, MRR              │      │
│       │  • Generation: Faithfulness, Relevance, Helpfulness   │      │
│       │  • Method: LLM-as-a-Judge                             │      │
│       └───────────────────────────────────────────────────────┘      │
│                                                                      │
└──────────────────────────────────────────────────────────────────────┘
```

