A collection of concise, practical notes on things I learn daily across DevOps, cloud infrastructure, system design, and software engineering. Each entry is a deep-dive reference designed for quick review and interview preparation.
System Design (25 entries)
Core concepts for system design interviews: distributed systems, databases, networking, and architecture patterns.
| Topic | Key Concepts |
|---|---|
| CAP Theorem | CP vs AP, PACELC, real-world databases |
| Consistent Hashing | Virtual nodes, replication, implementation |
| Rate Limiting | Token bucket, sliding window, distributed with Redis |
| Database Sharding | Strategies, shard key selection, resharding |
| Database Indexing | B-Tree, composite, covering, partial indexes |
| Caching Strategies | Cache-aside, stampede prevention, multi-level |
| Load Balancing | L4 vs L7, algorithms, GSLB, health checks |
| Event-Driven Architecture | Outbox pattern, DLQ, choreography vs orchestration |
| Microservices Communication | REST vs gRPC, circuit breaker, bulkhead |
| Distributed Consensus | Raft, Paxos, quorum, practical usage |
| Saga Pattern | Orchestration vs choreography, compensation |
| Message Queues vs Streams | SQS vs Kafka, when to use which |
| API Design | REST, pagination, versioning, error handling |
| Back-of-Envelope Estimation | Essential numbers, estimation framework |
| URL Shortener Design | Capacity planning, architecture, caching |
| Circuit Breaker | States, implementation, when to use |
| CQRS | Command/query separation |
| Event Sourcing | Event log, state reconstruction |
| Bloom Filters | Probabilistic data structure |
| Distributed Locking | Redis, ZooKeeper, fencing tokens |
| Leader Election | Bully, Raft-based, ZooKeeper |
| CDN Architecture | Edge caching, cache invalidation |
| SQL vs NoSQL | Decision guide |
| Back Pressure | Flow control in distributed systems |
| Twelve-Factor App | Cloud-native application methodology |
Kubernetes (8 entries)
Production Kubernetes operations, networking, and troubleshooting.
| Topic | Key Concepts |
|---|---|
| Networking & Services | CNI, NetworkPolicy, Ingress, DNS |
| Autoscaling (HPA/VPA/KEDA) | All 4 autoscalers, Karpenter |
| Resource Requests & Limits | QoS classes, right-sizing, LimitRange |
| Troubleshooting Guide | Decision trees, debugging commands |
| Helm Charts | Package management for K8s |
| RBAC | Roles, ClusterRoles, ServiceAccounts |
| Pod Disruption Budgets | Safe eviction policies |
| Resource Management | Quotas, limits, scheduling |
AWS (12 entries)
| Topic | Key Concepts |
|---|---|
| ECS vs EKS vs Lambda | Decision framework, cost comparison |
| DynamoDB Design Patterns | Single-table, GSI, write sharding, streams |
| IAM & Least Privilege | Policy evaluation, ABAC, IRSA |
| VPC & Networking | VPC design, Transit Gateway |
| Lambda Cold Starts | Optimization strategies |
| S3 Storage Classes | Lifecycle policies, cost optimization |
| Aurora vs RDS | When to choose which |
| EventBridge | Event routing, rules, targets |
| Step Functions | Workflow orchestration |
| Secrets Manager | Secret rotation |
| CloudFront | CDN, edge functions |
| Organizations | Multi-account strategy |
DevOps & SRE (12 entries)
| Topic | Key Concepts |
|---|---|
| SRE Golden Signals | USE/RED, SLI/SLO/SLA, alerting |
| Deployment Strategies | Canary, blue-green, Argo Rollouts |
| Observability | Metrics, logs, traces, OpenTelemetry |
| Incident Management | Response framework, post-mortem |
| GitOps | ArgoCD, Flux, principles |
| CI/CD Best Practices | IaC comparison |
| Chaos Engineering | Principles and tools |
| Feature Flags | Strategies and implementation |
| Container Security | Best practices |
| Zero Trust Security | Architecture |
| SLI/SLO/SLA | Error budgets |
| Infrastructure Testing | Testing pyramid |
Total: 105+ entries across 12 categories. Updated regularly.
AI & Machine Learning (15 entries)
Deep dives into modern AI: LLMs, agents, RAG, and production AI systems.
| Topic | Key Concepts |
|---|---|
| Transformer Architecture | Self-attention, multi-head, encoder vs decoder |
| Prompt Engineering | Few-shot, CoT, ReAct, temperature tuning |
| RAG | Chunking, embeddings, vector search, hybrid retrieval |
| AI Agents & Tool Use | ReAct loop, function calling, multi-agent |
| LangChain & LangGraph | LCEL, stateful workflows, human-in-the-loop |
| Vector Databases | HNSW, IVF, Pinecone vs Qdrant, similarity search |
| Fine-Tuning vs RAG | Decision framework, LoRA, QLoRA |
| MCP Protocol | Tools, resources, prompts, server implementation |
| LLM Evaluation | RAGAS, LLM-as-judge, CI testing |
| AI Safety & Guardrails | Prompt injection, output filtering, NeMo |
| Multi-Agent Systems | CrewAI, orchestration patterns, collaboration |
| Tokenization & Context | BPE, context limits, cost management |
| LLMOps | Latency, caching, model routing, monitoring |
| Structured Output | JSON mode, Pydantic, Instructor, parallel calls |
| Open Source LLMs | vLLM, Ollama, quantization, K8s deployment |