Senior Cloud Engineer · AWS Community Builder – Containers · NVIDIA-Certified AI Infrastructure
I architect cloud platforms that run at scale. I'm currently at Expedia Group, where I own the Kubernetes infrastructure supporting 430+ clusters, 9,000+ microservices, and production AI/ML workloads on AWS.
current_role: Solutions Architect @ Expedia Group
focus_areas:
- Amazon EKS at scale (Karpenter, ArgoCD, GitOps)
- AI/ML infrastructure (GPU scheduling, inference platforms)
- Multi-account AWS platform engineering
- Cloud migration & disaster recovery (RTO ≤ 5 min)
certifications:
- AWS Community Builder – Containers (2026)
- NVIDIA-Certified: AI Infrastructure & Operations
- AWS Certified Solutions Architect – Associate
- HashiCorp Certified: Terraform Associate

| Repo | Description | Stack |
|---|---|---|
| eks-platform-terraform | Production-ready EKS platform modules with Karpenter, IRSA, multi-account support | Terraform · AWS |
| k8s-karpenter-argocd | GitOps platform using ArgoCD App-of-Apps + Karpenter NodePools | Kubernetes · Helm |
| gpu-ml-kubernetes | GPU-accelerated AI/ML inference on EKS — NVIDIA device plugin, scheduling patterns | YAML · Python |
| aws-migration-toolkit | Scripts & runbooks for large-scale AWS migrations (MGN, DMS, Elastic DR) | Python · Bash · Terraform |
430+ Kubernetes clusters managed across multiple AWS accounts
9,000+ Microservices running on the platform I built and operate
150+ On-prem servers migrated to AWS (RTO ≤ 5 min, RPO ≤ 1 min)
25% Reduction in deployment failures after GitOps transformation
20% Cloud cost reduction through Karpenter + capacity optimization
9+ Years building on AWS
Cloud & Infra
AWS · EKS · EC2 · S3 · RDS · Lambda · VPC · IAM · CloudFormation
Containers & Kubernetes
Kubernetes · Docker · Helm · Karpenter · ArgoCD · ECS/Fargate
Infrastructure as Code
Terraform · CloudFormation · GitOps
Observability
Datadog · Splunk · CloudWatch · DCGM Exporter
AI / ML Infra
GPU Scheduling · NVIDIA Device Plugin · Inference Platforms · Distributed Compute
Languages
Python · Go · Bash · YAML
I write about Kubernetes, AWS, and production platform engineering on Hashnode.
- Running Karpenter at Scale: Lessons from 430+ Clusters
- How We Migrated 150 Servers to AWS with RTO ≤ 5 Minutes
- GPU Scheduling on Kubernetes for AI/ML Inference Workloads
AWS Community Builder – Containers · Bay Area, CA