Director of Engineering (ex-Principal → Director; aiming VP). I build reliable, AI-augmented cloud platforms: SRE at scale, Kubernetes on OCI, observability, and data platforms.
- 🎯 Focus: Reliability | K8s/OCI | AI for Ops | Data/Observability
- 📚 Current: Berkeley ML/AI Professional Certificate
- 🧭 Goals: VP of Engineering; leading data & reliability orgs
- ✍️ Writing: case studies & playbooks for incident response, CMDB taxonomy, and self-healing agents
- SaaS Continuity Engine – anomaly detection + runbook automation (LLM-assisted)
- K8s Observability Starter – Prometheus/Grafana/Logs, production defaults
- CMDB CI Classification Framework – pragmatic taxonomy + governance
- VM Self-Recovery Agent (C++) – watchdogs, backoff, health checks
- Org design, roadmap & guardrails • SLOs & error budgets • Trunk-based dev • Change mgmt
- I like measured impact: MTTR ↓, on-call load ↓, adoption ↑
📫 Reach me: akshoy.upadhyay@yahoo.com • [LinkedIn](#)