Skip to content

Agentless Autohealing

DeWayne Filppi edited this page Aug 28, 2018 · 7 revisions

Agentless Autohealing

One of the Cloudify manager's distinctive features is the ability to perform automated day 2+ operational tasks. These typically boil down to analyzing an event stream produced by agents, and reacting by running workflows. Auto-healing is one such task. A metric is identified as a trigger, and when it fails to be delivered to the manager for a period of time, the heal is triggered. This post is about adding a capability to Cloudify that emulates the simpler liveness probes style functionality found elsewhere (e.g. Kubernetes).

Clone this wiki locally