This repository has been archived by the owner on Feb 1, 2021. It is now read-only.
The default settings for TTLs and node removal on the storage backends are not optimal, and two annoying things can happen:
Failure detection is slow with Consul and Etcd: if the TTL is high, Swarm keeps treating failed nodes as alive until their entries expire.
This other scenario can happen:
Swarm agent registers onto discovery service
Manager sees the new entry: it is added to the list of known agents
Swarm agent fails
Manager removes the agent after the TTL expires
Swarm agent is back to life at the same time or directly after
It may take up to a full Manager heartbeat interval to register that node again, although it is alive and working
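
To make the timing of this scenario concrete, here is a minimal sketch under a simplified model that is an assumption, not Swarm's actual implementation: the manager polls the discovery store at fixed multiples of its heartbeat, and an entry disappears exactly `ttl` seconds after the agent's last refresh. The names `next_poll` and `timeline` are hypothetical.

```python
import math


def next_poll(t, heartbeat):
    """First manager poll at or after time t (assumed polls at 0, heartbeat, 2*heartbeat, ...)."""
    return math.ceil(t / heartbeat) * heartbeat


def timeline(last_refresh, ttl, heartbeat, back_at):
    """Replay the failure/recovery scenario under the assumed polling model.

    last_refresh: time of the agent's last successful refresh before it failed
    ttl:          lifetime of the discovery entry after a refresh
    heartbeat:    manager polling interval
    back_at:      time the agent re-registers (at or after its removal)
    """
    expires_at = last_refresh + ttl
    # The manager only notices the missing entry at its next poll.
    removed_at = next_poll(expires_at, heartbeat)
    # The agent can only be re-added at a poll strictly after the removal poll.
    readded_at = max(next_poll(back_at, heartbeat), removed_at + heartbeat)
    return {
        "expires_at": expires_at,
        "removed_at": removed_at,
        "readded_at": readded_at,
        "seen_alive_after_failure": removed_at - last_refresh,
        "ignored_while_alive": readded_at - back_at,
    }


result = timeline(last_refresh=5, ttl=30, heartbeat=20, back_at=41)
print(result)
```

With these illustrative values, the failed agent is still listed for 35 seconds after its last refresh, and once it comes back it waits another 19 seconds before the manager picks it up again. A backend that pushes change notifications instead of being polled would behave differently.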
One solution could be to lower the default heartbeat and refresh intervals, at the risk of increased pressure on the metadata storage cluster, especially if it is used for both discovery and metadata.
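
The trade-off can be estimated with a rough back-of-the-envelope sketch. It assumes, purely for illustration, that each agent issues one write per refresh interval and each manager one read per heartbeat; the function name `store_ops_per_second` and the cluster sizes are hypothetical.

```python
def store_ops_per_second(nodes, refresh_s, managers, heartbeat_s):
    """Return (writes_per_s, reads_per_s) hitting the discovery store.

    Assumed model: one write per agent per refresh interval,
    one read per manager per heartbeat interval.
    """
    writes = nodes / refresh_s
    reads = managers / heartbeat_s
    return writes, reads


# Illustrative cluster: 1000 agents and 3 managers.
for interval in (20, 2):
    writes, reads = store_ops_per_second(1000, interval, 3, interval)
    print(f"interval={interval}s writes/s={writes:.2f} reads/s={reads:.2f}")
```

Under this model the load scales inversely with the interval, so cutting both intervals by a factor of ten multiplies the request rate on the store by ten.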