-
Notifications
You must be signed in to change notification settings - Fork 74k
/
coordination_config.proto
46 lines (36 loc) · 1.93 KB
/
coordination_config.proto
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
syntax = "proto3";
package tensorflow;
option go_package = "github.com/tensorflow/tensorflow/tensorflow/go/core/protobuf/for_core_protos_go_proto";
// Coordination service configuration parameters.
// The system picks appropriate values for fields that are not set.
message CoordinationServiceConfig {
// Type of coordination service implementation to enable.
// For example, setting the service type as "standalone" starts a service
// instance on the leader task to provide the coordination services such as
// heartbeats and consistent key-value store.
string service_type = 1;
// Address where the coordination service instance is hosted.
string service_leader = 2;
// Whether to enable the health check mechanism.
bool enable_health_check = 3;
// Maximum wait time for all members in the cluster to be registered.
int64 cluster_register_timeout_in_ms = 4;
// Heartbeat timeout, if a task does not record heartbeat in this time
// window, it will be considered disconnected.
// Note: This is also used as a grace period to accept any heartbeats after
// the agent has disconnected, to account for the lag time between the service
// recording the state change and the agent stopping heartbeats.
int64 heartbeat_timeout_in_ms = 5;
// The list of jobs that partipate in the coordination service. If empty, all
// jobs will be included in the coordination service by default.
repeated string coordinated_jobs = 6;
// Denotes how long to wait for all coordination agents to reach the barriers
// (after the first shutdown request) before disconnecting together. If
// set to 0, no barrier is imposed upon shutdown and each worker can
// disconnect individually.
int64 shutdown_barrier_timeout_in_ms = 7;
// If set, agents do not make an explicit Shutdown() call. Service will only
// find out about the disconnecte agent via stale heartbeats. Used for
// testing.
bool agent_destruction_without_shutdown = 8;
}