[native] Introduce lastCoordinatorHeartbeatMs to PrestoTask. #22718

spershin · 2024-05-11T01:46:51Z

Description

Right now we use 'lastHeartbeatMs' to determine if a Task has been abandoned by
the Coordinator (Coordinator restarted or crashed or anyhow gone).

This does not work too well, as 'lastHeartbeatMs' is updated on every message to the Task.
Including 'getResults', which workers send to each other and refreshing it.
As the result, the Tasks get abandoned in waves, starting from stage 0, then stage 1, etc.
A large number of stages is possible and with the timeout is 1-3 minutes, that can cause
a large waiting time after the Coordinator long gone.

The new 'lastCoordinatorHeartbeatMs' is only updated by the messages sent by the
Coordinator, which results in all Tasks being cancelled at the same time after just single
timeout period.
It is used to determine if Coordinator has abandoned the Task.

Motivation and Context

Get rid of the Tasks, which were abandoned by the Coordinator, instead of leaving them
running forever.

Test Plan

Orchestrated the restart in the cluster with a long running query.
Observed prompt restart of the workers compared with the previous version.

== NO RELEASE NOTE ==

xiaoxmeng

@spershin thanks for fixing this % comments.

presto-native-execution/presto_cpp/main/TaskManager.h

xiaoxmeng · 2024-05-12T22:25:57Z

presto-native-execution/presto_cpp/main/TaskManager.cpp

@@ -780,6 +784,7 @@ folly::Future<std::unique_ptr<protocol::TaskInfo>> TaskManager::getTaskInfo(
  {
    std::lock_guard<std::mutex> l(prestoTask->mutex);
    prestoTask->updateHeartbeatLocked();
+    prestoTask->updateCoordinatorHeartbeatLocked();


Can we pass a flag to updateHeartbeatLocked(fromCoordinator)?

No, we cannot.
The sites we call these two differ.

presto-native-execution/presto_cpp/main/PrestoTask.h

xiaoxmeng · 2024-05-12T22:27:31Z

presto-native-execution/presto_cpp/main/PrestoTask.h

@@ -95,8 +95,14 @@ struct PrestoTask {
  /// has not been started, until the actual 'create task' message comes.
  bool taskStarted{false};

+  /// Time point (in ms) when the last message (any) came for this task.
  uint64_t lastHeartbeatMs{0};


s/lastHeartbeatMs/lastWorkerHeartbeatMs/

Do we need to track the last heartbeat or message received from worker? Or shall we rename it to lastWorkerMsgTimeMs?

I don't think we should do this renaming in this change.
It is unnecessary and is actually unclear why you are suggesting it.

Do we need to track the last heartbeat or message received from worker? Or shall we rename it to lastWorkerMsgTimeMs?

If we need this - it would be a separate PR.

I am wondering if we need to update lastHeartbeatMs when we receive message from worker? Or we only need to record the heartbeat timestamp from coordinator? Thanks!

I am wondering if we need to update lastHeartbeatMs when we receive message from worker? Or we only need to record the heartbeat timestamp from coordinator? Thanks!

I don't know that - just keeping the existing heartbeat as a status quo in order not to break the current behavior.

What I can do is to follow up by looking at the java code and see if it has something similar and how it is updated.
Not sure what I will have time for it, because it is low pri.

xiaoxmeng · 2024-05-12T22:30:26Z

presto-native-execution/presto_cpp/main/TaskManager.cpp

@@ -468,6 +469,7 @@ std::unique_ptr<TaskInfo> TaskManager::createOrUpdateTask(
  auto prestoTask = findOrCreateTask(taskId, startProcessCpuTime);
  {
    std::lock_guard<std::mutex> l(prestoTask->mutex);
+    prestoTask->updateCoordinatorHeartbeatLocked();


Why we don't update updateCoordinatorHeartbeatLocked in findOrCreateTask? thanks!

Because findOrCreateTask() is called from literally everywhere and we only need to cover the Coord's endpoints.

xiaoxmeng

@spershin thanks for the offline discussion and clear code comments. Please consider to deprecate the old heartbeat ts in followup per our discussion.

It is used to detemrine if Coordinator has abandoned the Task.

xiaoxmeng

@spershin thanks for the test!

spershin requested a review from a team as a code owner May 11, 2024 01:46

spershin requested a review from xiaoxmeng May 11, 2024 01:47

xiaoxmeng reviewed May 12, 2024

View reviewed changes

xiaoxmeng previously approved these changes May 14, 2024

View reviewed changes

spershin dismissed xiaoxmeng’s stale review via ba77fa9 May 14, 2024 05:35

spershin force-pushed the CoordHeartBeat branch from f636e2e to ba77fa9 Compare May 14, 2024 05:35

[native] Introduce lastCoordinatorHeartbeatMs to PrestoTask.

2bb11ae

It is used to detemrine if Coordinator has abandoned the Task.

spershin force-pushed the CoordHeartBeat branch from ba77fa9 to 2bb11ae Compare May 14, 2024 05:46

xiaoxmeng approved these changes May 14, 2024

View reviewed changes

spershin merged commit b15f0c5 into prestodb:master May 14, 2024
59 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[native] Introduce lastCoordinatorHeartbeatMs to PrestoTask. #22718

[native] Introduce lastCoordinatorHeartbeatMs to PrestoTask. #22718

spershin commented May 11, 2024

xiaoxmeng left a comment

xiaoxmeng May 12, 2024

spershin May 13, 2024

xiaoxmeng May 12, 2024

spershin May 13, 2024

xiaoxmeng May 14, 2024

spershin May 14, 2024

xiaoxmeng May 12, 2024

spershin May 13, 2024

xiaoxmeng left a comment

xiaoxmeng left a comment

[native] Introduce lastCoordinatorHeartbeatMs to PrestoTask. #22718

[native] Introduce lastCoordinatorHeartbeatMs to PrestoTask. #22718

Conversation

spershin commented May 11, 2024

Description

Motivation and Context

Test Plan

xiaoxmeng left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

xiaoxmeng left a comment

Choose a reason for hiding this comment

xiaoxmeng left a comment

Choose a reason for hiding this comment