Former master reports "failed to reassign persistent tasks" repeatedly #58531

DaveCTurner · 2020-06-25T09:56:54Z

Following a slightly messy master election I saw a 7.6.1 cluster with the two unelected master nodes both reporting this message every 30 seconds apparently indefinitely:

failed to reassign persistent tasks
org.elasticsearch.cluster.NotMasterException: no longer master. source: [reassign persistent tasks]

Possibly there's a race in the PersistentTasksClusterService around election time? Not sure, I am not very familiar with this area.

The text was updated successfully, but these errors were encountered:

elasticmachine · 2020-06-25T09:56:55Z

Pinging @elastic/es-distributed (:Distributed/Task Management)

droberts195 · 2020-06-25T10:33:11Z

This is a silly bug. If a persistent task cannot be immediately reassigned and has caused periodic rechecks to be scheduled (on the master node at that time) then those rechecks continue to be rescheduled regardless of whether the node is still the master. I will open a PR to fix it.

If a persistent task cannot be assigned on the first attempt then the master node will schedule periodic rechecks to see if the assignment requirements have been met. These periodic rechecks should be cancelled if the node ceases to be master. Previously they weren't, leading to exceptions being logged repeatedly. This PR cancels the rechecks on learning that the node is no longer the master. Fixes elastic#58531

If a persistent task cannot be assigned on the first attempt then the master node will schedule periodic rechecks to see if the assignment requirements have been met. These periodic rechecks should be cancelled if the node ceases to be master. Previously they weren't, leading to exceptions being logged repeatedly. This PR cancels the rechecks on learning that the node is no longer the master. Fixes #58531

DaveCTurner added >bug :Distributed/Task Management Issues for anything around the Tasks API - both persistent and node level. labels Jun 25, 2020

elasticmachine added the Team:Distributed Meta label for distributed team label Jun 25, 2020

droberts195 self-assigned this Jun 25, 2020

droberts195 mentioned this issue Jun 25, 2020

Cancel persistent task recheck when no longer master #58539

Merged

droberts195 closed this as completed in #58539 Jun 25, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Former master reports "failed to reassign persistent tasks" repeatedly #58531

Former master reports "failed to reassign persistent tasks" repeatedly #58531

DaveCTurner commented Jun 25, 2020

elasticmachine commented Jun 25, 2020

droberts195 commented Jun 25, 2020

Former master reports "failed to reassign persistent tasks" repeatedly #58531

Former master reports "failed to reassign persistent tasks" repeatedly #58531

Comments

DaveCTurner commented Jun 25, 2020

elasticmachine commented Jun 25, 2020

droberts195 commented Jun 25, 2020