Tasks stuck in cancelling if connection is lost to the server #3388
Comments
Should error be added to the
Release Note: Tasks no longer get stuck in a running or cancelling state if there is an intermittent database connection problem.
If the server cannot connect to the database when updating a task's status to Finished, the following message is logged:
Unable to mark task ServerTasks-206 as complete
When the server regains access to the database, the task still appears to be running, even though the underlying task thread has finished.
When the Cancel button is clicked, the task goes into the `Cancelling` state, which prompts each server node to cancel the task thread. Since no node can find a thread with that task id, the task never gets updated to `Cancelled`. The trick is that each server node does not know whether the thread was running on itself or on another node (in an HA config).
We should detect whether the node owned the task, and if so, move it to cancelled.
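A minimal sketch of that ownership check, written in Python purely for illustration. Every name here (`TaskRecord`, `owner_node`, `ServerNode`, `handle_cancel_request`) is hypothetical and not taken from the server's code; it assumes the task record stores which node started the task.

```python
from dataclasses import dataclass, field
from enum import Enum


class TaskState(Enum):
    EXECUTING = "Executing"
    CANCELLING = "Cancelling"
    CANCELLED = "Cancelled"


@dataclass
class TaskRecord:
    task_id: str
    owner_node: str  # assumption: the record remembers which node started the task
    state: TaskState


@dataclass
class ServerNode:
    name: str
    # Task ids that currently have a live thread on this node.
    running_threads: set = field(default_factory=set)

    def handle_cancel_request(self, task: TaskRecord) -> bool:
        """Return True if this node resolved the cancellation."""
        if task.state is not TaskState.CANCELLING:
            return False
        if task.owner_node != self.name:
            # Another node owned the task, so leave it for that node.
            return False
        # Stand-in for signalling the thread; a no-op if it already finished.
        self.running_threads.discard(task.task_id)
        # The owner can safely finalise the state even with no live thread.
        task.state = TaskState.CANCELLED
        return True
```

With this shape, a node that did not own the task ignores the request, while the owning node moves the task to `Cancelled` even when the thread has already exited.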
We should also see if we can try harder to set the final status, for example by retrying the update when the database is temporarily unreachable.
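One way to "try harder" might be a bounded retry with exponential backoff around the status write. The sketch below is in Python and is only an illustration: `DatabaseUnavailable`, `write_status`, and `mark_complete_with_retry` are made-up names, and the attempt count and delays are arbitrary assumptions.

```python
import time


class DatabaseUnavailable(Exception):
    """Hypothetical error raised when the database cannot be reached."""


def mark_complete_with_retry(write_status, task_id, attempts=5,
                             base_delay=0.5, sleep=time.sleep):
    """Try to persist the final status, backing off between failures.

    write_status is a caller-supplied function (task_id, status) that
    raises DatabaseUnavailable on connection problems.
    Returns True once the write succeeds, False if all attempts fail.
    """
    for attempt in range(1, attempts + 1):
        try:
            write_status(task_id, "Finished")
            return True
        except DatabaseUnavailable:
            if attempt == attempts:
                return False
            # Delay doubles each time: base_delay, 2x, 4x, ...
            sleep(base_delay * 2 ** (attempt - 1))
    return False
```

If every attempt fails, the caller still needs a fallback, such as the ownership check, since the task would otherwise remain marked as running.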