[FLINK-18533][coordination] Tolerate pending deployments being reported #12948
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Fixes an issue where a task would be canceled if a task executor reported the execution via heartbeats without the acknowledgement yet being processed by the JobMaster. This can happen because the Acknowledge was lost, or due to message re-ordering on the JobMaster side.
The
ExecutionDeployment(Tracker/Reconciler)now distinguish between PENDING/DEPLOYED executions.For reconciliation purposes executions in a PENDING state are ignored.
Executions are moved into a deployed state once the acknowledge by the TaskExecutor has been processed by the JobMaster.
If the task is never acknowledged, then, as before, the task is failed by the JobMaster, and then removed from the tracker (due to terminal state transition, as before).