Skip to content

[SPARK-24931][CORE]CoarseGrainedExecutorBackend send wrong 'Reason' w…#21891

Closed
bingbai0912 wants to merge 2 commits into
apache:masterfrom
bingbai0912:master
Closed

[SPARK-24931][CORE]CoarseGrainedExecutorBackend send wrong 'Reason' w…#21891
bingbai0912 wants to merge 2 commits into
apache:masterfrom
bingbai0912:master

Conversation

@bingbai0912
Copy link
Copy Markdown

TaskSetManager## What changes were proposed in this pull request?

When CoarseGrainedExecutorBackend find the executor not available, it will send a "RemoveExecutor" message of "ExecutorExited" instead "ExecutorLossReason". So it call tell driver whether is the executor "exitCausedByApp" which should be false. So when dirver(TaskSetManager) can "handleFailedTask" correctly to avoid task failed time up to the "maxTaskFailures" and finally cause job failed.

How was this patch tested?

tested in my own cluster

…hen executor exits which leading to job failed.
@AmplabJenkins
Copy link
Copy Markdown

Can one of the admins verify this patch?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants