Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[SPARK-13778][Core]Set the executor state for a worker when removing it #11609

Closed
wants to merge 1 commit into from
Closed

[SPARK-13778][Core]Set the executor state for a worker when removing it #11609

wants to merge 1 commit into from

Conversation

zsxwing
Copy link
Member

@zsxwing zsxwing commented Mar 9, 2016

What changes were proposed in this pull request?

When a worker is lost, the executors on this worker are also lost. But Master's ApplicationPage still displays their states as running.

This patch just sets the executor state to LOST when a worker is lost.

How was this patch tested?

manual tests

@zsxwing
Copy link
Member Author

zsxwing commented Mar 9, 2016

cc @yhuai @andrewor14

@SparkQA
Copy link

SparkQA commented Mar 9, 2016

Test build #52761 has finished for PR 11609 at commit ff75d65.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@yhuai
Copy link
Contributor

yhuai commented Mar 9, 2016

What will be a good way to test it?

@zsxwing
Copy link
Member Author

zsxwing commented Mar 9, 2016

What will be a good way to test it?

Manual test? Just start a standalone cluster and spark-shell, then kill the workers. Before this patch, you will see executors in the application page are still in the running state.

@yhuai
Copy link
Contributor

yhuai commented Mar 9, 2016

OK Thanks. Is it easy to add a test in the pr?

@zsxwing
Copy link
Member Author

zsxwing commented Mar 9, 2016

OK Thanks. Is it easy to add a test in the pr?

Pretty hard. Master is not friendly to unit tests. We can refactor it to make it testable but that's out of this PR's scope.

@yhuai
Copy link
Contributor

yhuai commented Mar 9, 2016

OK. Thanks.

@yhuai
Copy link
Contributor

yhuai commented Mar 9, 2016

LGTM

@andrewor14
Copy link
Contributor

Merged into master.

@asfgit asfgit closed this in 40e0676 Mar 10, 2016
@zsxwing zsxwing deleted the SPARK-13778 branch March 10, 2016 18:08
roygao94 pushed a commit to roygao94/spark that referenced this pull request Mar 22, 2016
## What changes were proposed in this pull request?

When a worker is lost, the executors on this worker are also lost. But Master's ApplicationPage still displays their states as running.

This patch just sets the executor state to `LOST` when a worker is lost.

## How was this patch tested?

manual tests

Author: Shixiong Zhu <shixiong@databricks.com>

Closes apache#11609 from zsxwing/SPARK-13778.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
4 participants