Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

allow not stop process in error #97

Merged
merged 2 commits into from
Jun 25, 2019
Merged

Conversation

CodingCat
Copy link
Member

@CodingCat CodingCat commented Jun 25, 2019

part of the original #95 which has involved #96 as well

in some distributed environment like Spark, multiple workers co-locate in the same process. For anyone of them to fail, it should not stop the whole process and take down the other co-locating workers

this PR essentially makes this behavior as an option instead of the only one in rabit

@CodingCat
Copy link
Member Author

merge this, as it is already got reviewed

@CodingCat CodingCat merged commit fc85f77 into master Jun 25, 2019
@CodingCat CodingCat deleted the allow_not_stop_process_2 branch June 25, 2019 20:04
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

1 participant