[FLINK-1492] Fix exceptions on blob store shutdown #376

Closed
wants to merge 1 commit into from

Conversation

uce
Contributor

@uce uce commented Feb 9, 2015

No description provided.

@rmetzger
Contributor

rmetzger commented Feb 9, 2015

This change is also in 0.8 so do we need to apply the fix there as well for the upcoming 0.8.1 release?

@uce
Contributor Author

uce commented Feb 9, 2015

Yes, if it is finally OK.

@rmetzger
Contributor

Looks good.

}
}
try {
join();
Contributor

What does this join do? I'm at a loss here.

Contributor Author

It's from the old code and I am not sure if it really needs to stay, but it ensures that the BlobServer thread has really finished by the time the shutdown method returns. BlobServer is a Thread, and because the join is called from outside of the run method, the caller waits for the BlobServer thread to finish.
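
To illustrate the point about the join, here is a minimal, self-contained sketch. It is not the BlobServer code: `JoinOnShutdownSketch`, `AcceptLoop`, and `startAndShutdown` are hypothetical names, and the accept loop is only an assumption about what such a server thread typically does. It shows a shutdown method that closes the socket and then joins, so the caller only continues once the server thread has ended.

```java
import java.io.IOException;
import java.net.ServerSocket;
import java.net.Socket;

// Hypothetical stand-in for a server that is itself a Thread; not Flink's BlobServer.
public class JoinOnShutdownSketch {

    static class AcceptLoop extends Thread {
        private final ServerSocket serverSocket;
        private volatile boolean shutdownRequested = false;

        AcceptLoop() throws IOException {
            this.serverSocket = new ServerSocket(0); // bind to any free port
        }

        @Override
        public void run() {
            while (!shutdownRequested) {
                try (Socket ignored = serverSocket.accept()) {
                    // A real server would handle the connection here.
                } catch (IOException e) {
                    // Closing the socket in shutdown() ends up here;
                    // the loop condition then terminates the thread.
                }
            }
        }

        // Must be called from a thread other than this one,
        // otherwise join() would wait on itself forever.
        void shutdown() throws InterruptedException {
            shutdownRequested = true;
            try {
                serverSocket.close(); // unblocks a pending accept()
            } catch (IOException e) {
                // Nothing useful to do during shutdown in this sketch.
            }
            join(); // the caller blocks until run() has returned
        }
    }

    // Starts the server thread, shuts it down, and reports whether it is still alive.
    public static boolean startAndShutdown() {
        try {
            AcceptLoop server = new AcceptLoop();
            server.start();
            server.shutdown();
            return server.isAlive();
        } catch (Exception e) {
            throw new IllegalStateException(e);
        }
    }

    public static void main(String[] args) {
        System.out.println("alive after shutdown: " + startAndShutdown());
    }
}
```

Without the `join()`, `shutdown()` could return while the thread is still unwinding from `accept()`, which is the situation the existing call guards against.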

Contributor

Ah of course. Thanks for the clarification.

@tillrohrmann
Contributor

LGTM except for my single question.

@rmetzger
Contributor

I'll merge the change.

@uce
Contributor Author

uce commented Feb 10, 2015

OK, thanks. Just a reminder: we need to include this in 0.8.1 as well.

@asfgit asfgit closed this in b88f909 Feb 10, 2015
@mxm
Contributor

mxm commented Feb 11, 2015

I still get

05:26:08,841 ERROR org.apache.flink.runtime.blob.BlobServer  - Error during shutdown of blob service via JVM shutdown hook: Shutdown in progress
java.lang.IllegalStateException: Shutdown in progress
    at java.lang.ApplicationShutdownHooks.remove(ApplicationShutdownHooks.java:82)
    at java.lang.Runtime.removeShutdownHook(Runtime.java:239)
    at org.apache.flink.runtime.blob.BlobServer.shutdown(BlobServer.java:220)
    at org.apache.flink.runtime.blob.BlobUtils$1.run(BlobUtils.java:210)
    at java.lang.Thread.run(Thread.java:745)

on last night's master.

@uce
Contributor Author

uce commented Feb 11, 2015

OK, thanks for reporting this. @StephanEwen has some pending changes to the blob manager and he will look into it as well.

It is not allowed to remove the shutdown hook when a shutdown is already in progress.
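
For reference, a minimal sketch of one way to avoid this exception. This is not the code from this pull request or from any follow-up branch: `ShutdownHookSketch` and its members are hypothetical names. It relies only on the documented behavior that `Runtime.removeShutdownHook` throws `IllegalStateException` once the JVM is already shutting down, so the removal is skipped when running inside the hook and the exception is tolerated otherwise.

```java
// Hypothetical service that registers a JVM shutdown hook; not Flink's BlobServer.
public class ShutdownHookSketch {

    private final Thread shutdownHook;
    private boolean shutDown = false;

    public ShutdownHookSketch() {
        // The hook simply calls the regular shutdown method.
        this.shutdownHook = new Thread(this::shutdown, "sketch-shutdown-hook");
        Runtime.getRuntime().addShutdownHook(shutdownHook);
    }

    public synchronized void shutdown() {
        if (shutDown) {
            return; // already shut down, nothing to do
        }
        shutDown = true;

        // Resource cleanup (sockets, temp files, ...) would go here.

        // Only try to deregister the hook when we are not running inside it.
        if (Thread.currentThread() != shutdownHook) {
            try {
                Runtime.getRuntime().removeShutdownHook(shutdownHook);
            } catch (IllegalStateException e) {
                // The JVM is already shutting down; hooks can no longer be
                // removed, and this hook will find shutDown == true when it runs.
            }
        }
    }

    public synchronized boolean isShutDown() {
        return shutDown;
    }

    public static void main(String[] args) {
        ShutdownHookSketch service = new ShutdownHookSketch();
        service.shutdown(); // explicit shutdown removes the hook
        service.shutdown(); // second call is a no-op
    }
}
```

The identity check covers the trace above, where shutdown is invoked from the hook itself; the `catch` covers an explicit shutdown call that happens to race with JVM exit.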

@uce
Contributor Author

uce commented Feb 11, 2015

The problem is totally deterministic when shutting down the job/task managers.

I've fixed it here: https://github.com/uce/incubator-flink/tree/flink-1492-fix_exceptions_really

@StephanEwen, if this is blocking the release vote, we should just merge my change and not wait for your changes in master, which touch the blob store.

@uce uce deleted the flink-1492-proper_shutdown_hook branch February 23, 2015 09:17
marthavk pushed a commit to marthavk/flink that referenced this pull request Jun 9, 2015