[FLINK-9575]: Remove job-related BLOBS only if the job was removed suce… #6322

Wosin · 2018-07-12T09:30:57Z

What is the purpose of the change

Currently, Flink removes all blobs connected with a job, regardless of whether the job itself was removed successfully. This is not the desired behavior.

Brief change log

Blobs and data are removed only if the job itself was removed successfully.

Verifying this change

This change is a trivial rework / code cleanup without any test coverage.

Does this pull request potentially affect one of the following parts:

Dependencies (does it add or upgrade a dependency): no
The public API, i.e., is any changed class annotated with @Public(Evolving): no
The serializers: no
The runtime per-record code paths (performance sensitive): no
Anything that affects deployment or recovery: JobManager (and its components), Checkpointing, Yarn/Mesos, ZooKeeper: yes
The S3 file system connector: no

Documentation

Does this pull request introduce a new feature? no
If yes, how is the feature documented? not applicable

yanghua · 2018-07-12T12:22:07Z

flink-runtime/src/main/scala/org/apache/flink/runtime/jobmanager/JobManager.scala

+    }

-    jobManagerMetricGroup.removeJob(jobID)



This line can also be removed.

tillrohrmann

Thanks for opening this PR @Wosin. I think we only need to make the blobServer.cleanupJob call dependent on the success of the SubmittedJobGraphStore#removeJobGraph call.

Furthermore, we should do the same in Dispatcher.java:577.

It would be great to add a test for the cleanup behaviour in the Dispatcher.
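The suggestion above can be sketched in Java roughly as follows. This is only an illustration of the idea, not the actual Dispatcher code: `JobGraphStore`, `BlobStore`, and `removeJob` are hypothetical stand-ins for the SubmittedJobGraphStore, the BlobServer, and the Dispatcher's cleanup path, and the real method signatures may differ.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.concurrent.CompletableFuture;

class ConditionalBlobCleanupSketch {

    /** Hypothetical stand-in for SubmittedJobGraphStore. */
    interface JobGraphStore {
        void removeJobGraph(String jobId) throws Exception;
    }

    /** Hypothetical stand-in for the BlobServer (simplified to one argument). */
    interface BlobStore {
        void cleanupJob(String jobId);
    }

    /**
     * Removes the job graph asynchronously. The blobs are cleaned up only
     * if that removal succeeded. The returned future reports the outcome.
     */
    static CompletableFuture<Boolean> removeJob(
            String jobId, JobGraphStore store, BlobStore blobs) {
        return CompletableFuture
            .supplyAsync(() -> {
                try {
                    store.removeJobGraph(jobId);
                    return true;
                } catch (Exception e) {
                    // Removal failed: keep the blobs so the job stays recoverable.
                    return false;
                }
            })
            .thenApply(removed -> {
                if (removed) {
                    blobs.cleanupJob(jobId);
                }
                return removed;
            });
    }

    public static void main(String[] args) {
        List<String> cleaned = new ArrayList<>();
        BlobStore blobs = id -> cleaned.add(id);

        // First removal succeeds, second one fails inside the store.
        removeJob("job-ok", id -> { }, blobs).join();
        removeJob("job-fail", id -> { throw new Exception("store unavailable"); }, blobs).join();

        System.out.println(cleaned); // prints [job-ok]
    }
}
```

A test for the Dispatcher could follow the same shape: use a store that fails on removal and assert that no blob cleanup was recorded.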

tillrohrmann · 2018-07-13T05:43:17Z

flink-runtime/src/main/scala/org/apache/flink/runtime/jobmanager/JobManager.scala

+      case Some(future) => future.onComplete{
+        case scala.util.Success(_) => {
+          libraryCacheManager.unregisterJob(jobID)
+          blobServer.cleanupJob(jobID, removeJobFromStateBackend)


Can't we move this line into the future where we remove the job from the SubmittedJobGraphStore?

Wosin · 2018-07-13

Technically we can, but this changes the return type of the future, since cleanupJob does return a value.

tillrohrmann · 2018-07-13T05:44:01Z

flink-runtime/src/main/scala/org/apache/flink/runtime/jobmanager/JobManager.scala

+        case scala.util.Success(_) => {
+          libraryCacheManager.unregisterJob(jobID)
+          blobServer.cleanupJob(jobID, removeJobFromStateBackend)
+          jobManagerMetricGroup.removeJob(jobID)


I think we could always execute this call, regardless of whether the removal from the SubmittedJobGraphStore was successful.

tillrohrmann · 2018-07-13T05:44:45Z

flink-runtime/src/main/scala/org/apache/flink/runtime/jobmanager/JobManager.scala

+    futureOption match {
+      case Some(future) => future.onComplete{
+        case scala.util.Success(_) => {
+          libraryCacheManager.unregisterJob(jobID)


This call should also be executed if the removal of the job from the SubmittedJobGraphStore failed, because it does not remove any HA files.
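Taken together, the review comments on this hunk suggest splitting the cleanup into steps that always run and a step that runs only after a successful removal. The following Java sketch records that ordering. It is a simplified model rather than the JobManager code: `cleanup` and its `jobGraphRemoved` flag are hypothetical, and the step names are plain strings standing in for the real calls.

```java
import java.util.ArrayList;
import java.util.List;

class CleanupOrderSketch {

    /** Returns the cleanup steps that would run, in order (hypothetical model). */
    static List<String> cleanup(String jobId, boolean jobGraphRemoved) {
        List<String> steps = new ArrayList<>();

        // Per the review, these run whether or not the job graph removal succeeded.
        steps.add("libraryCacheManager.unregisterJob(" + jobId + ")");
        steps.add("jobManagerMetricGroup.removeJob(" + jobId + ")");

        // Blob cleanup runs only once the job graph is gone from the store.
        if (jobGraphRemoved) {
            steps.add("blobServer.cleanupJob(" + jobId + ")");
        }
        return steps;
    }

    public static void main(String[] args) {
        // Successful removal: all three steps are listed.
        System.out.println(cleanup("job-1", true));
        // Failed removal: the blob cleanup step is absent.
        System.out.println(cleanup("job-1", false));
    }
}
```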

Wosin · 2018-07-16T09:31:12Z

Hey,
I think it should be okay now :) If it is, I will squash it into a single commit.

tillrohrmann

Thanks for your contribution @Wosin. LGTM. Merging this PR.

Wosin · 2018-07-18T12:24:35Z

Wait, I have found an issue with my code. I will update the PR accordingly.
Sorry for the trouble.

tillrohrmann · 2018-07-18T12:48:32Z

No worries @Wosin. I've already fixed the problem. You can see it here: tillrohrmann@8b3a849

Wosin · 2018-07-18T12:54:19Z

Ok! Thanks.

FLINK-9575: Remove job-related BLOBS only if the job was removed suce…

1c5febe

…ssfully

yanghua reviewed Jul 12, 2018

Removed empty line.

01a0f10

tillrohrmann requested changes Jul 13, 2018

Changed the dispatcher to remove blobs only if the job was properly r…

1f34f42

…emoved.

Wosin added 4 commits July 17, 2018 09:27

Test if dispatcher removes blobs only on sucessful removal of job.

ffb2864

Added missing import

6084127

Fixed test not to be time dependant.

3456c5c

Fixes for checkstyle.

0d96792

tillrohrmann approved these changes Jul 18, 2018

tillrohrmann pushed a commit to tillrohrmann/flink that referenced this pull request Jul 18, 2018

[FLINK-9575] Remove job-related BLOBS only if the job was removed suc…

74ccb29

…essfully This closes apache#6322.

asfgit pushed a commit that referenced this pull request Jul 18, 2018

[FLINK-9575] Remove job-related BLOBS only if the job was removed suc…

b9fe077

…essfully This closes #6322.

asfgit closed this in f6b2e8c Jul 18, 2018

asfgit pushed a commit that referenced this pull request Jul 18, 2018

[FLINK-9575] Remove job-related BLOBS only if the job was removed suc…

9c4b40d

…essfully This closes #6322.

sampathBhat pushed a commit to sampathBhat/flink that referenced this pull request Jul 26, 2018

[FLINK-9575] Remove job-related BLOBS only if the job was removed suc…

fcf53f9

…essfully This closes apache#6322.

rmetzger added the component=<none> label Mar 18, 2019
