[FLINK-11850][zk] Tolerate concurrent child deletions when deleting owned zNode #7928

tillrohrmann · 2019-03-07T12:35:34Z

What is the purpose of the change

When calling ZooKeeperHaServices#closeAndCleanupAllData it can happen that a child of the owned
zNode of the ZooKeeperHaServices is being concurrently deleted (e.g. a LeaderElectionService has
been shut down). In order to tolerate concurrent deletions, we use now ZKPaths#deleteChildren.

Verifying this change

Covered by existing ZooKeeperHaServicesTest

Does this pull request potentially affect one of the following parts:

Dependencies (does it add or upgrade a dependency): (no)
The public API, i.e., is any changed class annotated with @Public(Evolving): (no)
The serializers: (no)
The runtime per-record code paths (performance sensitive): (no)
Anything that affects deployment or recovery: JobManager (and its components), Checkpointing, Yarn/Mesos, ZooKeeper: (yes)
The S3 file system connector: (no)

Documentation

Does this pull request introduce a new feature? (no)
If yes, how is the feature documented? (not applicable)

flinkbot · 2019-03-07T12:36:21Z

Thanks a lot for your contribution to the Apache Flink project. I'm the @flinkbot. I help the community
to review your pull request. We will use this comment to track the progress of the review.

Review Progress

✅ 1. The [description] looks good.
- Approved by @zentol [PMC]
✅ 2. There is [consensus] that the contribution should go into to Flink.
- Approved by @zentol [PMC]
❓ 3. Needs [attention] from.
✅ 4. The change fits into the overall [architecture].
- Approved by @zentol [PMC]
✅ 5. Overall code [quality] is good.
- Approved by @zentol [PMC]

Please see the Pull Request Review Guide for a full explanation of the review process.

The Bot is tracking the review progress through labels. Labels are applied according to the order of the review items. For consensus, approval by a Flink committer of PMC member is required

Bot commands

The @flinkbot bot supports the following commands:

@flinkbot approve description to approve one or more aspects (aspects: description, consensus, architecture and quality)
@flinkbot approve all to approve all aspects
@flinkbot approve-until architecture to approve everything until architecture
@flinkbot attention @username1 [@username2 ..] to require somebody's attention
@flinkbot disapprove architecture to remove an approval you gave earlier

tisonkun

Looks good to me :-)

zentol · 2019-03-07T13:31:28Z

Do you have a reference for the supposed behavior (failing if a child is deleted) that is not caused by ZKPaths.deleteChildren?

The only related issue I could only find is https://issues.apache.org/jira/browse/CURATOR-430.
Based on this client.delete().deletingChildrenIfNeeded().forPath("/") calls ZKPaths.deleteChildren internally (and the source seems to confirm that).
This issue was fixed by curator within ZKPaths.deleteChildren as well. (pr)
The fix was not backported to the version we're using.

tillrohrmann · 2019-03-07T14:19:39Z

I think you are right @zentol. Our curator version 2.12.0 does not fix the problem and it looked only like this when running the tests a couple of times. I think the only solution atm is to handle this exception at the call site and retry the operation until it succeeds. I will update the PR accordingly.

tisonkun · 2019-03-07T14:23:35Z

@tillrohrmann isn't it an option that we bump our shaded curator version?

tillrohrmann · 2019-03-07T15:31:56Z

@tisonkun I wouldn't do this so close to the actual release. Rather, I prefer to do this after the 1.8 release to give it a bit more exposure.

zentol

1 minor comment.

@flinkbot approve all

zentol · 2019-03-07T16:04:54Z

...e/src/main/java/org/apache/flink/runtime/highavailability/zookeeper/ZooKeeperHaServices.java

+				client.delete().deletingChildrenIfNeeded().forPath("/");
+				zNodeDeleted = true;
+			} catch (KeeperException.NoNodeException ignored) {
+				// concurrent delete operation. Try again.


we could log this on debug just in case.

True, will add it.

…ission This commit changes the cleanup logic of the Dispatcher to only clean up job HA files if the job is not a duplicate (meaning that it is either running or has already been executed by the same JobMaster). This closes apache#7918.

…nd MiniCluster The io executor is responsible for running io operations like discarding checkpoints. By using the io executor, we don't risk that the RpcService is blocked by blocking io operations. This closes apache#7924.

tillrohrmann · 2019-03-07T18:24:22Z

Thanks for the review @tisonkun and @zentol. Addressing Chesnay's last comment and then merging this PR.

…wned zNode When calling ZooKeeperHaServices#closeAndCleanupAllData it can happen that a child of the owned zNode of the ZooKeeperHaServices is being concurrently deleted (e.g. a LeaderElectionService has been shut down). In order to tolerate concurrent deletions, we use now ZKPaths#deleteChildren. This closes apache#7928.

rmetzger added the review=description? label Mar 7, 2019

tillrohrmann mentioned this pull request Mar 7, 2019

[BP-1.8][FLINK-11850][zk] Tolerate concurrent child deletions when deleting owned zNode #7929

Merged

tisonkun approved these changes Mar 7, 2019

View reviewed changes

zentol self-assigned this Mar 7, 2019

zentol approved these changes Mar 7, 2019

View reviewed changes

tillrohrmann added 2 commits March 7, 2019 19:19

tillrohrmann force-pushed the FLINK-11850 branch from f255af3 to b464df2 Compare March 7, 2019 18:27

asfgit merged commit b464df2 into apache:master Mar 7, 2019

tillrohrmann deleted the FLINK-11850 branch March 7, 2019 21:52

rmetzger added review=approved ✅ component=Runtime/Coordination component=Tests and removed review=description? labels Mar 13, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[FLINK-11850][zk] Tolerate concurrent child deletions when deleting owned zNode #7928

[FLINK-11850][zk] Tolerate concurrent child deletions when deleting owned zNode #7928

tillrohrmann commented Mar 7, 2019

flinkbot commented Mar 7, 2019 •

edited

tisonkun left a comment

zentol commented Mar 7, 2019

tillrohrmann commented Mar 7, 2019

tisonkun commented Mar 7, 2019

tillrohrmann commented Mar 7, 2019

zentol left a comment

zentol Mar 7, 2019

tillrohrmann Mar 7, 2019

tillrohrmann commented Mar 7, 2019

[FLINK-11850][zk] Tolerate concurrent child deletions when deleting owned zNode #7928

[FLINK-11850][zk] Tolerate concurrent child deletions when deleting owned zNode #7928

Conversation

tillrohrmann commented Mar 7, 2019

What is the purpose of the change

Verifying this change

Does this pull request potentially affect one of the following parts:

Documentation

flinkbot commented Mar 7, 2019 • edited

Review Progress

tisonkun left a comment

Choose a reason for hiding this comment

zentol commented Mar 7, 2019

tillrohrmann commented Mar 7, 2019

tisonkun commented Mar 7, 2019

tillrohrmann commented Mar 7, 2019

zentol left a comment

Choose a reason for hiding this comment

zentol Mar 7, 2019

Choose a reason for hiding this comment

tillrohrmann Mar 7, 2019

Choose a reason for hiding this comment

tillrohrmann commented Mar 7, 2019

flinkbot commented Mar 7, 2019 •

edited