-
Notifications
You must be signed in to change notification settings - Fork 13.3k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[FLINK-14316] Properly manage rpcConnection in JobManagerLeaderListener under leader change #11603
Conversation
…er under leader change This commit changes how the rpcConnection is managed in JobManagerLeaderListener under leader change. This component clears now the fields rpcConnection and currentJobMasterId if the leader loses leadership. Moreover, it only restarts a connection attempt if the leader session id is new.
Thanks a lot for your contribution to the Apache Flink project. I'm the @flinkbot. I help the community Automated ChecksLast check on commit 34d7bb3 (Wed Apr 15 11:39:24 UTC 2020) Warnings:
Mention the bot in a comment to re-run the automated checks. Review Progress
Please see the Pull Request Review Guide for a full explanation of the review process. The Bot is tracking the review progress through labels. Labels are applied according to the order of the review items. For consensus, approval by a Flink committer of PMC member is required Bot commandsThe @flinkbot bot supports the following commands:
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
One small comment on logging, otherwise this looks good.
Would have preferred if the refactoring to doesNotReconnectAfterTargetLostLeadership
were done in a separate commit though.
} | ||
// check whether we are already connecting to this leader | ||
if (Objects.equals(jobMasterId, currentJobMasterId)) { | ||
LOG.debug("Trying already connecting to leader of job {}. Ignoring duplicate leader information.", jobId); |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Reads a bit weird? How about "Ongoing attempt to connect to leader of job{}. ..."
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Good point. Will update it.
Thanks for the review @zentol. I've addressed your comment. Merging this PR now. The failing test case seems to be unrelated (blink planner). |
…er under leader change This commit changes how the rpcConnection is managed in JobManagerLeaderListener under leader change. This component clears now the fields rpcConnection and currentJobMasterId if the leader loses leadership. Moreover, it only restarts a connection attempt if the leader session id is new. This closes apache#11603.
…er under leader change This commit changes how the rpcConnection is managed in JobManagerLeaderListener under leader change. This component clears now the fields rpcConnection and currentJobMasterId if the leader loses leadership. Moreover, it only restarts a connection attempt if the leader session id is new. This closes apache#11603.
What is the purpose of the change
This commit changes how the rpcConnection is managed in JobManagerLeaderListener under leader change.
This component clears now the fields rpcConnection and currentJobMasterId if the leader loses leadership.
Moreover, it only restarts a connection attempt if the leader session id is new.
Verifying this change
JobLeaderServiceTest.canReconnectToOldLeaderWithSameLeaderAddress
Does this pull request potentially affect one of the following parts:
@Public(Evolving)
: (no)Documentation