-
Notifications
You must be signed in to change notification settings - Fork 137
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Use the conf of shuffleNodesNumber from jobs to be as checking factor #208
Conversation
Codecov Report
@@ Coverage Diff @@
## master #208 +/- ##
============================================
+ Coverage 59.02% 59.17% +0.14%
- Complexity 1326 1332 +6
============================================
Files 160 160
Lines 8699 8732 +33
Branches 815 819 +4
============================================
+ Hits 5135 5167 +32
+ Misses 3301 3300 -1
- Partials 263 265 +2
📣 We’re building smart automated test selection to slash your CI/CD build times. Learn more |
PTAL @jerqi |
@@ -99,13 +103,17 @@ private boolean tryAccessCluster() { | |||
long retryInterval = sparkConf.get(RssSparkConfig.RSS_CLIENT_ACCESS_RETRY_INTERVAL_MS); | |||
int retryTimes = sparkConf.get(RssSparkConfig.RSS_CLIENT_ACCESS_RETRY_TIMES); | |||
|
|||
int assignmentShuffleNodesNum = sparkConf.get(RssSparkConfig.RSS_CLIENT_ASSIGNMENT_SHUFFLE_SERVER_NUMBER); |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Why do we only modify spark3?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Sorry, I forgot to modify in spark2
|
The conf of But in current implementation of For exampleavailable shuffle nodes number = 20 The |
I got it. |
But the |
You mean we should remove the check of |
Origin logic don't check |
Got your thought. But the conf of For the equality of every job, I wont set it and hope the checker can decide whether to reject depending on the required shuffle servers number of jobs. I thinks this is reasonable, it will still reserve the hard limitation and keep the flexibility |
Should we give a option of coordinator to decide whether to use the required shuffle server of client for users? |
Not necessary. If the conf value of |
If I want to use |
Oh. Sorry for not explaining clearly.
|
Like discussion at dev mail list, we will freeze the code and cut 0.6 version branch in September 15, we will not merge this pr before I cut 0.6 version branch, are you ok? |
Got it |
Ping @jerqi |
Should we add more documents for this feature? |
Added. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM, thanks @zuston
Thanks for your patient review @jerqi |
What changes were proposed in this pull request?
Use the conf of shuffleNodesNumber from jobs to be as checking factor
Why are the changes needed?
In the PR #97 , it allow client to specify the shuffle server numbers, but in clusterLoaderChecker, it dont take this into considering. In this PR, it will use the conf of shuffleNodesNumber from jobs to be as checking factor only when the conf of
rss.coordinator.access.loadChecker.serverNum.threshold
is missed.Does this PR introduce any user-facing change?
No.
How was this patch tested?
UTs.