Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Fix the logic to calculate minResources #543

Merged
merged 1 commit into from
Apr 4, 2023

Conversation

tenzen-y
Copy link
Member

@tenzen-y tenzen-y commented Apr 4, 2023

I fixed the logic to calculate minResources so that calculatePGMinResource treats the launcher as a replica of higher priority when we don't set priorityClasses.

I faced the issue at #540 (comment).

Background:
In the current implementation, if the launcher and workers have the same priority, calculatePGMinResource randomly selects prioritized replicas. This means the launcher might be treated as a lower priority than the worker replica when we don't set priorityClass in both replicas.

@tenzen-y
Copy link
Member Author

tenzen-y commented Apr 4, 2023

/assign @alculquicondor

Signed-off-by: Yuki Iwai <yuki.iwai.tz@gmail.com>
@tenzen-y
Copy link
Member Author

tenzen-y commented Apr 4, 2023

Rebased.

@alculquicondor
Copy link
Collaborator

/lgtm
/approve

@google-oss-prow google-oss-prow bot added the lgtm label Apr 4, 2023
@google-oss-prow
Copy link

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: alculquicondor

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@google-oss-prow google-oss-prow bot merged commit 03bba1f into kubeflow:master Apr 4, 2023
@tenzen-y tenzen-y deleted the fix-gang-scheduling-bug branch April 4, 2023 20:17
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants