-
Notifications
You must be signed in to change notification settings - Fork 25.6k
[skip-ci] .github: Set linux gpu instances to be non-ephemeral #67345
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
Was hitting capacity issues, setting these to non-ephemeral would mean keeping the current capacity at the expense of "unclean" nodes Signed-off-by: Eli Uriegas <eliuriegas@fb.com> [ghstack-poisoned]
CI Flow Status⚛️ CI FlowRuleset - Version:
You can add a comment to the PR and tag @pytorchbot with the following commands: # ciflow rerun, "ciflow/default" will always be added automatically
@pytorchbot ciflow rerun
# ciflow rerun with additional labels "-l <ciflow/label_name>", which is equivalent to adding these labels manually and trigger the rerun
@pytorchbot ciflow rerun -l ciflow/scheduled -l ciflow/slow For more information, please take a look at the CI Flow Wiki. |
🔗 Helpful links
💊 CI failures summary and remediationsAs of commit 31f5b10 (more details on the Dr. CI page): 💚 💚 Looks good so far! There are no failures yet. 💚 💚 This comment was automatically generated by Dr. CI (expand for details).Please report bugs/suggestions to the (internal) Dr. CI Users group. |
@seemethere has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM. Just wondering why linux gpu instances in particular (I guess gpu takes up a lot of resource)? Do we have other gpu instances that should be converted to non-ephemeral as well?
Current support is only for linux instances, we can expand support to windows instances if we deem it necessary later on down the line |
Plz consider adding |
@seemethere merged this pull request in 0101b1e. |
Stack from ghstack:
Was hitting capacity issues, setting these to non-ephemeral would mean
keeping the current capacity at the expense of "unclean" nodes
Signed-off-by: Eli Uriegas eliuriegas@fb.com
Differential Revision: D31965477