Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

📖 Short term accelerated compute instance - GPU node scale down #4402

Open
5 tasks
BrianEllwood opened this issue May 23, 2024 · 1 comment
Open
5 tasks
Labels
data-platform-apps-and-tools This issue is owned by Data Platform Apps and Tools story

Comments

@BrianEllwood
Copy link
Contributor

BrianEllwood commented May 23, 2024

User Story

As a…engineer
I need/want/expect to…Have the Short term accelerated compute instance - GPU node scale down in a timely fashion.
So that…we do not incur unnecessary cost for an idle node

Value / Purpose

Currently this is a restricted release but it would be good to have this configured correctly before going on GA to avoid unnecessary costs

Proposal

Look at the best method for scaling down the GPU node, currently the vscode scheduler has a parameter MAX_IDLE_TIME of 600 (looks like minutes) before closing down an idle pod

Definition of Done

  • GPU node scales down in a timely fashion
  • README has been updated
  • Documentation has been written / updated
  • Another team member has reviewed
  • Tests are green
@BrianEllwood
Copy link
Contributor Author

This ticket came out of the work on this ticket

@AntFMoJ AntFMoJ added the data-platform-apps-and-tools This issue is owned by Data Platform Apps and Tools label May 31, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
data-platform-apps-and-tools This issue is owned by Data Platform Apps and Tools story
Projects
Status: 👀 TODO
Development

No branches or pull requests

2 participants