Launch script multislice #629
Comments
For (1) SGTM, let me know if I can help. Something simple that just runs |
In my experience when an instance is preempted (suspended), then you cannot create another one with the same name. The suspended instance is going to stay there and take up the quota. |
I think you are all talking about slightly different things. If the node is
still up, we want to reuse it. Otherwise, we have to delete the node and queued
resource and recreate them.
(Russell, this is one of the main reasons I do not like queued resources…)
On Wed, Jun 12, 2024 at 4:50 PM Jason Wang wrote:
> In my experience when an instance is preempted (suspended), then you
> cannot create another one with the same name. The suspended instance is
> going to stay there and take up the quota.
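The reuse-or-recreate rule described in this exchange can be sketched as a small planning function. This is only an illustration: the `NodeState` labels, the step names, and `plan_actions` itself are hypothetical and are not taken from launch.py or from any cloud API.

```python
from enum import Enum
from typing import List


class NodeState(Enum):
    # Hypothetical labels for illustration; not the cloud API's actual values.
    READY = "ready"          # node is still up
    PREEMPTED = "preempted"  # node was preempted (suspended) but still exists
    MISSING = "missing"      # no node with this name exists


def plan_actions(node_state: NodeState, has_queued_resource: bool) -> List[str]:
    """Return the ordered steps needed to get a usable node under a fixed name."""
    if node_state is NodeState.READY:
        # The node is still up, so reuse it.
        return ["reuse"]

    steps: List[str] = []
    if node_state is NodeState.PREEMPTED:
        # Per the thread, a preempted node keeps its name and quota until deleted.
        steps.append("delete_node")
    if has_queued_resource:
        # The stale queued resource has to go before a new one is requested.
        steps.append("delete_queued_resource")
    steps.append("create_queued_resource")
    return steps
```

Keeping the decision separate from the calls that actually delete or create resources makes the rule easy to check without touching a real project.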
|
@rjpower is there a way to specify which branch to run? |
Not specifically. The script packages whatever is in your current
directory, so you can just check out the branch you want to run.
On Thu, Jun 13, 2024, 2:33 PM Jason Wang wrote:
> @rjpower <https://github.com/rjpower> is there a way to specify which
> branch to run?
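Since the script packages the current directory, the branch that runs is whatever is checked out at launch time. A minimal sketch of a pre-launch check follows, assuming `git` is on the PATH; the helper names `branch_matches`, `current_branch`, and `require_branch` are hypothetical and not part of launch.py.

```python
import subprocess


def branch_matches(head_output: str, expected: str) -> bool:
    """Compare raw `git rev-parse --abbrev-ref HEAD` output to a branch name."""
    return head_output.strip() == expected


def current_branch(repo_dir: str = ".") -> str:
    """Return the name of the branch checked out in repo_dir."""
    result = subprocess.run(
        ["git", "rev-parse", "--abbrev-ref", "HEAD"],
        cwd=repo_dir,
        check=True,
        capture_output=True,
        text=True,
    )
    return result.stdout.strip()


def require_branch(expected: str, repo_dir: str = ".") -> None:
    """Raise if the working tree is not on the expected branch."""
    actual = current_branch(repo_dir)
    if not branch_matches(actual, expected):
        raise RuntimeError(f"on branch {actual!r}, expected {expected!r}")
```

Calling `require_branch("my-branch")` before launching would catch the case where the wrong branch is about to be packaged.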
|
Things I'm going to add to launch.py tpu-vm to queued-resources to work better with multislice training