[Feature] Add job spec in raycluster CRD #106
Comments
We should make sure the job format is compatible with https://docs.ray.io/en/master/ray-job-submission/overview.html
@pcmoritz Any further follow-ups on job-level support at the Kubernetes layer at Anyscale?
If we build the solution on top of https://docs.ray.io/en/latest/ray-job-submission/overview.html, that means
We've talked about 2a in the past (https://docs.google.com/document/d/1aKet8Zt8FLeZvsJGJeF2G_-9u2_UWf-AXgNRSvSQxcI/edit). Does it sound like a reasonable path? In this case, the operator does more than we originally expected. The submit options can probably be reused in the CRDs.
I prefer the
I will create a patch to finish this feature.
Following up on the previous discussion in the meeting: how about this alternative, which decouples the job spec from the cluster CR? Before:
After (two YAMLs, but they can be stitched together):
This forces a separate controller to manage the RayJob CR. It also supports separate versioning of RayJob and RayCluster, since RayJob is evolving faster than RayCluster. The RayJob controller should also use the REST API for job submission and poll the job's status to ensure completion.
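The submit-then-poll behaviour described above could look roughly like the following. This is only a sketch, not the KubeRay controller (which is written in Go): the function names are hypothetical, and the `/api/jobs/` path, the `submission_id`/`job_id` response fields, and the set of terminal status strings are assumptions about the Ray job submission REST API that should be checked against the Ray version in use.

```python
import json
import time
import urllib.request
from typing import Callable

# Assumed terminal job states; verify against the Ray job submission docs.
TERMINAL_STATES = {"SUCCEEDED", "FAILED", "STOPPED"}


def http_submit(base_url: str, entrypoint: str) -> str:
    """Submit a job to the head node's dashboard (endpoint path is assumed)."""
    req = urllib.request.Request(
        f"{base_url}/api/jobs/",
        data=json.dumps({"entrypoint": entrypoint}).encode(),
        headers={"Content-Type": "application/json"},
        method="POST",
    )
    with urllib.request.urlopen(req) as resp:
        body = json.load(resp)
    # The ID field name is an assumption and may differ between Ray versions.
    return body.get("submission_id") or body["job_id"]


def http_status(base_url: str, job_id: str) -> str:
    """Fetch the current status string of a job (endpoint path is assumed)."""
    with urllib.request.urlopen(f"{base_url}/api/jobs/{job_id}") as resp:
        return json.load(resp)["status"]


def run_to_completion(
    submit: Callable[[], str],
    get_status: Callable[[str], str],
    poll_interval: float = 5.0,
    timeout: float = 3600.0,
    sleep: Callable[[float], None] = time.sleep,
) -> str:
    """Submit once, then poll until the job reaches a terminal state."""
    job_id = submit()
    deadline = time.monotonic() + timeout
    while True:
        status = get_status(job_id)
        if status in TERMINAL_STATES:
            return status
        if time.monotonic() >= deadline:
            raise TimeoutError(f"job {job_id} still {status} after {timeout}s")
        sleep(poll_interval)
```

The transport is passed in as callables so the loop can be exercised without a cluster; against a real head node one would pass something like `lambda: http_submit(url, "python script.py")` and `lambda jid: http_status(url, jid)`, where `url` is the dashboard address.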
I have written a new design doc: https://docs.google.com/document/d/1z8IBoc0yWAPDe01Im2zKDg6NVkLSwj580Y5g4RwGLqg/edit?usp=sharing
Looks like we are going with the separate CR and separate controller approach? @Jeffwan are you ok with this?
cc @akanso cc @brucez-anyscale
I missed the message. Yes, as we discussed in the community meeting: separate controllers manage their own CRDs, and we bake everything into a single operator.
@harryge00 @Jeffwan @simon-mo @edoakes @brucez-anyscale @shrekris-anyscale I think it would be prudent to drop this item from the 0.3.0 release, given that the implementation is in draft form. Is that fine? We can include this in a future release.
The PR is merged and we can close this issue.
Search before asking
Description
For now, we can use the raycluster CR to set up a Ray cluster, and the head node contains a dashboard that can handle job submission. We've been talking about the same issue recently. We could support starting a job when the Ray cluster is ready. Maybe it would look like a Flink job cluster.
@Jeffwan @pcmoritz @DmitriGekhtman WDYT?
Use case
Create a Ray cluster and run a job automatically.
Related issues
No response
Are you willing to submit a PR?