Removal of init_myservice initContainer #412
After speaking to @Bobbins228, I tried out the local interactive demo notebook and unfortunately I am seeing this error. I'm not sure what it means yet, but I'm looking into it. @tedhtchang, would you have any insight here? Thanks
When local_interactive is true, SSL communication is enabled between the worker and head nodes. This requires all nodes to have SSL certificates, which is what the create-cert initContainer provides. I suspect this was the reason for this error.
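As a rough illustration of the requirement described above, the following sketch checks whether every pod group in a RayCluster-style spec carries the certificate-creating initContainer. It assumes the spec is a plain dict laid out in the KubeRay style (`headGroupSpec`, `workerGroupSpecs`, `template.spec.initContainers`); the function name `pods_missing_init_container` is hypothetical and is not part of the SDK.

```python
def pods_missing_init_container(ray_cluster_spec, name="create-cert"):
    """Return labels of pod groups that lack an initContainer called `name`.

    Assumption: `ray_cluster_spec` is the `spec` section of a RayCluster
    resource, already parsed into nested dicts and lists.
    """
    # Collect the head group and every worker group under a readable label.
    groups = [("head", ray_cluster_spec.get("headGroupSpec", {}))]
    for index, worker in enumerate(ray_cluster_spec.get("workerGroupSpecs", [])):
        groups.append((f"worker[{index}]", worker))

    missing = []
    for label, group in groups:
        pod_spec = group.get("template", {}).get("spec", {})
        # `initContainers` may be absent or None; treat both as empty.
        init_names = {c.get("name") for c in pod_spec.get("initContainers") or []}
        if name not in init_names:
            missing.append(label)
    return missing
```

An empty result would suggest that every node can obtain a certificate before Ray starts; a non-empty one points at the pod groups worth inspecting first.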
Just a few pieces on this.
We should reintroduce the init containers needed for the local interactive use case, add the old deletion logic back, and update it to delete just those init containers.
If they are added back to the base template, we shouldn't need the extra local interactive template.
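The suggestion above could look roughly like the sketch below: the base template keeps the init containers, and they are stripped only when `local_interactive` is false. This is a minimal sketch, not the SDK's actual code; the helper name, the constant, and the assumption that `create-cert` is the only container involved are all hypothetical.

```python
# Assumption: names of the init containers needed only for local interactive mode.
LOCAL_INTERACTIVE_INIT_CONTAINERS = {"create-cert"}


def strip_local_interactive_init_containers(pod_spec, local_interactive):
    """Remove the local-interactive init containers from a pod spec dict.

    The pod spec is modified in place and returned. When `local_interactive`
    is true the spec is left untouched, so a single base template can serve
    both cases.
    """
    if local_interactive:
        return pod_spec

    kept = [
        container
        for container in pod_spec.get("initContainers") or []
        if container.get("name") not in LOCAL_INTERACTIVE_INIT_CONTAINERS
    ]
    if kept:
        pod_spec["initContainers"] = kept
    else:
        # Drop the key entirely rather than leaving an empty list behind.
        pod_spec.pop("initContainers", None)
    return pod_spec
```

Filtering by name, rather than deleting the whole `initContainers` list, leaves any unrelated init containers in place.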
Thanks @Bobbins228, I have made the changes as discussed. Please review again when you have time.
One small comment
/lgtm Awesome stuff Fiona
I tested local_interactive notebook on a KinD cluster. /LGTM
/approve
[APPROVALNOTIFIER] This PR is APPROVED. This pull request has been approved by: astefanutti, Bobbins228, tedhtchang
Merged commit b4f19db into project-codeflare:main
Issue link
Jira Issue
Closes #297
What changes have been made
Removal of the init_myservice initContainer from the Ray worker group spec, because the operator also injects a wait-gcs-ready initContainer, which does a similar thing.
Verification steps
Install the CodeFlare SDK on a cluster, then run the test notebooks and ensure that they work correctly.
Run the unit tests and ensure that they pass.