-
Notifications
You must be signed in to change notification settings - Fork 8
Add support for IKS and ROKS clusters on classic infra #336
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
|
/run pipeline |
|
@bhpratt Have you tried to deploy locally on a classic cluster? The test failed with below error, but no way to know why unless you locally deploy and check cluster logs |
|
Yes - ran local tests. Ran again today and all succeeded, including I did notice that the classic cluster defaults to one node, so I added a field to the example to bump that to two nodes in case there was resource contention for the pods. |
|
/run pipeline |
|
This error: Happens when the private vlan doesn't get cleaned up quickly enough (auto-created vlans are also auto-deleted). IKS detects it and won't create a new VLAN. I can add some stanzas to manually create (and destroy) vlans. |
|
Added static VLANs so VLAN clean-up will be controlled explicitly by terraform |
|
/run pipeline |
|
I can't re-create this error locally. I've run this 6-7 times w/no errors. Which makes me think there's some pipeline configuration that I'm not accounting for. One thing I can think of - Sysdig containers sometimes relies on connectivity to sysdig.com (if it needs to pull down a pre-compiled kernel modules). LogDNA doesn't have this requirement. So far the pipeline is only failing on the Sysdig deployment - and only on the classic cluster test. Is there possibly any classic infra firewall or security groups in this pipeline account that could be impacting public connectivity for the classic clusters? It appears to be timing out on the deploy of the sysdig agent: I could try switching data centers from syd01 to something else, as well - but that would just be trying some shots in the dark. I think I'm going to need some assistance to figure out this test failure. |
|
/run pipeline |
|
sysdig pods readiness failing with: Something seems to be blocking the connection to host: Debugging.. |
|
/run pipeline |
|
🎉 This issue has been resolved in version 1.27.0 🎉 The release is available on GitHub release Your semantic-release bot 📦🚀 |

Description
Currently the module agents only run on VPC infra clusters. Upon review, this is due to the name lookup. So, the main change is to add a block to allow the module to look up the name of a classic cluster:
This implementation requires a new variable:
is_vpc_clusterthat will be toggled by the user. Default istrue.Additional changes were to add a new test case and update example to test a classic cluster.
Outstanding questions:
I included some new blocks and updated fields inexamples/basicto avoid duplicating lots of terraform blocks in a new example. Is this ok, or should I split this out intoexamples/classicand duplicate the observability blocks?Tests dynamically choose region based on the usage in the account for VPC. These regions don't match directly to classic data centers. Right nowTestRunBasicAgentsClassicuses the defaultsyd01data center. Do I need to update this behavior?Discussed above w/Conall. Will keep as is.
Release required?
x.x.X)x.X.x)X.x.x)Release notes content
Run the pipeline
If the CI pipeline doesn't run when you create the PR, the PR requires a user with GitHub collaborators access to run the pipeline.
Run the CI pipeline when the PR is ready for review and you expect tests to pass. Add a comment to the PR with the following text:
Checklist for reviewers
For mergers