Skip to content

ci(cd): remove AWS build+deploy jobs (kops decommissioned)#85

Merged
topherwhite merged 2 commits into
stagingfrom
agent/ms4-strip-aws-cd-20260519
May 19, 2026
Merged

ci(cd): remove AWS build+deploy jobs (kops decommissioned)#85
topherwhite merged 2 commits into
stagingfrom
agent/ms4-strip-aws-cd-20260519

Conversation

@topherwhite
Copy link
Copy Markdown
Member

Summary

Removes the AWS-targeted build: (ECR push) and deploy: (kops kubectl apply) jobs from this repo's cd.yaml. The kops production cluster has been declared dead by the operator (2026-05-18 18:55 EDT), and rfcx-local is now the sole production deploy target.

Background

  • Phase 3 (2026-05-18 18:55 EDT) moved the device-upload data plane entirely off AWS — see evity-squibbon/rfcx-local's STATE.md "Phase 3 Status" + "AWS / kops decommission status" blocks.
  • All kops Mandrill keys revoked; staging/prediction/arbimon-jobs-staging namespaces deleted.
  • The AWS build: and deploy: jobs in this workflow either fail noisily (most recent run on rfcx-api) or, once KUBE_CONFIG_SUPER rotates / kops gets torn down, will fail uniformly. They have no useful behaviour going forward.
  • The deploy-rfcx-local: job (added 2026-05-16) does its own in-cluster arm64 build via the self-hosted runner in the rfcx-local cluster's cicd namespace, pushes to the in-cluster registry at 192.168.5.1:30500, and rolls apps-prod Deployments via the runner's RBAC. It has zero dependency on the AWS jobs.

What changes

In .github/workflows/cd.yaml:

  • Removed build: job (uses rfcx/cicd/.github/workflows/ecr-build-push.yaml@master)
  • Removed deploy: job (uses rfcx/cicd/.github/workflows/k8s-deploy.yaml@master)
  • Updated notify.needs: to depend only on [prepare, deploy-rfcx-local]
  • Updated notify status to report deploy-rfcx-local.result
  • Updated notify notification-footer to surface only the rfcx-local result

prepare: and configure: are kept (still needed for deploy-rfcx-local's if: namespace == 'production' gate and for notify metadata). staging is left in on.push.branches (no-op on staging push — deploy-rfcx-local is gated to production — but preserves the staging-promotion PR mechanism).

What's unchanged

deploy-rfcx-local: is byte-identical. So is concurrency:. No new secrets, no removed inputs.

Validation plan

  • Push to staging first (via this PR landing): runs prepare → configure, skips deploy-rfcx-local (namespace=staging). Should complete successfully with no AWS calls.
  • Once merged to master via the companion staging → master PR: triggers the rfcx-local CD path only. Should match recent successful runs.

Rollback

Revert this PR. The AWS jobs come back. Note: this only restores the workflow definition — kops production itself is dead regardless, so the AWS jobs will fail until kops comes back (which is not the plan).

Related

  • evity-squibbon/rfcx-local PR / branch: agent/ms4-strip-aws-cd-20260519 (this same branch name across all 7 rfcx-org repos this week).
  • Companion PRs in rfcx/{rfcx-api, ingest-service, guardian-api, guardian-dashboard, arbimon, arbimon-legacy}.

topherwhite and others added 2 commits May 16, 2026 08:22
Mirrors AWS deploy; reuses rfcx/cicd/.github/workflows/rfcx-local-cd.yaml@master.
See rfcx/rfcx-api commit fc1b78ff for the pattern + initial rollout proof.
The kops production cluster has been declared dead by the operator
(2026-05-18 18:55 EDT). The AWS-targeted `build:` (ECR push) and
`deploy:` (kubectl against KUBE_CONFIG_SUPER) jobs have been
failing-or-soon-to-fail since, and rfcx-local has been the
authoritative production deploy target.

This commit:
- Drops the `build:` job (uses `rfcx/cicd/ecr-build-push.yaml`)
- Drops the `deploy:` job (uses `rfcx/cicd/k8s-deploy.yaml`)
- Updates `notify.needs` to depend only on `deploy-rfcx-local`
- Updates notify status/footer to surface the rfcx-local result

`deploy-rfcx-local` is unchanged: it does its own in-cluster
arm64 build via the self-hosted runner in the `cicd` namespace,
pushes to the in-cluster registry at 192.168.5.1:30500, and rolls
`apps-prod` Deployments via the runner's RBAC. It has no
dependency on the AWS `build:`/`deploy:` jobs.

`prepare:` and `configure:` are kept (still needed for the
branch-name gate on `deploy-rfcx-local` and for notify metadata).

`staging` is left in the on.push.branches trigger; with AWS gone
it's a no-op on staging push (deploy-rfcx-local gates on
namespace==production), which preserves the staging-promotion-PR
workflow.

See https://github.com/evity-squibbon/rfcx-local STATE.md
"AWS / kops decommission status" block for context.
@topherwhite topherwhite merged commit 1cfb663 into staging May 19, 2026
@topherwhite topherwhite deleted the agent/ms4-strip-aws-cd-20260519 branch May 19, 2026 06:12
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant