
Add -ness checks and refactor migrations #1674

Merged
merged 1 commit into ansible:devel on Mar 6, 2024

Conversation

dhageman
Contributor

SUMMARY

This PR implements liveness and readiness probes supporting the split web/task container configuration. It also moves database migrations into the init container of the task pod.

Addresses #414 & #926
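For context, a minimal sketch of what such probes can look like on the split containers. The container names, endpoint, ports, commands, and timings here are illustrative assumptions, not the operator's actual defaults from this PR:

```yaml
# Illustrative sketch only -- names, ports, and timings are assumptions.
containers:
  - name: awx-web
    readinessProbe:              # gate traffic until the web service responds
      httpGet:
        path: /api/v2/ping/      # assumed health endpoint
        port: 8052
      initialDelaySeconds: 30
      periodSeconds: 10
    livenessProbe:               # restart the container if it stops responding
      httpGet:
        path: /api/v2/ping/
        port: 8052
      periodSeconds: 30
  - name: awx-task
    livenessProbe:               # no HTTP server in the task container; use a command
      exec:
        command: ["/bin/sh", "-c", "ps -ef | grep -q '[d]ispatcher'"]  # hypothetical check
      periodSeconds: 30
```

The key difference between the two probe types: a failing readiness probe only removes the pod from service endpoints, while a failing liveness probe triggers a container restart.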

ISSUE TYPE
  • New or Enhanced Feature
ADDITIONAL INFORMATION

It should be acknowledged that this is not the first work on implementing these features.

@dhageman dhageman force-pushed the inesscheck branch 5 times, most recently from e49c674 to 2f1f75b Compare February 23, 2024 02:35
@rooftopcellist
Member

I just ran through the code one more time. The k8s job gets created as expected.


It then runs to completion. The logs for the migration container show that the migrations have all been completed, as can be seen in the output of:

$ oc logs awx-migration-23.9.0-6vwzm

---

- name: Check for pending migrations
Member


👍
This task ensures that the k8s job is only run if there are new schema changes to be migrated.

And the unique name, based on the version and a random hash, ensures that there are no conflicts with other k8s jobs from previous migrations.
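A sketch of how such a gate can work: Django's `migrate --check` exits non-zero when unapplied migrations exist, so the job creation can be made conditional on that result. The variable names, template path, and use of `kubernetes.core.k8s_exec` below are assumptions for illustration, not the PR's exact implementation:

```yaml
# Sketch of the gating idea, assuming the task pod is already running.
- name: Check for pending migrations
  kubernetes.core.k8s_exec:
    namespace: "{{ ansible_operator_meta.namespace }}"
    pod: "{{ awx_task_pod_name }}"                      # hypothetical variable
    container: "{{ ansible_operator_meta.name }}-task"
    command: awx-manage migrate --check
  register: migration_check
  failed_when: false          # a non-zero exit just means migrations are pending

- name: Create migration job
  k8s:
    state: present
    definition: "{{ lookup('template', 'migration_job.yaml.j2') }}"  # hypothetical template
  when: migration_check.return_code | default(0) != 0
```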

{{ lookup("template", "../common/templates/labels/version.yaml.j2") | indent(width=4) | trim }}
spec:
template:
spec:
Member


@dhageman One thought I had is that we might want to be mindful of cleaning up these migration jobs in case a deployment gets in a bad state and winds up creating them in a loop after the timeout is reached. The job containers are automatically cleaned up after 3600 seconds, which is pretty reasonable.

If we need to, we can adjust this via .spec.ttlSecondsAfterFinished. Though that is probably a good default to start with since it strikes a nice balance between "not cluttering things" and having logs stick around long enough for debugging.
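For reference, `ttlSecondsAfterFinished` sits directly on the Job spec; once the Job finishes, the TTL controller deletes it and its pods after that many seconds. The image, command, and hash suffix below are illustrative assumptions; only the field placement is the point:

```yaml
# Illustrative Job manifest -- image, command, and name suffix are assumptions.
apiVersion: batch/v1
kind: Job
metadata:
  name: awx-migration-23.9.0-abcde     # version + random hash (illustrative)
spec:
  ttlSecondsAfterFinished: 3600        # clean up one hour after completion
  template:
    spec:
      restartPolicy: Never
      containers:
        - name: migration
          image: quay.io/ansible/awx:23.9.0          # illustrative image
          command: ["awx-manage", "migrate", "--noinput"]
```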

Contributor Author


You make an excellent point. There is definitely a balance there.

@rooftopcellist rooftopcellist merged commit ffba1b4 into ansible:devel Mar 6, 2024
6 checks passed
@rooftopcellist
Member

@dhageman thank you for your time and effort to develop this solution. I think it is robust and will be a good pattern.

@erz4
Contributor

erz4 commented Mar 6, 2024

The table in container-probes.md was missing its table formatting; I fixed that in #1748. @dhageman


5 participants