fix (ecs): Unstable deployments for task definitions with more than one container (#5544) #4411

atyutyunnik · 2020-03-11T03:59:57Z

With the original code, a deployment of a task definition with more than one container never finishes and eventually times out, even though the task has been successfully deployed by ECS and is running (spinnaker/spinnaker#5544).

        "containers": [
            {
                "networkBindings": [],
                "networkInterfaces": []
            },
            {
                "lastStatus": "RUNNING",
                "networkBindings": [
                    {
                        "bindIP": "0.0.0.0",
                        "containerPort": 8080,
                        "hostPort": 32773,
                        "protocol": "tcp"
                    }
                ]
            }
        ]
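
The excerpt above shows a task whose first container has no network bindings while the second one does. The diff itself is not part of this thread, so the following is only a minimal sketch of the general idea, assuming the health check needs a host port: look for the first container that actually has a binding rather than assuming the first container in the list has one. `ContainerPortLookup`, `Container`, `NetworkBinding`, and `firstHostPort` are hypothetical stand-ins, not the AWS SDK or Spinnaker classes.

```java
import java.util.List;
import java.util.Optional;

// Hypothetical, simplified stand-ins for the ECS task/container model.
// These are NOT the AWS SDK or Spinnaker classes.
class ContainerPortLookup {

    static final class NetworkBinding {
        final String bindIP;
        final int containerPort;
        final int hostPort;
        final String protocol;

        NetworkBinding(String bindIP, int containerPort, int hostPort, String protocol) {
            this.bindIP = bindIP;
            this.containerPort = containerPort;
            this.hostPort = hostPort;
            this.protocol = protocol;
        }
    }

    static final class Container {
        final String lastStatus;
        final List<NetworkBinding> networkBindings;

        Container(String lastStatus, List<NetworkBinding> networkBindings) {
            this.lastStatus = lastStatus;
            this.networkBindings = networkBindings;
        }
    }

    // Return the host port of the first container that has a network binding,
    // instead of assuming containers.get(0) has one.
    static Optional<Integer> firstHostPort(List<Container> containers) {
        return containers.stream()
            .filter(c -> c.networkBindings != null && !c.networkBindings.isEmpty())
            .map(c -> c.networkBindings.get(0).hostPort)
            .findFirst();
    }

    public static void main(String[] args) {
        // Same shape as the excerpt: first container without bindings, second with one.
        List<Container> containers = List.of(
            new Container(null, List.of()),
            new Container("RUNNING",
                List.of(new NetworkBinding("0.0.0.0", 8080, 32773, "tcp"))));

        System.out.println(firstHostPort(containers)); // prints Optional[32773]
    }
}
```

Returning an `Optional` keeps the "no container has a binding yet" case explicit, so a caller can treat it as "health unknown" instead of failing.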

ezimanyi · 2020-03-11T18:51:05Z

@atyutyunnik: Thanks for the PR! In general, our process is to open fixes against the master branch and then cherry-pick them back to the releases that need them. Can you please open this against the master branch? Thanks!

atyutyunnik · 2020-03-11T19:06:18Z

@ezimanyi, yes, I already have (see PR 4409), but that change is not source-compatible with release 1.18, hence this additional PR.

ezimanyi · 2020-03-11T19:08:34Z

Oh, thanks, I misunderstood! If the code has changed enough that it can't be cleanly cherry-picked to 1.18, then it's fine to open this against 1.18 to manually back-port the fix.

It will be important to cherry-pick the master change to 1.19 though, as otherwise users upgrading from 1.18 to 1.19 will get re-broken.

allisaurus · 2020-03-12T22:33:25Z

@atyutyunnik can you please add the unit test from #4409 to this PR as well?


atyutyunnik · 2020-03-23T14:08:09Z

> @atyutyunnik can you please add the unit test from #4409 to this PR as well?

@allisaurus, cherry-picking doesn't seem possible. I had to extend the unit test with:
...
then: amazonloadBalancing.describeTargetHealth({ DescribeTargetHealthRequest request ->
...

Otherwise, there would be an NPE at line 332 of TaskHealthCachingAgent.java:

if (describeTargetHealthResult.getTargetHealthDescriptions().isEmpty()) {

Now the unit test is passing. Please review.

allisaurus

Thanks @atyutyunnik , you're right, there was a behavior change between 1.18.x and 1.19.x that makes mocking the call to describeTargetHealth necessary here. With that addition this LGTM!
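
The NPE discussed above is what one would expect when a mocked client returns `null` for a call that has not been stubbed and the code under test dereferences the result directly. The following is a minimal sketch of that situation, not the real test or agent code: `LoadBalancingClient`, `TargetHealthResult`, and `hasNoTargetHealth` are hypothetical stand-ins, and the hand-written lambdas play the role of the unstubbed and stubbed mock.

```java
import java.util.Collections;
import java.util.List;

// Hypothetical stand-ins; not the AWS SDK types used by TaskHealthCachingAgent.
class TargetHealthStubSketch {

    static final class TargetHealthResult {
        private final List<String> descriptions;

        TargetHealthResult(List<String> descriptions) {
            this.descriptions = descriptions;
        }

        List<String> getTargetHealthDescriptions() {
            return descriptions;
        }
    }

    interface LoadBalancingClient {
        TargetHealthResult describeTargetHealth(String targetGroupArn);
    }

    // Same shape as the quoted check: the result is dereferenced without a null guard.
    static boolean hasNoTargetHealth(LoadBalancingClient client, String targetGroupArn) {
        TargetHealthResult result = client.describeTargetHealth(targetGroupArn);
        return result.getTargetHealthDescriptions().isEmpty();
    }

    public static void main(String[] args) {
        // Assumption: an unstubbed mock hands back null for an object-returning call.
        LoadBalancingClient unstubbed = arn -> null;
        // A stubbed call returns a real (here: empty) result object.
        LoadBalancingClient stubbed = arn -> new TargetHealthResult(Collections.emptyList());

        System.out.println(hasNoTargetHealth(stubbed, "hypothetical-arn")); // prints true

        try {
            hasNoTargetHealth(unstubbed, "hypothetical-arn");
        } catch (NullPointerException e) {
            System.out.println("NPE without a stubbed result");
        }
    }
}
```

Stubbing the call in the test keeps the production check unchanged; a null guard in the production code would be a separate behavior change and is not something this thread proposes.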

atyutyunnik changed the title ~~fix (ecs): Unstable deployments when one container~~ fix (ecs): Unstable deployments when one task defs has more than container Mar 11, 2020

atyutyunnik changed the title ~~fix (ecs): Unstable deployments when one task defs has more than container~~ fix (ecs): Unstable deployments for task definitions with more than one container (#5544) Mar 11, 2020

ezimanyi requested review from allisaurus and clareliguori March 11, 2020 19:07

allisaurus mentioned this pull request Mar 17, 2020

ECS: Unstable deployment when there are more than one container in task definition spinnaker/spinnaker#5544

Closed

fix (ecs): Unstable deployments when there are more than one containe…

eec0c29

…r in task definition (spinnaker#5544)

atyutyunnik force-pushed the release-1.18.x branch from 275cf8d to eec0c29 Compare March 23, 2020 13:58

Merge branch 'release-1.18.x' into release-1.18.x

4f3de7e

allisaurus approved these changes Mar 23, 2020


ezimanyi merged commit ed9533c into spinnaker:release-1.18.x Mar 23, 2020

spinnakerbot added the target-release/1.18 label Mar 23, 2020

allisaurus mentioned this pull request Apr 20, 2020

REQUEST: New Approver status for allisaurus spinnaker/governance#120

Closed

6 tasks
