
fix: separate out the grace_period for ALB and NLB #4734

Merged
mergify[bot] merged 11 commits into aws:mainline from paragbhingre:multipleport_gracePeriod
Apr 25, 2023

Conversation

@paragbhingre
Contributor

@paragbhingre paragbhingre commented Apr 5, 2023

Previously, the ALB's and NLB's grace periods were both set from `http`'s `grace_period` manifest field. This PR separates the field out so that ALB and NLB each have their own.
For ALB, the `grace_period` field used to live under `http.healthcheck`, but the grace period is a service-level setting, so it belongs under `http` rather than `http.healthcheck`. We have fixed that as well.

By submitting this pull request, I confirm that you can use, modify, copy, and redistribute this contribution, under the Apache 2.0 License.

@paragbhingre paragbhingre requested a review from a team as a code owner April 5, 2023 18:42
@paragbhingre paragbhingre requested review from efekarakus and removed request for a team April 5, 2023 18:42
@github-actions

github-actions Bot commented Apr 5, 2023

🍕 Here are the new binary sizes!

| Name | New size (kiB) | Previous size (kiB) | Delta (%) |
| --- | --- | --- | --- |
| macOS (amd) | 50552 | 50328 | +0.45 |
| macOS (arm) | 50760 | 50516 | +0.48 |
| linux (amd) | 44500 | 44304 | +0.44 |
| linux (arm) | 42820 | 42564 | +0.60 |
| windows (amd) | 41376 | 41204 | +0.42 |

@codecov-commenter

codecov-commenter commented Apr 5, 2023

Codecov Report

Merging #4734 (d490361) into mainline (0ecf04e) will increase coverage by 0.02%.
The diff coverage is 82.05%.

```
@@             Coverage Diff              @@
##           mainline    #4734      +/-   ##
============================================
+ Coverage     69.93%   69.95%   +0.02%
============================================
  Files           284      284
  Lines         40782    40856      +74
  Branches        272      272
============================================
+ Hits          28519    28581      +62
- Misses        10885    10894       +9
- Partials       1378     1381       +3
```
| Impacted Files | Coverage Δ |
| --- | --- |
| internal/pkg/manifest/svc.go | 67.98% <ø> (ø) |
| internal/pkg/template/workload.go | 52.42% <ø> (ø) |
| internal/pkg/manifest/errors.go | 51.57% <33.33%> (-2.64%) ⬇️ |
| ...al/pkg/deploy/cloudformation/stack/transformers.go | 87.73% <64.70%> (-0.41%) ⬇️ |
| ...nal/pkg/deploy/cloudformation/stack/backend_svc.go | 83.98% <100.00%> (ø) |
| ...rnal/pkg/deploy/cloudformation/stack/lb_web_svc.go | 82.49% <100.00%> (ø) |
| internal/pkg/manifest/validate.go | 79.91% <100.00%> (+0.56%) ⬆️ |

... and 1 file with indirect coverage changes


Comment thread internal/pkg/manifest/http.go Outdated
```go
AdditionalRoutingRules   []RoutingRule  `yaml:"additional_rules"`
Main                     RoutingRule    `yaml:",inline"`
TargetContainerCamelCase *string        `yaml:"targetContainer"` // Deprecated. Maintained for backwards compatibility, use [RoutingRule.TargetContainer] instead.
GracePeriod              *time.Duration `yaml:"grace_period"`
```
Contributor

@Lou1415926 Lou1415926 Apr 6, 2023


For ALB, grace_period field was under http.healthcheck but grace_period is a service level field, so it should be under http and not http.healthcheck.

Just in case this wasn't discussed before this PR: I recall there was a discussion around this when grace_period was first introduced in #2576. grace_period is a health-check-related configuration, so we put it under http.health_check. Although in ECS it is configured as a service attribute, we tried to model our manifest "in the way that developers think" instead of "in the way ECS models it".

Contributor Author


I agree with your point, but it becomes quite awkward with our current structure if we keep it inside healthcheck.
If we keep it inside healthcheck, then:

  1. Either it has to be under both the main and the additional routing rules. But it is not a routing-rule parameter, so having it in the main or additional rules sounds wrong to me.
  2. Or it has to be under only the Main routing rule, which is also less descriptive of the purpose of the parameter.

Do let me know your thoughts on this.

Contributor


If multiple containers are handling requests routed from an ELB, I'd assume customers would want to set the grace period to accommodate for the container that is the slowest to start 💭

An alternative that we can consider is to get the max of the grace periods for individual rules.

```yaml
http:
  path: 'api'
  healthcheck:
    grace_period: 1m
  additional_rules:
    - path: 'user'
      healthcheck:
        path: 'user'
    - path: 'admin'
      healthcheck:
        path: 'admin'
        grace_period: 10m # The grace period is 10m because 10m > 1m.
```

I see a similar logic in this PR where you get the max of nlb and http's grace_periods.

Contributor Author

@paragbhingre paragbhingre Apr 6, 2023


Yeah, that could be another solution: we take the max of the grace periods across individual rules. But if nlb's grace_period is also specified, should we then take the max of nlb and http?

Edit - that makes me wonder whether nlb listeners should also have this grace_period parameter.

Contributor


I'd imagine we'd want to take the max, if we were to be consistent with the design logic of the "alternative approach".

The pro of the approach you proposed in this PR is that it fits more closely with ECS's model. The con is that it might be too specifically designed for ECS, and misses the abstraction that we've tried to achieve.

For the alternative approach, the pros & cons are the opposite.

I'd also love to hear from the others on this 👍🏼

Contributor Author

@paragbhingre paragbhingre Apr 6, 2023


As per the discussion with @iamhopaul123 and @efekarakus, here is a summary of what we are going to do:

  1. We agree that healthCheckGracePeriodSeconds should have been a load-balancer-level setting in ECS. But until ECS changes it, we will keep grace_period as is for the Main as well as the Additional Listener Rules in ALB. That means grace_period lives at the http.Main.HealthCheck level, so that in the future, whenever ECS changes it, we can open up grace_period for additional listener rules as well. For now, we will hide this setting from Additional Rules in the Copilot docs.
  2. In NLB, grace_period will be introduced at the nlb.Listener.HealthCheck level. grace_period on additional listeners will be hidden and not shown in our docs.
  3. When both ALB and NLB have grace_period set, we will throw a specific error saying customers can set either property but not both. The same goes for listener rules and listeners: if customers accidentally set grace_period on the Main as well as the Additional Listener Rules, we error out.

Do let me know if there are any concerns around this.
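Under that plan, the two manifest placements would look roughly like the sketch below. Field placement follows the summary above; the paths, ports, and durations are made-up values for illustration:

```yaml
# Option A: ALB grace period, on the main routing rule's health check only.
http:
  path: '/'
  healthcheck:
    grace_period: 60s

# Option B: NLB grace period, on the main listener's health check only.
# Note: per point 3 above, setting grace_period under both http and nlb
# (or on additional rules/listeners) is a validation error.
nlb:
  port: 443/tcp
  healthcheck:
    grace_period: 90s
```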

Comment on lines +108 to +110
```go
if strings.Contains(e.firstField, "additional_rules") {
	return fmt.Sprintf(`cannot define "grace_period" in "http.additional_rules[%d].healthcheck.grace_period"`, e.index)
} else if strings.Contains(e.firstField, "additional_listeners") {
```
Contributor


This is very confusing because it has nothing to do with the name of errGracePeriodSpecifiedMoreThanOnce. I feel like we need a different error for this case: can we use errFieldMutualExclusive if they specify grace_period in both places, and a new error type if they specify grace_period in additional rules?

Looking forward, the key is really about "decoupling" (so that we can make changes in one place; easier to maintain; less error-prone): we should avoid mixing things together just because they belong to the same "topic", and instead focus on what they do (functionality). For example, in this errors file errContainersExposingSamePort is probably ok because it is specific and we have to do it this way (the error message is specifically about ports). Whereas errSpecifiedBothIngressFields is a bad one because the error msg has nothing to do with ingress fields, and we should've reused errFieldMutualExclusive (even if we needed a new recommendation, we could have built it on top of errFieldMutualExclusive).

Contributor Author


Yeah, that sounds good to me. I have changed the error to errFieldMutualExclusive when customers define grace_period in both ALB and NLB, and renamed the existing errGracePeriodSpecifiedMoreThanOnce to errGracePeriodSpecifiedInAdditionalField for when customers define grace_period in additional rules or listeners. Let me know how that looks to you.
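As a rough sketch of the kind of generic, reusable "mutually exclusive fields" error being discussed, something like the following shape could work. All names here are hypothetical illustrations, not Copilot's actual types:

```go
package main

import "fmt"

// errFieldsMutuallyExclusive is a hypothetical generic error for two
// manifest fields that cannot be set at the same time. Because the
// message is derived from the field paths, new field pairs can reuse
// it without introducing a new error type.
type errFieldsMutuallyExclusive struct {
	firstField  string
	secondField string
}

func (e *errFieldsMutuallyExclusive) Error() string {
	return fmt.Sprintf("must specify one of %q and %q, but not both", e.firstField, e.secondField)
}

func main() {
	err := &errFieldsMutuallyExclusive{
		firstField:  "http.healthcheck.grace_period",
		secondField: "nlb.healthcheck.grace_period",
	}
	fmt.Println(err)
}
```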

Contributor

@efekarakus efekarakus left a comment


thanks Parag! Looks good, not much else to add besides the existing comments.

@Lou1415926 Lou1415926 added the do-not-merge Pull requests that mergify shouldn't merge until the requester allows it. label Apr 20, 2023
@paragbhingre paragbhingre removed the do-not-merge Pull requests that mergify shouldn't merge until the requester allows it. label Apr 24, 2023
Contributor

@iamhopaul123 iamhopaul123 left a comment


:shipit:

mergify Bot pushed a commit that referenced this pull request Apr 26, 2023
#4734
This PR adds `grace_period` field to the NLB main listener and also removes this field from the additional listener rules for the ALB.


By submitting this pull request, I confirm that you can use, modify, copy, and redistribute this contribution, under the Apache 2.0 License.