-
Notifications
You must be signed in to change notification settings - Fork 113
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
fix critical services issue on pg follower restart #8548
fix critical services issue on pg follower restart #8548
Conversation
👷 Deploy Preview for chef-automate processing.
|
49e8b24
to
5df37cf
Compare
exit 2 | ||
fi | ||
|
||
for i in $(seq 1 5); |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Aren't we suppose to add sleep time in between the calls?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
done
default-server inter 3s fall 3 rise 2 on-marked-down shutdown-sessions | ||
{{~#each cfg.service.parsed_nodes as |node|}} | ||
{{~#if node.is_domain}} | ||
{{~#if ../cfg.resolvers.nameservers }} | ||
server-template {{node.address}} 8 {{node.address}}:{{node.port}} check resolvers pgdns init-addr none resolve-prefer ipv4 | ||
{{else}} | ||
server {{node.address}} {{node.address}}:{{node.port}} check | ||
server {{node.address}} {{node.address}}:{{node.port}} maxconn 350 check port 6432 |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
if we consider this value -> maxconn 350
earlier this is the part of backend ha proxy and its value is 350. And there are 3 HAproxy at backend.
Now the pg-gateway on each FE will have this value, does we need to increase the number of PG connection on the backend package ?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
increased default max_connection to 1500
…a to pg leader from pg-gateway Signed-off-by: Vivek Shankar <vshankar@progress.com>
Signed-off-by: Vivek Shankar <vshankar@progress.com>
Signed-off-by: Vivek Shankar <vshankar@progress.com>
Signed-off-by: Vivek Shankar <vshankar@progress.com>
Signed-off-by: punitmundra <pmundra@progress.com>
Signed-off-by: Vivek Shankar <vshankar@progress.com>
Signed-off-by: Vivek Shankar <vshankar@progress.com>
Signed-off-by: Vivek Shankar <vshankar@progress.com>
Signed-off-by: Vivek Shankar <vshankar@progress.com>
Signed-off-by: Vivek Shankar <vshankar@progress.com>
6829532
to
7e4ea92
Compare
Signed-off-by: Vivek Shankar <vshankar@progress.com>
Quality Gate failedFailed conditions |
@@ -89,6 +89,9 @@ shared_buffers = "1GB" | |||
username = "admin" | |||
name = "Local Administrator" | |||
password = "chefautomate" | |||
[deployment.v1.svc.health] | |||
health_check_interval=31 |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
why is this 31 ?
* fix critical services issue on pg follower restart. Directly send data to pg leader from pg-gateway Signed-off-by: Vivek Shankar <vshankar@progress.com> * added health check retry Signed-off-by: Vivek Shankar <vshankar@progress.com> * updated working patch health check config for provided services Signed-off-by: Vivek Shankar <vshankar@progress.com> * renamed folder Signed-off-by: Vivek Shankar <vshankar@progress.com> * review comments Signed-off-by: punitmundra <pmundra@progress.com> * fail timout added in es, changes port from 7432 to 5432 Signed-off-by: Vivek Shankar <vshankar@progress.com> * added log in cs health hook Signed-off-by: Vivek Shankar <vshankar@progress.com> * path after upgrade Signed-off-by: Vivek Shankar <vshankar@progress.com> * added max conn configurable Signed-off-by: Vivek Shankar <vshankar@progress.com> * fix policy Signed-off-by: Vivek Shankar <vshankar@progress.com> * fix policy Signed-off-by: Vivek Shankar <vshankar@progress.com> * revert pg Signed-off-by: Vivek Shankar <vshankar@progress.com> --------- Signed-off-by: Vivek Shankar <vshankar@progress.com> Signed-off-by: punitmundra <pmundra@progress.com> Co-authored-by: punitmundra <pmundra@progress.com>
directly send data pg leader from pg-gateway
🔩 Description: What code changed, and why?
https://chefio.atlassian.net/browse/CHEF-15716
https://chefio.atlassian.net/browse/CHEF-15717
https://chefio.atlassian.net/browse/CHEF-15718
https://chefio.atlassian.net/browse/CHEF-15719
CHEF-15716_MAKE_CONFIGURABLE_FAIL_TIMEOUT_ES_GATEWAY.mp4
CHEF-15717_HEALTH_CHECK_INTERVAL_CONFIGURABLE.mp4
CHEF-15718_OPENSEARCH_REBOOT_NO_FE_SERVICE_CRITICAL.mov
CHEF-15719_PG_FOLOWER_REBOOT_FE_SERVICE_NO_CRITICAL.mov
Upgrade
⛓️ Related Resources
👍 Definition of Done
👟 How to Build and Test the Change
✅ Checklist
All PRs must tick these:
With occasional exceptions, all PRs from Progress employees must tick these:
make spell
in any component directory)All PRs from Progress employees should tick these if appropriate:
Please add a note next to any checkbox above if you are NOT ticking it.
📷 Screenshots, if applicable