pacific: cephadm: allow ports to be opened in firewall during adoption, reconfig, redeploy #52083

Merged
merged 3 commits into from Oct 17, 2023

@adk3798 adk3798 commented Jun 15, 2023

backport tracker: https://tracker.ceph.com/issues/61677


backport of #51070
parent tracker: https://tracker.ceph.com/issues/59443

this backport was staged using ceph-backport.sh version 16.0.0.6848
find the latest version at https://github.com/ceph/ceph/blob/main/src/script/ceph-backport.sh

Prior to this patch we were discarding the provided
ports on reconfig and redeploy in order to not fail
thinking there was a port conflict with the instance
of the daemon we were about to reconfig/redeploy. However,
it's still desirable for us to make sure the firewall ports
are open when we do a reconfig/redeploy, so this refactors
the port handling approach to have it do that but
still avoid checking for port conflicts. It also includes
an update of the type signature of deploy_daemon
to the py3 style. That wasn't needed for the change
but since I was adding an argument there I thought we might
as well do it now.

Signed-off-by: Adam King <adking@redhat.com>
(cherry picked from commit fdecd66)

Conflicts:
	src/cephadm/cephadm
	src/cephadm/tests/test_cephadm.py
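
For readers skimming the backport, the port-handling idea in the commit message above can be sketched roughly as follows. This is only an illustration, not the code from src/cephadm/cephadm: `FakeHost`, `deploy_daemon_sketch` and the `fresh_deploy` flag are hypothetical names, and the host's bound ports and firewall are modelled as plain in-memory sets.

```python
from dataclasses import dataclass, field
from typing import List, Set


@dataclass
class FakeHost:
    """Hypothetical stand-in for a host: bound ports and firewall-open ports."""
    ports_in_use: Set[int] = field(default_factory=set)
    firewall_open: Set[int] = field(default_factory=set)


def deploy_daemon_sketch(host: FakeHost, ports: List[int], fresh_deploy: bool) -> None:
    """Always open the firewall ports; check for conflicts only on a fresh deploy."""
    if fresh_deploy:
        conflicts = [p for p in ports if p in host.ports_in_use]
        if conflicts:
            raise RuntimeError(f'port(s) already in use: {conflicts}')
    # On reconfig/redeploy the existing daemon instance already holds these
    # ports, so an "in use" result would be a false conflict. The firewall
    # still needs the ports opened, which is the point of the refactor.
    host.firewall_open.update(ports)
    host.ports_in_use.update(ports)
```

With this shape, a reconfig of a daemon already bound to its port still ends with the port open in the firewall, while a fresh deploy onto an occupied port fails before anything is opened.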
Otherwise we risk the prometheus/alertmanager/grafana
not functioning properly after adoption due to the necessary
port in the firewall not being open.

Fixes: https://tracker.ceph.com/issues/59443

Signed-off-by: Adam King <adking@redhat.com>
(cherry picked from commit 40a9817)
Since the options are mutually exclusive, using
an enum is preferable to having multiple bools
to track each of them

Signed-off-by: Adam King <adking@redhat.com>
(cherry picked from commit 7081759)

Conflicts:
	src/cephadm/cephadm
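
A minimal sketch of the enum-over-bools argument from the last commit message, assuming three mutually exclusive modes. The class name, member names and helper below are illustrative assumptions and may not match the actual patch.

```python
from enum import Enum


class DeploymentType(Enum):
    # Illustrative members: exactly one mode applies to any given call.
    DEFAULT = 'Deploy'
    REDEPLOY = 'Redeploy'
    RECONFIG = 'Reconfig'


def should_check_port_conflicts(deployment_type: DeploymentType) -> bool:
    """Only a fresh deployment needs the port-conflict check (hypothetical helper)."""
    return deployment_type is DeploymentType.DEFAULT
```

With separate `reconfig` and `redeploy` booleans, a caller could set both to True and every consumer would have to decide what that means. A single enum argument makes that invalid combination impossible to express.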
@adk3798 adk3798 requested a review from a team as a code owner June 15, 2023 12:28
@adk3798 adk3798 added this to the pacific milestone Jun 15, 2023
adk3798 commented Oct 16, 2023

https://pulpito.ceph.com/adking-2023-10-12_12:06:41-orch:cephadm-wip-adk2-testing-2023-10-11-1434-pacific-distro-default-smithi/

reruns of failed jobs: https://pulpito.ceph.com/adking-2023-10-12_22:09:36-orch:cephadm-wip-adk2-testing-2023-10-11-1434-pacific-distro-default-smithi/

reruns of reruns of failed jobs: https://pulpito.ceph.com/adking-2023-10-13_12:25:25-orch:cephadm-wip-adk2-testing-2023-10-11-1434-pacific-distro-default-smithi/

Needed double reruns because we were hitting a combination of

toomanyrequests: You have reached your pull rate limit. You may increase the limit by authenticating and upgrading: https://www.docker.com/increase-rate-limit

and

Failed to reconnect to smithiXXX

failures.

After the second rerun, 3 failures:

  • 2 tests failed deploying nfs clusters because they were running ceph --cluster ceph n f s ' ' c l u s t e r ' ' c r e a t e ' ' t e s t (the spaces are not a formatting error in the comment, that's what was run in the test) which failed. Must have gotten broken in the pacific suite at some point.
  • The dashboard test_e2e test is known to fail on pacific.

Nothing new or that should block merging for the backports in the run.

@adk3798 adk3798 merged commit 5703f3d into ceph:pacific Oct 17, 2023
8 checks passed