
reef: cephadm: rm podman-auth.json if removing last cluster#56105

Merged
adk3798 merged 1 commit into ceph:reef from adk3798:wip-64634-reef
Mar 13, 2024

adk3798 commented Mar 10, 2024

backport tracker: https://tracker.ceph.com/issues/64634


backport of #55588
parent tracker: https://tracker.ceph.com/issues/64433

this backport was staged using ceph-backport.sh version 16.0.0.6848
find the latest version at https://github.com/ceph/ceph/blob/main/src/script/ceph-backport.sh

adk3798 requested a review from a team as a code owner March 10, 2024 20:51
adk3798 added this to the reef milestone Mar 10, 2024
for fname in glob(f'{ctx.log_dir}/cephadm.log*'):
    os.remove(fname)

unlink_file(Path('/etc/ceph/podman-auth.json'), missing_ok=True, ignore_errors=True)
cephadm.py:8221: error: Name "unlink_file" is not defined

will need to rewrite this bit

Roughly took what the unlink_file function was doing and wrote it here. Not ideal, but we aren't going to end up backporting that change to reef, so this is probably our best option.
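A minimal sketch of what such an inline replacement could look like, assuming the helper's behavior matches the `missing_ok=True, ignore_errors=True` arguments in the snippet above (tolerate a missing file, swallow other OS errors). The function name `remove_if_present` is hypothetical and used only for illustration; this is not the code that was merged into reef.

```python
import os


def remove_if_present(path: str) -> bool:
    """Best-effort delete: return True if a file was removed, False otherwise."""
    try:
        os.unlink(path)
        return True
    except FileNotFoundError:
        # Counterpart of missing_ok=True: nothing to clean up.
        return False
    except OSError:
        # Counterpart of ignore_errors=True: a failed cleanup
        # should not abort the rest of rm-cluster.
        return False
```

In the context above, the call would be something like `remove_if_present('/etc/ceph/podman-auth.json')`, with the return value ignored.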

We have points in rm-cluster where we check that
there are no other clusters on the host. If that
is the case, we can also clear /etc/ceph/podman-auth.json
which gets written out when we log in to a registry
while using podman

Fixes: https://tracker.ceph.com/issues/64433

Signed-off-by: Adam King <adking@redhat.com>
(cherry picked from commit d203a97)
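
The logic described in the commit message can be sketched roughly as follows. This is an illustration under stated assumptions, not the cephadm code: `count_clusters` and `cleanup_shared_files` are hypothetical names, and treating every UUID-named subdirectory of the data directory as a cluster is an assumption made for the example.

```python
import uuid
from pathlib import Path


def count_clusters(data_dir: Path) -> int:
    """Count subdirectories whose names parse as a UUID (assumed cluster fsids)."""
    if not data_dir.is_dir():
        return 0
    count = 0
    for entry in data_dir.iterdir():
        if not entry.is_dir():
            continue
        try:
            uuid.UUID(entry.name)
        except ValueError:
            continue  # not an fsid-like name, so not counted as a cluster
        count += 1
    return count


def cleanup_shared_files(data_dir: Path, auth_file: Path) -> bool:
    """Remove the registry auth file only when no clusters remain on the host."""
    if count_clusters(data_dir) > 0:
        return False  # another cluster may still rely on the registry login
    try:
        auth_file.unlink()
    except OSError:
        pass  # missing file or other error: cleanup is best effort
    return True
```

The point of the gate is that the auth file is shared per host rather than per cluster, so it is only safe to delete once the last cluster is gone.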

adk3798 commented Mar 12, 2024

jenkins test make check


adk3798 commented Mar 13, 2024

https://pulpito.ceph.com/adking-2024-03-11_12:07:34-orch:cephadm-wip-adk3-testing-2024-03-11-0143-reef-distro-default-smithi/

reruns: https://pulpito.ceph.com/adking-2024-03-11_17:13:18-orch:cephadm-wip-adk3-testing-2024-03-11-0143-reef-distro-default-smithi/

After reruns:

  • 4 failed instances of mds_upgrade_sequence, known issue
  • test_cephadm fails with "Error: Container release squid != cephadm release reef", known issue
  • mgr-nfs-upgrade test fails with "rcu_sched detected stalls on CPUs/tasks:" in syslog, known issue

Nothing to block merging

adk3798 merged commit 74ed9f0 into ceph:reef Mar 13, 2024