pacific: python-common/drive_group: handle fields outside of 'spec' even when 'spec' is provided #52413

Merged
merged 1 commit into ceph:pacific on Oct 17, 2023

@adk3798 adk3798 commented Jul 12, 2023

backport tracker: https://tracker.ceph.com/issues/61686


backport of #51861
parent tracker: https://tracker.ceph.com/issues/61533

this backport was staged using ceph-backport.sh version 16.0.0.6848
find the latest version at https://github.com/ceph/ceph/blob/main/src/script/ceph-backport.sh

…'spec' is provided

Otherwise certain specs such as

service_type: osd
service_id: xxx
service_name: osd.xxx
placement:
  hosts:
  - vm-00
spec:
  osds_per_device: 2
data_devices:
  paths:
  - /dev/vde

fail to apply with

Error EINVAL: ServiceSpec: 'dict' object has no attribute 'validate'

which is not a useful error message. This is caused by the
spec assuming that all OSD-specific fields are defined either
inside the 'spec' section or outside of it, but never a mix of both.
We could instead treat these specs as invalid
and raise a better error message, but it seems easier
to make the minor adjustment so that they work, given that mixing
the two styles doesn't seem to be a problem for specs of
other service types.

Fixes: https://tracker.ceph.com/issues/61533

Signed-off-by: Adam King <adking@redhat.com>
(cherry picked from commit 12901f6)
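
For readers who want a concrete picture of "mixing the styles", here is a minimal sketch of one way to fold the fields nested under 'spec' together with the fields given at the top level. It is not the code from this commit: `flatten_osd_spec` is a hypothetical helper, and raising on conflicting values is an assumption made only for this illustration. The input dict mirrors the YAML spec shown in the commit message.

```python
from typing import Any, Dict


# Hypothetical helper for illustration only; this is NOT the function
# used in ceph's python-common drive_group module.
def flatten_osd_spec(raw: Dict[str, Any]) -> Dict[str, Any]:
    """Merge fields nested under 'spec' with fields given at the top level."""
    # Start from every top-level field except the nested 'spec' section.
    flat = {key: value for key, value in raw.items() if key != 'spec'}

    nested = raw.get('spec') or {}
    if not isinstance(nested, dict):
        raise ValueError("'spec' must be a mapping")

    for key, value in nested.items():
        # Assumption: a field defined in both places with different
        # values is treated as an error rather than silently overridden.
        if key in flat and flat[key] != value:
            raise ValueError(f"conflicting values for {key!r}")
        flat[key] = value
    return flat


# The mixed-style spec from the commit message, as parsed YAML.
mixed = {
    'service_type': 'osd',
    'service_id': 'xxx',
    'service_name': 'osd.xxx',
    'placement': {'hosts': ['vm-00']},
    'spec': {'osds_per_device': 2},
    'data_devices': {'paths': ['/dev/vde']},
}

flat = flatten_osd_spec(mixed)
```

With this approach, `osds_per_device` and `data_devices` end up side by side in `flat` regardless of which section they came from, so later validation sees a single uniform mapping.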

adk3798 commented Oct 16, 2023

https://pulpito.ceph.com/adking-2023-10-12_12:06:41-orch:cephadm-wip-adk2-testing-2023-10-11-1434-pacific-distro-default-smithi/

reruns of failed jobs: https://pulpito.ceph.com/adking-2023-10-12_22:09:36-orch:cephadm-wip-adk2-testing-2023-10-11-1434-pacific-distro-default-smithi/

reruns of reruns of failed jobs: https://pulpito.ceph.com/adking-2023-10-13_12:25:25-orch:cephadm-wip-adk2-testing-2023-10-11-1434-pacific-distro-default-smithi/

Needed double reruns because we were hitting a combination of

toomanyrequests: You have reached your pull rate limit. You may increase the limit by authenticating and upgrading: https://www.docker.com/increase-rate-limit

and

Failed to reconnect to smithiXXX

failures.

After the second rerun, 3 failures:

  • 2 tests failed deploying nfs clusters because they were running ceph --cluster ceph n f s ' ' c l u s t e r ' ' c r e a t e ' ' t e s t (the spaces are not a formatting error in the comment, that's what was run in the test) which failed. Must have gotten broken in the pacific suite at some point.
  • The dashboard test_e2e test is known to fail on pacific.

Nothing new or that should block merging for the backports in the run.

@adk3798 adk3798 merged commit 8d6664a into ceph:pacific Oct 17, 2023
8 of 9 checks passed