
doc/start: update hardware recs #47109

Merged

Conversation

Contributor

@zdover23 zdover23 commented Jul 14, 2022

This PR picks up the parts of #44466 that were not merged back in January, when that pull request was raised.

Matters added here:

  • improved organization of the CPU section
  • emphasis on IOPs per core over cores per OSD

Signed-off-by: Zac Dover <zac.dover@gmail.com>

Contribution Guidelines

Checklist

  • Tracker (select at least one)
    • References tracker ticket
    • Very recent bug; references commit where it was introduced
    • New feature (ticket optional)
    • Doc update (no ticket needed)
    • Code cleanup (no ticket needed)
  • Component impact
    • Affects Dashboard, opened tracker ticket
    • Affects Orchestrator, opened tracker ticket
    • No impact that needs to be tracked
  • Documentation (select at least one)
    • Updates relevant documentation
    • No doc update is appropriate
  • Tests (select at least one)
Show available Jenkins commands
  • jenkins retest this please
  • jenkins test classic perf
  • jenkins test crimson perf
  • jenkins test signed
  • jenkins test make check
  • jenkins test make check arm64
  • jenkins test submodules
  • jenkins test dashboard
  • jenkins test dashboard cephadm
  • jenkins test api
  • jenkins test docs
  • jenkins render docs
  • jenkins test ceph-volume all
  • jenkins test ceph-volume tox
  • jenkins test windows

@zdover23 zdover23 requested a review from a team as a code owner July 14, 2022 22:57
Contributor

@anthonyeleven anthonyeleven left a comment


Nice, a worthy improvement to the slippery topic of hardware recommendations.
I've made a few comments/suggestions.

separate hosts to avoid resource contention.

CephFS metadata servers (MDS) are CPU-intensive. CephFS metadata servers (MDS)
should therefore have quad-core (or better) CPUs and high clock rates (GHz). OSD
Contributor

I think we might do well to be clearer that MDS nodes benefit a lot more from clock rate than from core count, so a 4-core 3.5 GHz model would be preferable to an 8-core 2.5 GHz SKU. I think the current MDS may be single-threaded, so maybe something like "(MDS) don't need more than 4 cores, but should have as high a clock rate (GHz) as possible". Or "frequency" instead of "clock rate"; I think in terms of the latter, but the former might be more common with our audience.
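[Editor's note: to make the trade-off concrete, here is a minimal sketch of the arithmetic; the two SKUs and the single-hot-thread assumption are illustrative placeholders, not figures from this PR.]

    # Why a largely single-threaded MDS favours clock rate over core count.
    # Both SKUs below are hypothetical examples.
    skus = {
        "4-core @ 3.5 GHz": (4, 3.5),
        "8-core @ 2.5 GHz": (8, 2.5),
    }

    for name, (cores, ghz) in skus.items():
        # A single busy MDS thread is bounded by per-core frequency;
        # aggregate GHz only helps if the work parallelizes.
        print(f"{name}: single-thread {ghz} GHz, aggregate {cores * ghz:.1f} GHz")

    # The 4-core part wins on the metric that matters for a mostly
    # single-threaded MDS, despite its lower aggregate figure.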

should therefore have quad-core (or better) CPUs and high clock rates (GHz). OSD
nodes need enough processing power to run the RADOS service, to calculate data
placement with CRUSH, to replicate data, and to maintain their own copies of the
cluster map.
Contributor

Do we want to mention EC parity / hash computation?
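[Editor's note: for readers unfamiliar with why that would matter: parity computation is per-byte CPU work on the write path. A toy sketch follows, using plain XOR parity (a degenerate k+1 code), not Ceph's actual jerasure/ISA-L plugins.]

    # Toy XOR parity over k data chunks (a k+1 code), to show the shape of the
    # per-byte CPU work that erasure coding adds. Ceph's real EC plugins are far
    # more sophisticated, but the cost still scales with bytes written.
    def xor_parity(chunks):
        parity = bytearray(len(chunks[0]))
        for chunk in chunks:
            for i, b in enumerate(chunk):
                parity[i] ^= b          # one CPU operation per byte per chunk
        return bytes(parity)

    data = [bytes([n]) * 4096 for n in (1, 2, 4)]   # k = 3 equal-sized chunks
    print(xor_parity(data)[:4])                      # b'\x07\x07\x07\x07'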

the number of cores per OSD, but this cores-per-OSD metric is no longer as
useful a metric as the number of cycles per IOP and the number of IOPs per OSD.
For example, for NVMe drives, Ceph can easily utilize five or six cores on real
clusters and up to about fourteen cores on single OSDs in isolation. So cores
Contributor

From discussion with the good Mr. Nelson I know what isolation means here, but I might ask if that info is useful to our readers, or if it might confuse them. I also am often uncertain re whether we're talking about physical cores or [hyper] threads; I suspect these numbers are the latter.
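[Editor's note: for readers who want to see how the cycles-per-IOP framing gets used, here is a back-of-the-envelope sketch; every number below is a hypothetical placeholder, not a benchmark result from this PR.]

    # Core budget per OSD in the "cycles per IOP" framing.
    # Placeholder figures only -- substitute numbers from your own benchmarks.
    clock_hz       = 3.0e9      # 3.0 GHz per core (or hyperthread)
    cycles_per_iop = 250_000    # assumed OSD-path cost of one small random IOP
    target_iops    = 60_000     # desired IOPs from one NVMe-backed OSD

    cores_needed = target_iops * cycles_per_iop / clock_hz
    print(f"~{cores_needed:.1f} cores/threads per OSD")   # ~5.0 here

With these placeholders the result lands in the same ballpark as the "five or six cores on real clusters" figure quoted in the doc text above, which is the point of the new framing: the CPU budget follows from the IOPs target rather than from a fixed cores-per-OSD rule.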

modest processors. If your host machines will run CPU-intensive processes in
addition to Ceph daemons, make sure that you have enough processing power to
run both the CPU-intensive processes and the Ceph daemons. (OpenStack Nova is
one such example of a CPU-intensive process.) We recommend that you run
Contributor

Maybe drop the parens, since that's a standalone sentence? Or am I being sententious? Maybe word this as "OpenStack nova-compute or Proxmox" -- we seem to see a growing population of Ceph users by virtue of converged Proxmox deployments.


@mheler mheler Jul 16, 2022


I think this should be changed to qemu-kvm as an example, and not OpenStack Nova. Nova itself isn't very CPU-intensive, but qemu-kvm would cover almost all use cases where Ceph would be co-located with virtual machines, including Kubernetes situations where VMs and OSDs reside on the same host.
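[Editor's note: to illustrate the co-location concern concretely, here is a rough CPU-budget sketch for a converged host; all figures are made-up examples, not recommendations from this PR.]

    # Hypothetical thread budget for a host running OSDs next to qemu-kvm guests.
    host_threads    = 64   # total hyperthreads on the host
    threads_per_osd = 5    # per-OSD allowance (see the cycles-per-IOP sketch above)
    osds_on_host    = 8
    reserved_os     = 4    # kernel, networking, monitoring agents

    vm_threads = host_threads - osds_on_host * threads_per_osd - reserved_os
    print(f"threads left for qemu-kvm guests: {vm_threads}")   # 64 - 40 - 4 = 20

    # If guests are overcommitted past this figure, VMs and OSDs contend for
    # CPU, which is exactly the failure mode the doc text warns about.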

Contributor

See Zac's comment below about a followup issue

@neha-ojha neha-ojha requested a review from markhpc July 15, 2022 14:32
@zdover23 zdover23 merged commit f43c7a6 into ceph:main Jul 16, 2022
@zdover23
Contributor Author

https://tracker.ceph.com/issues/55938 - Anthony's comments are collected in this tracker ticket, the June 2022 hardware recommendations documentation tracker, which is the page on which I track all mid-2022 hardware recommendations documentation updates.

@zdover23
Contributor Author

#47122 - Pacific backport
#47123 - Quincy backport
