New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
cephadm: Remove containers pids-limit #44579
Conversation
dc1dab7
to
018a39f
Compare
7054091
to
6b60f8b
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Superficially, LGTM. I'd like to see this tested, if not by creating 255 iSCSI LUNs, then at least by spawning a few thousand e.g. sleep processes in both tcmu-runner and rbd-target-api containers.
6b60f8b
to
61d0f4b
Compare
|
@idryomov Did a sleep loop on both containers related to iscsi: |
|
Why not 10000 processes (or at least something bigger than 4096)? Are you sure that the default value for podman is actually 2048 and not 4096? This is a quote from |
|
Just did it for 20K processes: Regarding that limit of 4096 or 2048, I just checked podman documentation and it says now that the limit is 4096: --pids-limit=limit When I did the same change in ceph-ansible it was 2048 in the meantime they did the change in podman to match docker value: commit 118cf1fc634ffc63b908d6b082ffc3a53553a6af |
|
Great, thanks! Looks like you need to adjust |
|
as you already figured out the shell commands to run: would be great to have this in teuthology as well. Do you want to add a new workunit task that does it? https://github.com/ceph/ceph/tree/master/qa/suites/orch/cephadm/workunits Maybe something like this could work: roles:
- - host.a
- osd.0
- osd.1
- osd.2
- mon.a
- mgr.a
- client.0
tasks:
- install:
- cephadm:
- cephadm.shell:
host.a:
- ceph orch apply iscsi iscsi
- workunit:
clients:
client.0:
- cephadm/test_iscsi_pid_limit.sh?? |
61d0f4b
to
e1e23a7
Compare
450aa72
to
d81b5a9
Compare
57e6d85
to
8358592
Compare
|
jenkins test make check |
ffab464
to
e719c4f
Compare
The default pids-limit (docker 4096/podman 2048) prevent some customization from working (http threads on RGW) or limits the number of luns per iscsi target. Fixes: https://tracker.ceph.com/issues/52898 Signed-off-by: Teoman ONAY <tonay@redhat.com>
Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
e719c4f
to
f0ade57
Compare
|
I'm not sure why, but with docker on Ubuntu Focal this issue doesn't seem to reproduce in teuthology. I went with restricting this test to podman on CentOS Stream 8 (where the problem was reported and reproduces) and tweaked the workunit script a bit as the switch between docker and podman is no longer needed. I also added an assert on |
|
https://pulpito.ceph.com/dis-2022-03-22_20:43:37-orch:cephadm-wip-dis-testing-distro-default-smithi/ Without the fix: With the fix: |
cephadm: Remove containers pids-limit
The default pids-limit (docker 4096/podman 2048) prevent some
customization from working (http threads on RGW) or limits the number
of luns per iscsi target.
Replaces existing PR #42214
Fixes: https://tracker.ceph.com/issues/52898
Signed-off-by: Teoman ONAY tonay@redhat.com