Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

mimic: rados: prevent ShardedOpWQ suicide_grace drop when waiting for work. #34883

Closed
wants to merge 1 commit into from

Conversation

hillpd
Copy link
Contributor

@hillpd hillpd commented May 2, 2020

backport tracker: https://tracker.ceph.com/issues/45358


backport of #34575
parent tracker: https://tracker.ceph.com/issues/45076

this backport was staged using ceph-backport.sh version 15.1.1.389
find the latest version at https://github.com/ceph/ceph/blob/master/src/script/ceph-backport.sh

The Sharded OpWQ will opportunistically wait for more work when
processing an empty queue. While waiting, the default work queue
heartbeat timeout and suicide_grace values are modified. The
`threadpool_default_timeout` grace is applied and suicide_grace is
disabled. If this op hangs, the heartbeat watchdog will not trigger an
OSD suicide recovery.

The default work queue values for grace and suicide_grace are re-applied
after finding work. This keeps the heartbeat timeouts consistent with
the values applied on _process() entry.

Fixes: https://tracker.ceph.com/issues/45076
Signed-off-by: Dan Hill <daniel.hill@canonical.com>
(cherry picked from commit 85f6e8d)
@hillpd
Copy link
Contributor Author

hillpd commented May 2, 2020

@ceph/backport-admins

@smithfarm smithfarm added this to the mimic milestone May 4, 2020
@smithfarm smithfarm added the core label May 4, 2020
@smithfarm
Copy link
Contributor

mimic EOL

@smithfarm smithfarm closed this Jan 27, 2021
@hillpd hillpd deleted the wip-45358-mimic branch January 28, 2021 21:44
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
4 participants