-
Notifications
You must be signed in to change notification settings - Fork 552
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Introduced partition shutdown watchdog timer #16067
Introduced partition shutdown watchdog timer #16067
Conversation
Signed-off-by: Michal Maslanka <michal@redpanda.com>
ducktape was retried in https://buildkite.com/redpanda/redpanda/builds/43670#018cf8de-ec3c-4a65-8df4-ae3fc71ed87a ducktape was retried in https://buildkite.com/redpanda/redpanda/builds/43714#018cfd8f-71dd-44b9-9425-bec94b464426 |
@@ -101,6 +107,11 @@ partition_manager::get_topic_partition_table( | |||
return rs; | |||
} | |||
|
|||
ss::future<> partition_manager::start() { | |||
maybe_arm_shutdown_watchdog(); |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
nit: I think this is a good usecase for arm_periodic()
Introduced property that is going to be used by `cluster::partition_manager` shutdown watchdog mechanism. Signed-off-by: Michal Maslanka <michal@redpanda.com>
1b7884a
to
f1cac93
Compare
Introduced a watchdog that is tracking partition shutdown state. The watchdog is intended to provide a mechanism that will ease debugging of partition shutdown issues. The watchdog tracks state of partitions that were requested to be stopped or removed. When there was no state update for the time longer than the configurable threshold (by default 30 seconds) the watchdog will emit error log entry which will inform user about the problem. Signed-off-by: Michal Maslanka <michal@redpanda.com>
f1cac93
to
3adfdfc
Compare
/backport v23.3.x |
/backport v23.2.x |
Failed to create a backport PR to v23.3.x branch. I tried:
|
Failed to create a backport PR to v23.2.x branch. I tried:
|
Introduced a watchdog that is tracking partition shutdown state. The
watchdog is intended to provide a mechanism that will ease debugging of
partition shutdown issues. The watchdog tracks state of partitions that
were requested to be stopped or removed. When there was no state update
for the time longer than the configurable threshold (by default 30
seconds) the watchdog will emit error log entry which will inform user
about the problem.
Backports Required
Release Notes