Reserve CPU for system operations #14068
Labels
complexity: no estimate
feature: administration
feature
need refined description
Problem Statement
With the right set of heavy queries, it is possible to exhaust all available CPUs on a CrateDB Cloud cluster. At that point the cluster becomes unresponsive and difficult to debug (i.e. it is not possible to query sys.jobs, sys.jobs_log, and other system tables).
It would be very useful if CrateDB had some form of QoS for thread pools and always reserved a fraction of a CPU for system management operations.
An inspiration for this is the reserved-blocks mechanism of Linux ext filesystems, where the last 5% of disk space (configurable) is reserved for root.
Possible Solutions
If QoS is not possible, can we somehow tweak the thread pool sizes when starting a cluster? This assumes there is a separate thread pool for management/system operations.
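To make the thread pool idea concrete, here is a rough sketch of how the sizes could be derived at startup so that user queries never get every core. The function name `split_cpu_budget`, the `reserved_fraction` parameter and the 5% default are hypothetical and only mirror the disk analogy above; none of them are existing CrateDB settings. It also assumes that a separate management pool exists, and capping pool sizes only limits concurrency, it is not a hard CPU guarantee.

```python
import math
import os


def split_cpu_budget(total_cores: int, reserved_fraction: float = 0.05) -> tuple[int, int]:
    """Return (query_threads, management_threads) for one node.

    Hypothetical helper: the names and the 5% default are illustrative
    assumptions, not CrateDB configuration.
    """
    if total_cores < 2:
        raise ValueError("need at least 2 cores to reserve one for management")
    if not 0.0 < reserved_fraction < 1.0:
        raise ValueError("reserved_fraction must be strictly between 0 and 1")

    # Round up so that even a small fraction reserves one whole thread.
    management_threads = max(1, math.ceil(total_cores * reserved_fraction))
    # Always leave at least one thread for user queries.
    management_threads = min(management_threads, total_cores - 1)

    query_threads = total_cores - management_threads
    return query_threads, management_threads


if __name__ == "__main__":
    cores = os.cpu_count() or 2
    if cores >= 2:
        query, management = split_cpu_budget(cores)
        print(f"{cores} cores: {query} query threads, {management} management threads")
    else:
        print("single-core machine: nothing can be reserved")
```

On a 16-core node with the default fraction this yields 15 query threads and 1 management thread. Whether one thread is enough to keep the system tables queryable under load would need to be measured.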
Considered Alternatives
No response