RFE - Monitor the current DB locks ( nsslapd-db-current-locks ) #4623

droideck · 2021-02-15T20:04:40Z

Is your feature request related to a problem? Please describe.
db lock gets exhausted because of unindexed internal searches (under a transaction). Indexing those searches is the way to prevent exhaustion.

Describe the solution you'd like
To prevent db lock exhaustion and help admin task a possible solutions would be:

If db lock get exhausted during a txn, it leads to db panic and the later recovery can possibly fail. That leads to a full reinit of the instance where the db locks got exhausted. The server should monitor the db lock and trigger server shutdown (similar to disk full) if the db lock is close to be exhausted. Because of the performance impact, the monitoring should be limited to unindexed (allid(candidate)) internal searches (under a txn) and periodically (after each 1000 evaluated candidate). unindexed should be flagged in ldbm_back_search. transaction can be tested with pblock(SLAPI_TXN), monitoring should be done in iterate, internal op is an operation flag OP_FLAG_INTERNAL.
To help indexing the appropriate attributes, unindexed internal search (under txn) should log a warning with the search filter.
a config parameter should toggle monitoring/shutdown. By default it should be enabled.
Monitoring returns value that may be not exact. The threshold to trigger the shutdown should take into account that the value is not perfect.

droideck · 2021-04-09T11:20:40Z

Bugzillas:
https://bugzilla.redhat.com/show_bug.cgi?id=1831812
https://bugzilla.redhat.com/show_bug.cgi?id=1812286

@Firstyear

* Issue 4623 - RFE - Monitor the current DB locks Description: DB lock gets exhausted because of unindexed internal searches (under a transaction). Indexing those searches is the way to prevent exhaustion. If db lock get exhausted during a txn, it leads to db panic and the later recovery can possibly fail. That leads to a full reinit of the instance where the db locks got exhausted. Add three attributes to global BDB config: "nsslapd-db-locks-monitoring-enabled", "nsslapd-db-locks-monitoring-threshold" and "nsslapd-db-locks-monitoring-pause". By default, nsslapd-db-locks-monitoring-enabled is turned on, nsslapd-db-locks-monitoring-threshold is set to 90% and nsslapd-db-locks-monitoring-threshold is 500ms. When current locks are close to the maximum locks value of 90% - returning the next candidate will fail until the maximum of locks won't be increased or current locks are released. The monitoring thread runs with the configurable interval of 500ms. Add the setting to UI and CLI tools. Fixes: #4623 Reviewed by: @Firstyear, @tbordaz, @jchapma, @mreynolds389 (Thank you!!)

@Firstyear

Description: DB lock gets exhausted because of unindexed internal searches (under a transaction). Indexing those searches is the way to prevent exhaustion. If db lock get exhausted during a txn, it leads to db panic and the later recovery can possibly fail. That leads to a full reinit of the instance where the db locks got exhausted. Add three attributes to global BDB config: "nsslapd-db-locks-monitoring-enabled", "nsslapd-db-locks-monitoring-threshold" and "nsslapd-db-locks-monitoring-pause". By default, nsslapd-db-locks-monitoring-enabled is turned on, nsslapd-db-locks-monitoring-threshold is set to 90% and nsslapd-db-locks-monitoring-threshold is 500ms. When current locks are close to the maximum locks value of 90% - returning the next candidate will fail until the maximum of locks won't be increased or current locks are released. The monitoring thread runs with the configurable interval of 500ms. Add the setting to UI and CLI tools. Fixes: #4623 Reviewed by: @Firstyear, @tbordaz, @jchapma, @mreynolds389 (Thank you!!)

@Firstyear

Description: DB lock gets exhausted because of unindexed internal searches (under a transaction). Indexing those searches is the way to prevent exhaustion. If db lock get exhausted during a txn, it leads to db panic and the later recovery can possibly fail. That leads to a full reinit of the instance where the db locks got exhausted. Add three attributes to global BDB config: "nsslapd-db-locks-monitoring-enabled", "nsslapd-db-locks-monitoring-threshold" and "nsslapd-db-locks-monitoring-pause". By default, nsslapd-db-locks-monitoring-enabled is turned on, nsslapd-db-locks-monitoring-threshold is set to 90% and nsslapd-db-locks-monitoring-threshold is 500ms. When current locks are close to the maximum locks value of 90% - returning the next candidate will fail until the maximum of locks won't be increased or current locks are released. The monitoring thread runs with the configurable interval of 500ms. Add the setting to UI and CLI tools. Fixes: #4623 Reviewed by: @Firstyear, @tbordaz, @jchapma, @mreynolds389 (Thank you!!)

droideck · 2021-05-26T11:40:46Z

e05afab..a69c215 389-ds-base-1.4.3 -> 389-ds-base-1.4.3
50606d8..bba519c 389-ds-base-1.4.4 -> 389-ds-base-1.4.4

droideck · 2021-06-23T08:00:45Z

Related issue: #4803

…locks ) Description: Added additional tests for DB locks monitoring to check if invalid values are correctly rejected for nsslapd-db-locks and nsslapd-db-locks-monitoring-threshold. Relates: 389ds#4623 Reviewed by: droideck (Thanks!)

…locks ) Description: Added additional tests for DB locks monitoring to check if invalid values are correctly rejected for nsslapd-db-locks and nsslapd-db-locks-monitoring-threshold. Relates: #4623 Reviewed by: droideck (Thanks!)

droideck added the needs triage The issue will be triaged during scrum label Feb 15, 2021

mreynolds389 removed the needs triage The issue will be triaged during scrum label Feb 18, 2021

mreynolds389 added this to the 1.4.3 milestone Feb 18, 2021

tbordaz added priority_high need urgent fix / highly valuable / easy to fix In JIRA ticket is in JIRA labels Mar 25, 2021

droideck self-assigned this Apr 9, 2021

droideck mentioned this issue May 7, 2021

Issue 4623 - RFE - Monitor the current DB locks #4762

Merged

droideck closed this as completed in #4762 May 20, 2021

bsimonova mentioned this issue Aug 2, 2021

Issue 4623 - RFE - Monitor the current DB locks ( nsslapd-db-current-… #4852

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

RFE - Monitor the current DB locks ( nsslapd-db-current-locks ) #4623

RFE - Monitor the current DB locks ( nsslapd-db-current-locks ) #4623

droideck commented Feb 15, 2021

droideck commented Apr 9, 2021

droideck commented May 26, 2021

droideck commented Jun 23, 2021

RFE - Monitor the current DB locks ( nsslapd-db-current-locks ) #4623

RFE - Monitor the current DB locks ( nsslapd-db-current-locks ) #4623

Comments

droideck commented Feb 15, 2021

droideck commented Apr 9, 2021

droideck commented May 26, 2021

droideck commented Jun 23, 2021