Prevent deadlock when notifying shard of ledger #3683

miguelportilla · 2020-12-02T22:52:13Z

High Level Overview of Change

This change is primarily about preventing a potential thread deadlock when notifying shards on ledger completions.

Context of Change

Shards are notified by the inbound ledger process when ledgers belonging to them have been acquired. Upon notification, the shard's SQLite database is populated with the ledger data. If the ledger data resides in the same shard, a deadlock is possible when shard fetch calls are made while storing to the SQLite database.
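For illustration only, here is a minimal sketch of the hazard described above and one common way to avoid it. `ToyShard`, `put`, `fetch`, `storeLedger`, and `storedCount` are hypothetical names invented for this example and are not the rippled `Shard` API; the assumption is simply that the store path needs to fetch from the same shard, so it must not hold the mutex that the fetch path acquires.

```cpp
#include <cassert>
#include <cstddef>
#include <map>
#include <mutex>
#include <string>
#include <utility>
#include <vector>

// Hypothetical toy class; it only mimics the locking shape of the problem.
class ToyShard
{
public:
    void put(int key, std::string value)
    {
        std::lock_guard<std::mutex> lock(mutex_);
        nodes_[key] = std::move(value);
    }

    // Fetch path: acquires mutex_.
    bool fetch(int key, std::string& out) const
    {
        std::lock_guard<std::mutex> lock(mutex_);
        auto const it = nodes_.find(key);
        if (it == nodes_.end())
            return false;
        out = it->second;
        return true;
    }

    // Store path (stands in for populating SQLite). It serializes writers
    // with storedMutex_ only. If it locked mutex_ here instead, the nested
    // fetch() call would try to lock the same non-recursive mutex again.
    bool storeLedger(int key)
    {
        std::lock_guard<std::mutex> lock(storedMutex_);
        std::string node;
        if (!fetch(key, node))
            return false;
        stored_.push_back(node);
        return true;
    }

    std::size_t storedCount() const
    {
        std::lock_guard<std::mutex> lock(storedMutex_);
        return stored_.size();
    }

private:
    mutable std::mutex mutex_;        // guards nodes_
    mutable std::mutex storedMutex_;  // guards stored_
    std::map<int, std::string> nodes_;
    std::vector<std::string> stored_;
};
```

In this sketch the only nested acquisition is `storedMutex_` followed by `mutex_`, and no path takes them in the opposite order, so the store and fetch paths cannot block each other in a cycle.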

Type of Change

Bug fix (non-breaking change which fixes an issue)
Refactor (non-breaking change that only restructures code)

seelabs

Looks good. I had one question about who closes the backend_ in one case, and left two nits. Other than that, nice solution to the deadlock issue!

src/ripple/nodestore/impl/Shard.h

src/ripple/nodestore/impl/Shard.cpp

seelabs · 2020-12-03T18:59:26Z

src/ripple/nodestore/impl/Shard.cpp

-    {
-        JLOG(j_.error()) << "shard " << index_
-                         << " missing acquire SQLite database";
+    auto const scopedCount{makeBackendCount()};


On line 153 of this file (not in this changeset), we check the backendCount_ and exit early if in use. We also lock mutex_ but not storedMutex_ there. The old code would have closed the backend_ but the new code may not.

Who closes backend_ in such a case? Is it DatabaseShardImp::sweep()? If nobody does, an easy fix is to lock storedMutex_ there as well. If it's not an issue, then it's of course fine as-is.

miguelportilla · 2020-12-08

Yes, either DatabaseShardImp::sweep() or the shard's destructor will close it. I don't see an issue with tryClose and storedMutex_ as is, but maybe I missed something you saw.
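As a rough sketch of the pattern under discussion (a scoped in-use count that makes a close attempt exit early), the following shows one way such a guard could look. `ScopedCount` and `ToyBackendOwner` are hypothetical names for this example; only the identifiers `makeBackendCount`, `tryClose`, `backendCount_`, and `mutex_` echo the thread above, and their behavior here is an assumption, not the actual rippled implementation.

```cpp
#include <atomic>
#include <cassert>
#include <mutex>

// Hypothetical RAII guard: increments a counter on construction and
// decrements it on destruction. A null counter means "no backend available".
class ScopedCount
{
public:
    explicit ScopedCount(std::atomic<int>* counter) : counter_(counter)
    {
        if (counter_)
            ++(*counter_);
    }

    ScopedCount(ScopedCount&& other) noexcept : counter_(other.counter_)
    {
        other.counter_ = nullptr;
    }

    ScopedCount(ScopedCount const&) = delete;
    ScopedCount& operator=(ScopedCount const&) = delete;
    ScopedCount& operator=(ScopedCount&&) = delete;

    ~ScopedCount()
    {
        if (counter_)
            --(*counter_);
    }

    explicit operator bool() const
    {
        return counter_ != nullptr;
    }

private:
    std::atomic<int>* counter_;
};

// Hypothetical owner of a backend; assumed behavior, for illustration.
class ToyBackendOwner
{
public:
    // Returns a live guard while the backend is open, else an empty guard.
    ScopedCount makeBackendCount()
    {
        std::lock_guard<std::mutex> lock(mutex_);
        if (!open_)
            return ScopedCount(nullptr);
        return ScopedCount(&backendCount_);
    }

    // Exits early (returns false) while any guard is alive.
    bool tryClose()
    {
        std::lock_guard<std::mutex> lock(mutex_);
        if (backendCount_ > 0)
            return false;
        open_ = false;
        return true;
    }

private:
    std::mutex mutex_;
    std::atomic<int> backendCount_{0};
    bool open_{true};
};
```

With this shape, a caller that fails to close because the backend is in use has to rely on a later attempt (in the thread above, `DatabaseShardImp::sweep()` or the shard destructor) to close it once the count drops to zero.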

ghost

I compiled these changes together with the Deterministic shards PR and ran it on mainnet. It is successfully collecting shards.

seelabs

👍

miguelportilla requested a review from seelabs December 2, 2020 22:54

miguelportilla assigned seelabs Dec 2, 2020

miguelportilla requested a review from a user December 2, 2020 22:54

miguelportilla assigned ghost Dec 2, 2020

miguelportilla requested a review from undertome December 2, 2020 22:55

miguelportilla assigned undertome Dec 2, 2020

miguelportilla force-pushed the fix_deadlock branch from 2aeed84 to 52c7e94 Compare December 2, 2020 23:00

ghost mentioned this pull request Dec 2, 2020

Deterministic shards, ver 2.0 #3595

Closed

seelabs reviewed Dec 3, 2020

ghost approved these changes Dec 4, 2020

undertome approved these changes Dec 8, 2020

miguelportilla force-pushed the fix_deadlock branch from 79839d4 to 561e437 Compare December 16, 2020 18:21

seelabs reviewed Dec 16, 2020

miguelportilla added the Passed label (passed code review & PR owner thinks it's ready to merge; perf sign-off may still be required) Dec 16, 2020

miguelportilla requested a review from seelabs December 16, 2020 18:57

Prevent deadlock in storeSQLite

c0f64d9

miguelportilla force-pushed the fix_deadlock branch from 561e437 to c0f64d9 Compare December 27, 2020 14:16

nbougalis mentioned this pull request Jan 5, 2021

Proposed 1.7.0-b10 #3720

Merged

nbougalis closed this in #3720 Jan 11, 2021

miguelportilla deleted the fix_deadlock branch January 28, 2021 12:48

miguelportilla Dec 8, 2020
