UTxO-HD Ledger DB lock tweaks #43

jasagredo · 2023-02-23T16:20:59Z

After today's meeting, several next steps came up to reduce the locking of the Ledger DB in the UTxO-HD implementation.

The reason for the lock

The LedgerDB lock is introduced to make sure that whenever we acquire a DbChangelog and read the BackingStore, these two values are in agreement, in particular that the anchor of the DbChangelog is indeed the slot that was flushed to the BackingStore.

LMDB provides a consistent view of the database for as long as a read transaction is kept open. We can therefore use this feature to "acquire" a transaction and then release the DB lock.
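
As a rough illustration of the idea, the sketch below holds the lock only long enough to capture a matching pair of changelog and store view, then hands both back to the caller. Everything in it is a hypothetical stand-in (`Slot`, `Changelog`, `ValueHandle`, `LedgerDB`, `acquireView`); it models the lock as a plain `MVar` mutex and the LMDB read transaction as a copied slot number, and is not the actual ouroboros-consensus API.

```haskell
import Control.Concurrent.MVar (MVar, newMVar, withMVar)
import Data.IORef (IORef, newIORef, readIORef)

-- Hypothetical stand-ins, not the ouroboros-consensus types.
newtype Slot = Slot Int deriving (Eq, Show)

-- In-memory changelog: only its anchor matters for this sketch.
newtype Changelog = Changelog { anchor :: Slot }

-- A frozen view of the on-disk store, playing the role of an
-- open LMDB read transaction.
newtype ValueHandle = ValueHandle { handleSlot :: Slot }

data LedgerDB = LedgerDB
  { dbLock      :: MVar ()         -- plain mutex standing in for the real lock
  , dbChangelog :: IORef Changelog -- current in-memory changelog
  , dbBacking   :: IORef Slot      -- slot last flushed to disk
  }

-- Build a database whose changelog anchor matches the flushed slot.
newLedgerDB :: Slot -> IO LedgerDB
newLedgerDB s =
  LedgerDB <$> newMVar () <*> newIORef (Changelog s) <*> newIORef s

-- Hold the lock only while capturing the pair; the caller then
-- works on the returned values without holding the lock.
acquireView :: LedgerDB -> IO (Changelog, ValueHandle)
acquireView db = withMVar (dbLock db) $ \_ -> do
  cl <- readIORef (dbChangelog db)
  vh <- ValueHandle <$> readIORef (dbBacking db)
  pure (cl, vh)

-- The invariant the lock protects: the changelog anchor is the
-- slot that the store view reflects.
consistent :: (Changelog, ValueHandle) -> Bool
consistent (cl, vh) = anchor cl == handleSlot vh
```

Because a flush would have to take the same lock to move both the anchor and the flushed slot, a pair captured this way stays consistent even if a flush happens right after the lock is released.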

Places that need to change

Flushing: Right now, flushing differences in caught-up mode only happens when taking snapshots. We are therefore waiting ~72 minutes between flushes on mainnet, which is too long. Flushing should happen regularly, perhaps every 100 blocks or so (make it configurable). Do this either on a background thread (maybe even the copyAndSnapshotRunner can sometimes flush?) or synchronously with the logic that advances the chain.
Flushing will be the only moment at which a write lock is acquired; taking a snapshot will no longer need one, since it only has to read the DB.
In the places where we currently acquire the read lock and do the processing while holding it, we must instead acquire the read lock, capture the relevant state of the mutable variables (i.e. read the DbChangelog TVar and open a read transaction to the DB), release the lock, and then do the processing without holding it.
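
The flushing rule described above could be expressed as a small pure policy function. This is only a sketch under assumptions: `FlushInterval`, `defaultFlushInterval`, `shouldFlush` and `step` are hypothetical names, and the default of 100 simply echoes the "every 100 blocks or so" suggestion.

```haskell
import Data.Word (Word64)

-- Hypothetical configuration value: number of blocks between flushes.
newtype FlushInterval = FlushInterval Word64

-- Assumed default, taken from the "every 100 blocks or so" suggestion.
defaultFlushInterval :: FlushInterval
defaultFlushInterval = FlushInterval 100

-- Flush once at least n blocks have accumulated since the last flush.
shouldFlush :: FlushInterval -> Word64 -> Bool
shouldFlush (FlushInterval n) blocksSinceFlush = blocksSinceFlush >= n

-- Account for one newly applied block. Returns the updated counter
-- and whether the caller should now take the write lock and flush.
step :: FlushInterval -> Word64 -> (Word64, Bool)
step policy pending
  | shouldFlush policy (pending + 1) = (0, True)
  | otherwise                        = (pending + 1, False)
```

Keeping the decision pure means it can be called either from a background thread or synchronously from the chain-advancing logic, with only the flush itself needing the write lock.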

Done?

This is done when, after implementing the changes above, the system-level benchmarks show a reasonable amount of locking (perhaps even minimal if things go according to plan).

The plan is to start this once the cleanup branch of UTxO-HD is complete.

jasagredo · 2023-05-08T15:13:11Z

The code has been ported, but I'm finding issues in many test-suites. Perhaps I introduced a logic bug.

In any case, since the BackingStore is accessed in several places, I think we should document somewhere where this happens.

# Description

Rework the locking logic for the ledger DB RAW lock. There are mainly 4 places where locking happens:

- background thread that flushes regularly: uses a **write** lock while writing the differences only.
- background thread that creates snapshots: holds a **read** lock for the duration of the snapshot
- forging loop: **quick read** locking to acquire a ledger db and a value handle to get a snapshot
- queries: **quick read** locking to acquire a ledger db and a value handle

This should reduce locking issues.

There are also some side effects in the PR:

- the forging loop is again a `WithEarlyExit` block
- getting a snapshot can no longer fail as you provide the chlog and the value handle
- new policy function `onDiskShouldFlush`

Closes #43

jasagredo self-assigned this Feb 23, 2023

This was referenced Mar 6, 2023

Implement UTxO-HD in ouroboros-consensus IntersectMBO/ouroboros-network#4344

Merged

Remaining UTxO-HD tasks for v0.1-ready #127

Closed

jasagredo mentioned this issue Mar 30, 2023

LedgerDB lock improvements IntersectMBO/ouroboros-network#4481

Closed

11 tasks

jasagredo linked a pull request Mar 30, 2023 that will close this issue

LedgerDB lock improvements IntersectMBO/ouroboros-network#4481

Closed

11 tasks

This was referenced Apr 3, 2023

UTxO-HD prototype benchmarking IntersectMBO/ouroboros-network#4210

Closed

Redesign work due to not acceptable system level benchmarks IntersectMBO/ouroboros-network#4487

Closed

dnadales mentioned this issue Apr 26, 2023

Expose flushing frequence for the LedgerDB as a configuration parameter #98

Closed

jasagredo transferred this issue from IntersectMBO/ouroboros-network Apr 26, 2023

jasagredo linked a pull request May 10, 2023 that will close this issue

Ledger DB lock improvements #74

Merged

dnadales moved this from 🏗 In progress to 👀 In review in Consensus Team Backlog May 16, 2023

This was referenced May 29, 2023

Expose batch size for table traversing queries as a configuration parameter #97

Closed

Tracers: Determining UTxO set size in the UTxO HD era #57

Closed

jasagredo closed this as completed Jun 2, 2023

github-project-automation bot moved this from 👀 In review to ✅ Done in Consensus Team Backlog Jun 2, 2023

jorisdral added the UTxO-HD label Jun 12, 2023
