ImmutableDB: allow to lookup blocks by slot, leverage in db-{analyser,truncater} #1143

amesgen · 2024-06-13T11:52:08Z

This PR adds a new function to the internal ImmutableDB API:

-- | Get the hash of the block in the given slot. If the slot contains both
-- an EBB and a non-EBB, return the hash of the non-EBB.
getHashForSlot :: SlotNo -> m (Maybe (HeaderHash blk))

It is then used in two ways:

In db-analyser: Many analyses do not actually need a ledger state to perform their work. This PR adds support for avoiding this redundant and long work on startup. However, the existing ImmutableDB streaming API needs a point (previously read from the ledger state), not just a slot to start streaming. This is where getHashForSlot comes in.
In db-truncater: Previously, truncating after a slot took linear time (by iterating over the entire database). With getHashForSlot, it is easy to now change it to take constant time. Note that the behavior changes slightly, see the commit message for details.

ouroboros-consensus-cardano/src/unstable-cardano-tools/Cardano/Tools/DBAnalyser/Analysis.hs

See later commits for how this is useful in db-{analyser,truncater}.

This also slightly changes the semantics: Previously, it would truncate to the block with the largest slot that is smaller or equal to the slot argument. Now, it will only truncate to the (non-EBB) block in exactly that slot, and fail otherwise. In fact, the new behavior more closely corresponds to the CLI description: --truncate-after-slot SLOT_NUMBER The slot number of the intended new tip of the chain after truncation

…,truncater} (#1143) This PR adds a new function to the internal ImmutableDB API: ```haskell -- | Get the hash of the block in the given slot. If the slot contains both -- an EBB and a non-EBB, return the hash of the non-EBB. getHashForSlot :: SlotNo -> m (Maybe (HeaderHash blk)) ``` It is then used in two ways: - In db-analyser: Many analyses do not actually need a ledger state to perform their work. This PR adds support for avoiding this redundant and long work on startup. However, the existing ImmutableDB streaming API needs a point (previously read from the ledger state), not just a slot to start streaming. This is where `getHashForSlot` comes in. - In db-truncater: Previously, truncating after a slot took linear time (by iterating over the entire database). With `getHashForSlot`, it is easy to now change it to take constant time. Note that the behavior changes slightly, see the commit message for details.

Closes #1202 This PR reverts the behavioral change of #1143, specifically 5747d3c. Concretely, `--truncate-after-slot slotNo` will now remove all blocks with a slot number higher than `slotNo` in the ImmutableDB, but does not require that a block with exactly that slot number exists. This is convenient eg for truncating all blocks after an epoch without having to find out the exact slot of the last block in the epoch just before. At the same time, the run time is still much faster than before #1143: We iteratively check all slot numbers descending from the given one, and truncate to the first point that is in the ImmutableDB. As realistic ImmutableDBs are only somewhat sparse (active slot coefficient is `f = 1/20`), this should be very fast (ie still constant time in the length of the chain if we consider the slot distance between any two adjacent blocks to be bounded). In addition, we explicitly check whether the given argument is beyond the tip of the ImmutableDB, and immediately exit (successfully) in that case.

amesgen force-pushed the amesgen/db-immutaliser branch 2 times, most recently from e01abc7 to 9939044 Compare June 13, 2024 13:53

amesgen force-pushed the amesgen/immdb-slots branch from 42e0000 to 10fb20c Compare June 13, 2024 13:59

dnadales assigned amesgen Jun 13, 2024

dnadales approved these changes Jun 14, 2024

View reviewed changes

ouroboros-consensus-cardano/src/unstable-cardano-tools/Cardano/Tools/DBAnalyser/Analysis.hs Outdated Show resolved Hide resolved

amesgen force-pushed the amesgen/db-immutaliser branch from 9939044 to 443e350 Compare June 14, 2024 12:52

amesgen force-pushed the amesgen/immdb-slots branch from 10fb20c to 42c61f1 Compare June 14, 2024 12:52

amesgen marked this pull request as ready for review June 14, 2024 12:56

amesgen requested a review from a team as a code owner June 14, 2024 12:57

Base automatically changed from amesgen/db-immutaliser to main June 17, 2024 14:32

ImmutableDB: add internal getHashForSlot

00d0001

See later commits for how this is useful in db-{analyser,truncater}.

amesgen force-pushed the amesgen/immdb-slots branch from 42c61f1 to 7877e43 Compare June 17, 2024 14:44

amesgen enabled auto-merge June 17, 2024 14:45

amesgen added 3 commits June 17, 2024 16:51

db-analyser: allow to run without a ledger state if possible

d36b5ab

ImmutableDB.Tip: add headerToTip

f445bac

amesgen force-pushed the amesgen/immdb-slots branch from 7877e43 to 5747d3c Compare June 17, 2024 14:51

amesgen added this pull request to the merge queue Jun 17, 2024

Merged via the queue into main with commit 4e3ff22 Jun 17, 2024
16 checks passed

amesgen deleted the amesgen/immdb-slots branch June 17, 2024 22:42

This was referenced Aug 1, 2024

[BUG] - "Unable to find a truncate point" with slot based truncation #1202

Closed

db-truncater: make --truncate-after-slot more lenient again #1203

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ImmutableDB: allow to lookup blocks by slot, leverage in db-{analyser,truncater} #1143

ImmutableDB: allow to lookup blocks by slot, leverage in db-{analyser,truncater} #1143

amesgen commented Jun 13, 2024 •

edited

Loading

ImmutableDB: allow to lookup blocks by slot, leverage in db-{analyser,truncater} #1143

ImmutableDB: allow to lookup blocks by slot, leverage in db-{analyser,truncater} #1143

Conversation

amesgen commented Jun 13, 2024 • edited Loading

amesgen commented Jun 13, 2024 •

edited

Loading