Align parquet file rotation with cache chunk boundaries by jewei1997 · Pull Request #3280 · sei-protocol/sei-chain

jewei1997 · 2026-04-21T02:44:21Z

Describe your changes and provide context

Align cache + parquet rotation to block % MaxBlocksPerFile == 0. Coverage is now derived from LatestVersion ([floor(latest/interval)*interval, latest]).
Fix: empty blocks short-circuited SetReceipts, so rotation never fired when the boundary block had zero receipts. Oversized files then got pruned from eth_getLogs queries.
parquet.Store.ObserveEmptyBlock now rotates on empty-boundary blocks.
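
As a rough illustration of the coverage formula above, the window can be computed from the latest version and the rotation interval alone. This is a minimal sketch, not the code in this PR: the function name `coverageWindow` is borrowed from the review discussion, but its signature here and the interval of 500 are assumptions made for the example.

```go
package main

import "fmt"

// coverageWindow returns the inclusive block range [from, to] implied by the
// latest committed version and the rotation interval (hypothetical signature).
// ok is false when the interval is zero or nothing has been committed yet.
func coverageWindow(latest, interval uint64) (from, to uint64, ok bool) {
	if interval == 0 || latest == 0 {
		return 0, 0, false
	}
	// floor(latest/interval)*interval via integer division.
	from = (latest / interval) * interval
	return from, latest, true
}

func main() {
	// 500 is an assumed interval, chosen to match the examples in the review.
	for _, latest := range []uint64{499, 500, 1234} {
		from, to, ok := coverageWindow(latest, 500)
		fmt.Println(latest, from, to, ok)
	}
}
```

With an interval of 500, a latest version of 1234 yields the window [1000, 1234], and a boundary block such as 500 yields the single-block window [500, 500].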

Testing performed to validate your change

TestFilterLogsSurvivesEmptyRotationBoundary / TestFilterLogsSurvivesBoundaryThatCrossesFileWidth: regression tests for the empty-boundary bug.
TestCachedReceiptStoreMergesDuckDBAndCache{AcrossBoundary,ReceiptsAcrossBoundary}: exercise the cache+parquet merge path for logs and receipts.
Tested on a node and ran RPC queries against it.

github-actions · 2026-04-21T02:45:20Z

The latest Buf updates on your PR. Results from workflow Buf / buf (pull_request).

Build	Format	Lint	Breaking	Updated (UTC)
`✅ passed`	`✅ passed`	`✅ passed`	`✅ passed`	Apr 24, 2026, 7:37 PM

codecov · 2026-04-21T02:47:48Z

Codecov Report

❌ Patch coverage is 75.45455% with 27 lines in your changes missing coverage. Please review.
✅ Project coverage is 59.21%. Comparing base (e9d506b) to head (eae213d).
⚠️ Report is 1 commits behind head on main.

Files with missing lines	Patch %	Lines
sei-db/ledger_db/parquet/store.go	78.46%	8 Missing and 6 partials ⚠️
sei-db/ledger_db/receipt/cached_receipt_store.go	75.00%	5 Missing and 3 partials ⚠️
sei-db/ledger_db/receipt/parquet_store.go	28.57%	3 Missing and 2 partials ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##             main    #3280      +/-   ##
==========================================
+ Coverage   59.19%   59.21%   +0.01%     
==========================================
  Files        2091     2091              
  Lines      171432   171462      +30     
==========================================
+ Hits       101481   101526      +45     
+ Misses      61152    61129      -23     
- Partials     8799     8807       +8

Flag	Coverage Δ
sei-chain-pr	`75.75% <75.45%> (?)`
sei-db	`70.41% <ø> (ø)`

Flags with carried forward coverage won't be shown.

Files with missing lines	Coverage Δ
sei-db/config/receipt_config.go	`79.16% <ø> (ø)`
sei-db/ledger_db/parquet/reader.go	`81.73% <100.00%> (+1.49%)`	⬆️
sei-db/ledger_db/receipt/parquet_store.go	`71.26% <28.57%> (+1.55%)`	⬆️
sei-db/ledger_db/receipt/cached_receipt_store.go	`88.82% <75.00%> (+2.85%)`	⬆️
sei-db/ledger_db/parquet/store.go	`69.68% <78.46%> (+1.25%)`	⬆️

... and 22 files with indirect coverage changes

cody-littley

I asked an LLM to do an audit and it provided the following findings. Please address each of these, or comment on why you think the findings are incorrect.

1. Race window: cache claims coverage of a block before its receipts are inserted (HIGH)

In cachedReceiptStore.SetReceipts (sei-db/ledger_db/receipt/cached_receipt_store.go):

func (s *cachedReceiptStore) SetReceipts(ctx sdk.Context, receipts []ReceiptRecord) error {
    if err := s.backend.SetReceipts(ctx, receipts); err != nil {
        return err
    }
    if ctx.BlockHeight() > 0 {
        s.cacheMu.Lock()
        s.maybeRotateCacheLocked(uint64(ctx.BlockHeight()))
        s.cacheMu.Unlock()
    }
    s.cacheReceipts(receipts)
    return nil
}

For a boundary block N (e.g., 500, 1000, ...):

1. backend.SetReceipts updates backend.LatestVersion = N.
2. The outer maybeRotateCacheLocked(N) rotates the in-memory ledgerCache (current chunk swaps to "previous", new current chunk is empty) and advances cacheNextRotate.
3. cacheMu is released.
4. cacheReceipts(receipts) re-acquires cacheMu and finally inserts block N's receipts/logs into the new current chunk.

Between steps 3 and 4, a concurrent FilterLogs call sees:

cache.FilterLogsWithMinBlock(...) → empty (nothing in current chunk for block N yet, prev chunk lost block N because rotation already happened).
coverageWindow() → reads cacheNextRotate (already advanced) and backend.LatestVersion = N, so it returns from = floor(N/interval)*interval, to = N, hasCoverage = true.
The early-return hasCoverage && fromBlock >= coveredFrom && toBlock <= coveredTo fires for any query inside [N, N] (or fully inside the new chunk's range), and the function returns empty cacheLogs without ever consulting the backend.

The on-disk parquet file for block N is also still open (not in closedReceiptFiles), so the backend wouldn't have it either. Net effect: eth_getLogs for the just-committed block N can transiently return empty when it should return the block's logs.

The previous logic computed cacheMin from the actual cache contents under logMu, so it could not claim coverage of a block that wasn't in the cache snapshot. The new boundary-aligned, version-driven coverage window widens the claim and opens this TOCTOU.

Fixes worth considering:

Drop the outer maybeRotateCacheLocked and let cacheReceipts (which already calls it under cacheMu) do the rotation atomically with the insertion. For empty receipts, special-case rotation inside cacheReceipts instead of doing it outside.
Or hold cacheMu across both the rotate and the insert (move cacheReceipts's body inline under the same lock).
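
To make the second option concrete, here is a minimal sketch of rotating and inserting under a single lock acquisition, so a reader can never observe the rotated-but-empty state. The `chunkCache` type, its fields, and its methods are hypothetical stand-ins and do not mirror the real ledgerCache.

```go
package main

import (
	"fmt"
	"sync"
)

// chunkCache is a hypothetical two-chunk cache used only for illustration.
type chunkCache struct {
	mu         sync.Mutex
	interval   uint64
	nextRotate uint64
	prev, cur  map[uint64][]string // block height -> log payloads
}

func newChunkCache(interval uint64) *chunkCache {
	return &chunkCache{
		interval: interval,
		prev:     map[uint64][]string{},
		cur:      map[uint64][]string{},
	}
}

// observe rotates (when height reaches the next boundary) and inserts the
// block's logs while holding mu the whole time. Empty blocks still rotate.
func (c *chunkCache) observe(height uint64, logs []string) {
	c.mu.Lock()
	defer c.mu.Unlock()
	if c.nextRotate == 0 {
		c.nextRotate = (height/c.interval + 1) * c.interval
	}
	if height >= c.nextRotate {
		c.prev, c.cur = c.cur, map[uint64][]string{}
		c.nextRotate = (height/c.interval + 1) * c.interval
	}
	if len(logs) > 0 {
		c.cur[height] = logs
	}
}

// logsAt looks in the current chunk first, then in the previous one.
func (c *chunkCache) logsAt(height uint64) []string {
	c.mu.Lock()
	defer c.mu.Unlock()
	if l, ok := c.cur[height]; ok {
		return l
	}
	return c.prev[height]
}

func main() {
	c := newChunkCache(500)
	c.observe(499, []string{"log@499"})
	c.observe(500, []string{"log@500"}) // boundary: rotate and insert together
	fmt.Println(c.logsAt(499), c.logsAt(500))
}
```

Because readers take the same mutex, they see either the pre-rotation state or the post-insert state, never the gap between the two.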

2. Race window: empty-block coverage after a cold reopen with no warmup (MEDIUM)

The coverageWindow doc comment promises:

Coverage only applies once the cache has observed at least one write, so a cold-reopen where WAL replay produced no warmup records reports no coverage and lets FilterLogs fall through to the backend.

But the new SetReceipts empty-block path defeats that:

if ctx.BlockHeight() > 0 {
    s.cacheMu.Lock()
    s.maybeRotateCacheLocked(uint64(ctx.BlockHeight()))
    s.cacheMu.Unlock()
}

After a cold reopen where WAL replay produced no warmup records, the first committed block — even if empty, even if mid-window — sets cacheNextRotate away from 0, immediately enabling coverageWindow to claim coverage of [floor(latest/interval)*interval, latest].

In most realistic chains the block at latest is the just-committed empty block (so claiming "no logs there" is correct), and historical blocks below floor(latest/interval)*interval correctly fall through to the backend. So this isn't catastrophically broken — but it does invalidate the documented invariant and is a foot-gun if the assumption ever drifts (e.g., if a later change starts using coveredFrom < latest to short-circuit older blocks).

Same fix as #1 (maybeRotateCacheLocked should also gate on receipts being non-empty, or coverage must be derived from actual cache contents).

3. New crash window in `WriteReceipts` at rotation boundaries (LOW)

WriteReceipts now rotates before writing the boundary block's WAL entry:

for _, b := range batches {
    if s.receiptWriter != nil && b.blockNumber != s.lastSeenBlock && s.IsRotationBoundary(b.blockNumber) {
        if err := s.rotateFileLocked(b.blockNumber); err != nil {
            return err
        }
    }

    entry := WALEntry{
        BlockNumber: b.blockNumber,
        Receipts:    b.receipts,
    }
    if err := s.wal.Write(entry); err != nil {
        return err
    }
    ...

rotateFileLocked flushes, closes the file, clears the WAL, then returns. A crash between ClearWAL and s.wal.Write(entry) loses the boundary block's WAL entry; on restart, the boundary block must be re-applied by Cosmos (it's no longer in the WAL nor in the just-rotated file).

The crash test was updated to assert exactly this: TestCrashRecoveryAtEachHookPoint now asserts that for needsRotation hooks the boundary block is not recovered and the caller must retry.

Note from Cody: are we expecting the outer context to replay something here in order to recover? Does it currently do that, or is that something we'd have to implement? In general, I think it's better to make this data store fully crash durable in order to simplify the mental model needed to interact with the store.

4. File-name misalignment after lazy init at a non-boundary block (LOW)

When the first receipt after a fresh start lands at, say, block 1234, applyReceiptLocked lazily initializes with fileStartBlock = 1234. The next rotation fires at 1500, producing receipts_1234.parquet (containing 1234–1499) instead of the expected receipts_1000.parquet. The reader's fileForBlock and GetFilesBeforeBlock both compute file coverage as [startBlock, startBlock+maxBlocksPerFile), so:

Pruning is delayed by up to (maxBlocksPerFile − offset) blocks (file "looks like" it can hold up to 1733).
Targeted lookups for block 1500 may pick receipts_1234.parquet first (since 1234 ≤ 1500), find nothing, and fall back to a full scan. Correct, but slower.

This is the same pattern that arises when an empty boundary block is observed before any writer exists (ObserveEmptyBlock updates lastSeenBlock without rotating, and the next non-boundary write lazy-inits at that height). Worth noting in operations docs and possibly fixed by snapping fileStartBlock to floor(input.BlockNumber/MaxBlocksPerFile) * MaxBlocksPerFile at lazy init.
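
The effect of snapping can be illustrated with a small sketch. The helper names `snapToBoundary` and `covers` are hypothetical, and a MaxBlocksPerFile of 500 is assumed to match the numbers in this finding; the coverage rule [startBlock, startBlock+maxBlocksPerFile) is the one described above.

```go
package main

import "fmt"

// snapToBoundary rounds a block height down to the nearest rotation boundary.
func snapToBoundary(block, maxBlocksPerFile uint64) uint64 {
	if maxBlocksPerFile == 0 {
		return block
	}
	return (block / maxBlocksPerFile) * maxBlocksPerFile
}

// covers reports whether a file starting at startBlock claims the given block
// under the rule [startBlock, startBlock+maxBlocksPerFile).
func covers(startBlock, maxBlocksPerFile, block uint64) bool {
	return block >= startBlock && block < startBlock+maxBlocksPerFile
}

func main() {
	const maxBlocks = 500 // assumed value for illustration

	unsnapped := uint64(1234)
	snapped := snapToBoundary(1234, maxBlocks)

	fmt.Printf("receipts_%d.parquet claims block 1500: %v\n",
		unsnapped, covers(unsnapped, maxBlocks, 1500))
	fmt.Printf("receipts_%d.parquet claims block 1500: %v\n",
		snapped, covers(snapped, maxBlocks, 1500))
}
```

With the unsnapped start the file appears to cover block 1500 even though rotation closed it at 1499; with the snapped start the claimed range ends where the file actually ends.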

5. `SetMaxBlocksPerFile` writes `Reader.maxBlocksPerFile` racily (LOW, test-only)

func (s *Store) SetMaxBlocksPerFile(n uint64) {
    s.mu.Lock()
    defer s.mu.Unlock()
    s.config.MaxBlocksPerFile = n
    if s.Reader != nil {
        s.Reader.maxBlocksPerFile = n
    }
}

Holding the store's s.mu does not synchronize with the Reader's reads of r.maxBlocksPerFile (which happen under r.mu at most). The doc-comment marks this as test-only ("Not safe to call while writes are in flight"), so it's acceptable, but if it ever leaks into production code it'll be a data race under -race.

6. `ObserveEmptyBlock` allows out-of-order updates of `lastSeenBlock` (LOW)

func (s *Store) ObserveEmptyBlock(height uint64) error {
    s.mu.Lock()
    defer s.mu.Unlock()

    if height == s.lastSeenBlock {
        return nil
    }
    if s.receiptWriter == nil || !s.IsRotationBoundary(height) {
        s.lastSeenBlock = height
        return nil
    }
    ...

If height < s.lastSeenBlock (out-of-order observation, e.g., a buggy caller or a test), lastSeenBlock moves backwards. The very next WriteReceipts could then mis-evaluate b.blockNumber != s.lastSeenBlock for a block already seen. Cosmos commits in order so this isn't expected in production, but a height > s.lastSeenBlock guard would make the contract explicit and cheap to enforce.

cody-littley · 2026-04-22T13:17:53Z

+
 	for _, b := range batches {
+		if s.receiptWriter != nil && b.blockNumber != s.lastSeenBlock && s.IsRotationBoundary(b.blockNumber) {
+			if err := s.rotateFileLocked(b.blockNumber); err != nil {


The godoc on rotateFileLocked() says the function is used during WAL replay, but this call is not being made during WAL replay.

cody-littley · 2026-04-22T14:23:00Z

The way the parquet receipt store does locking sometimes makes the thread safety and crash recovery safety a bit tricky to reason about. When code gets like this, race conditions sneak into the codebase, and it's a lot harder to be confident that you've found and fixed them all.

As a follow up task, perhaps we can discuss how we could alter the code structure to make these things simpler to reason about. (Not a request for change in this PR.)

cody-littley · 2026-04-24T16:28:03Z

Another batch of race conditions identified by LLM:

Audit: `fix-merge-results-log-cache`

Audit of the parquet receipt store and its cache layer. Core files reviewed:

sei-db/ledger_db/parquet/store.go, reader.go
sei-db/ledger_db/receipt/parquet_store.go, cached_receipt_store.go, receipt_cache.go

Crash tests, race tests, and coverage/rotation tests were read; hot paths traced for correctness under crash and under concurrent reads/writes.

Summary

#	Severity	Item
1	Critical	`FilterLogs` stale-read race at chain tip
2	High (latent)	Tx-hash index can desync after a rotation + crash
3	Medium	Non-monotonic `SetLatestVersion` on empty-block path
4	Low	`UpdateLatestVersion` uses non-atomic Load-then-Store

Item 1 is a live correctness bug that is already reachable in production under concurrent eth_getLogs load. Item 2 is a latent corruption-class bug that becomes reachable as soon as multi-block writes are introduced. Recommend blocking on item 1 before merging.

1. CRITICAL — Stale-read race in `cachedReceiptStore.FilterLogs` at the chain tip

File: sei-db/ledger_db/receipt/cached_receipt_store.go

SetReceipts updates state in two steps, in this order:

// sei-db/ledger_db/receipt/cached_receipt_store.go:111-117
func (s *cachedReceiptStore) SetReceipts(ctx sdk.Context, receipts []ReceiptRecord) error {
    if err := s.backend.SetReceipts(ctx, receipts); err != nil {
        return err
    }
    s.cacheReceipts(receipts, ctx.BlockHeight())
    return nil
}

backend.SetReceipts promotes backend.LatestVersion() to block N (via UpdateLatestVersion inside parquetReceiptStore.SetReceipts).
Only afterwards does cacheReceipts run AddReceiptsBatch / AddLogsForBlock on the cache.

FilterLogs uses coverageWindow to decide whether to skip the backend:

// sei-db/ledger_db/receipt/cached_receipt_store.go:128-145
func (s *cachedReceiptStore) coverageWindow() (uint64, uint64, bool) {
    ...
    latest := s.backend.LatestVersion()
    ...
    latestU := uint64(latest)
    from := (latestU / s.cacheRotateInterval) * s.cacheRotateInterval
    return from, latestU, true
}

// sei-db/ledger_db/receipt/cached_receipt_store.go:160-165
coveredFrom, coveredTo, hasCoverage := s.coverageWindow()
if hasCoverage && fromBlock >= coveredFrom && toBlock <= coveredTo {
    s.reportLogFilterCacheHit()
    sortLogs(cacheLogs)
    return cacheLogs, nil
}

Race sequence

Writer commits block N; reader polls for N concurrently.

1. Writer: backend.SetReceipts returns → backend.LatestVersion() == N.
2. Reader: FilterLogs(N, N, crit).
   - s.cache.FilterLogsWithMinBlock(N, N, crit) → empty (cache not yet populated).
   - coverageWindow() sees LatestVersion == N and a non-zero cacheNextRotate, computes coveredFrom = floor(N/interval)*interval, coveredTo = N.
   - Query [N, N] is declared fully covered → backend skipped → returns [].
3. Writer: cacheReceipts runs AddReceiptsBatch / AddLogsForBlock for block N.

Impact

eth_getLogs and filter subscriptions silently drop all logs for the newest block under load. Data is durable in the backend; the query path returns an incomplete (empty) answer. This is a correctness regression the existing crash tests cannot catch because TestSlowFlushWithConcurrentReads gates readers on a post-write committed counter.

Rotation makes it slightly worse: if block N is a rotation boundary, the rotate + reset of cacheNextRotate happens inside cacheReceipts but after backend.SetReceipts. During the window, coverageWindow with either the new or the old cacheNextRotate still computes a window ending at N while the cache is empty for N.

Fix options

Populate the cache first, then call backend.SetReceipts. Tears crash consistency in the other direction (cache exposes a block not yet in the backend), so it also needs the LatestVersion gate inverted.
Preferred: track the highest block the cache has observed (cachedLatest uint64 bumped under cacheMu after AddReceiptsBatch / AddLogsForBlock) and use min(cachedLatest, backend.LatestVersion()) in coverageWindow. Coverage only advances to N once the cache is guaranteed populated for N.
Failing that, require FilterLogs to also consult the backend for max(coveredTo-1, backendTo) instead of trusting coverage alone.
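
The preferred option (capping coverage at the highest block the cache has actually populated) could be sketched roughly as below. The `coverage` type and its methods are hypothetical; only the idea of taking the minimum of `cachedLatest` and the backend's latest version comes from the list above.

```go
package main

import (
	"fmt"
	"sync"
)

// coverage is a hypothetical helper tracking how far the cache is populated.
type coverage struct {
	mu           sync.Mutex
	interval     uint64
	cachedLatest uint64 // highest block fully inserted into the cache
}

// markCached is called after a block's receipts and logs are in the cache.
func (c *coverage) markCached(height uint64) {
	c.mu.Lock()
	defer c.mu.Unlock()
	if height > c.cachedLatest {
		c.cachedLatest = height
	}
}

// window caps the claimed range at min(cachedLatest, backendLatest), so the
// cache never claims a block it has not populated yet.
func (c *coverage) window(backendLatest uint64) (from, to uint64, ok bool) {
	c.mu.Lock()
	defer c.mu.Unlock()
	latest := c.cachedLatest
	if backendLatest < latest {
		latest = backendLatest
	}
	if c.interval == 0 || latest == 0 {
		return 0, 0, false
	}
	return (latest / c.interval) * c.interval, latest, true
}

func main() {
	c := &coverage{interval: 500}
	c.markCached(999)
	// Backend already reports 1000, but the cache has only seen 999.
	fmt.Println(c.window(1000))
	c.markCached(1000)
	fmt.Println(c.window(1000))
}
```

In the first call a query for [1000, 1000] falls outside the claimed window and would go to the backend; only after `markCached(1000)` does the window advance to include block 1000.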

Test gap

No existing test exercises this race. Add a test that spawns a reader polling FilterLogs(latest, latest, ...) concurrently with many single-block SetReceipts calls and asserts no logs are dropped.
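
One possible shape for such a test is sketched below against a toy store, not the real cachedReceiptStore. The `toyStore` type and `countDropped` helper are hypothetical; the toy writer publishes the latest height only after the block's logs are visible, which is the ordering the real store would need for the check to pass.

```go
package main

import (
	"fmt"
	"sync"
	"sync/atomic"
)

// toyStore is a hypothetical stand-in: one log count per block height.
type toyStore struct {
	mu     sync.RWMutex
	logs   map[uint64]int
	latest atomic.Uint64
}

// setReceipts makes the block's logs visible first, then publishes the height.
func (s *toyStore) setReceipts(height uint64, n int) {
	s.mu.Lock()
	s.logs[height] = n
	s.mu.Unlock()
	s.latest.Store(height)
}

func (s *toyStore) filterLogs(height uint64) int {
	s.mu.RLock()
	defer s.mu.RUnlock()
	return s.logs[height]
}

// countDropped polls the latest block from a reader goroutine while the writer
// commits blocks 1..blocks, counting reads that saw a published block with no logs.
func countDropped(s *toyStore, blocks uint64) int {
	var dropped atomic.Int64
	done := make(chan struct{})
	var wg sync.WaitGroup
	wg.Add(1)
	go func() {
		defer wg.Done()
		for {
			select {
			case <-done:
				return
			default:
			}
			if h := s.latest.Load(); h > 0 && s.filterLogs(h) == 0 {
				dropped.Add(1)
			}
		}
	}()
	for h := uint64(1); h <= blocks; h++ {
		s.setReceipts(h, 1) // every block carries one log
	}
	close(done)
	wg.Wait()
	return int(dropped.Load())
}

func main() {
	s := &toyStore{logs: map[uint64]int{}}
	fmt.Println("dropped reads:", countDropped(s, 10000))
}
```

Swapping the two steps in `setReceipts` reproduces the reported ordering, in which case the reader may (nondeterministically) observe dropped logs.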

2. HIGH (latent) — Tx-hash index can desync from the parquet store across rotations

File: sei-db/ledger_db/receipt/parquet_store.go

Write order is:

// sei-db/ledger_db/receipt/parquet_store.go:251-266
if err := s.store.WriteReceipts(inputs); err != nil {
    return err
}

if s.txHashIndex != nil {
    if err := s.indexReceiptInputs(inputs); err != nil {
        return fmt.Errorf("tx hash index write failed: %w", err)
    }
}

if maxBlock > 0 {
    s.store.UpdateLatestVersion(int64(maxBlock))
}

WriteReceipts may rotate when the batch contains a boundary block. Rotation calls ClearWAL, which preserves only the last WAL entry:

// sei-db/ledger_db/parquet/store.go:437-462
// ClearWAL truncates the WAL after rotation, preserving the last entry.
func (s *Store) ClearWAL() error {
    ...
    if err := s.wal.TruncateBefore(lastOffset); err != nil {

And replayWAL drops WAL entries whose block is below fileStartBlock:

// sei-db/ledger_db/receipt/parquet_store.go:389-393
blockNumber := entry.BlockNumber
if blockNumber < s.store.FileStartBlock() {
    dropOffset = offset
    return nil
}

Crash scenario

Single SetReceipts whose inputs span more than one block and cross a rotation boundary:

WriteReceipts writes all blocks, rotates at the boundary, flushes pre-boundary blocks to the closed parquet file, truncates WAL keeping only the boundary block.
indexReceiptInputs starts iterating batches and crashes after indexing block k but before block k+1, where k+1 is non-boundary and already flushed to disk.
Restart: replayWAL sees only the boundary block's WAL entry; all non-boundary blocks fail the blockNumber < FileStartBlock() check and are dropped. They are re-indexed only by the tx-hash index path — which never got to them.

Result: receipts are durable in parquet but permanently unreachable by tx-hash lookup (the tx-index code path returns "not found"; receipt_config.go documents that no full-scan fallback is offered when the index is the configured backend). Durable data corruption from the RPC consumer's perspective.

Mitigating factors

In production the Cosmos commit path calls SetReceipts with a single block's receipts, so inputs never spans multiple blocks and the race is not triggered today. This is architectural — nothing in the type system prevents multi-block inputs, and warmupReceipts / replayWAL paths do batch across blocks. A future refactor that batches commits (fast-forward sync, snapshot restore, etc.) would start losing tx-index entries.

Recommendations

Either: index before WriteReceipts. WAL replay then catches any missed index writes — WAL preserves the boundary block, and all pre-boundary blocks are already in disk files by replay time.
Or: make the index write idempotent and re-derive from parquet on startup (scan files for blocks whose fileStartBlock >= indexHighWatermark).
At minimum, add a FaultHooks.AfterWALClear or new AfterIndex injection point and a crash test that asserts every pre-boundary tx is still reachable by hash after an index-time crash.

3. MEDIUM — `SetLatestVersion` on the empty-block path is non-monotonic

// sei-db/ledger_db/receipt/parquet_store.go:189-191
if ctx.BlockHeight() > s.store.LatestVersion() {
    s.store.SetLatestVersion(ctx.BlockHeight())
}

The non-empty path uses UpdateLatestVersion, which is guarded monotonically:

// sei-db/ledger_db/parquet/store.go:308-313
func (s *Store) UpdateLatestVersion(version int64) {
    if version > s.latestVersion.Load() {
        s.latestVersion.Store(version)
    }
}

The empty path does an unlocked Load-then-SetLatestVersion (unconditional Store). A concurrent non-empty SetReceipts could land a higher value in between; the subsequent SetLatestVersion(ctx.BlockHeight()) then rolls the counter back.

In production, Cosmos commits are serialized, so this is latent rather than active. Still worth swapping to UpdateLatestVersion for symmetry — the current asymmetry is a footgun for any future concurrent commit path.

4. LOW — `UpdateLatestVersion` itself is racy between writers

// sei-db/ledger_db/parquet/store.go:308-313
func (s *Store) UpdateLatestVersion(version int64) {
    if version > s.latestVersion.Load() {
        s.latestVersion.Store(version)
    }
}

Classic non-atomic compare-then-store. With two concurrent writers carrying versions V1 < V2, the V1 writer can Load a stored value below V1, the V2 writer can then Store V2, and the V1 writer's Store finally overwrites V2 with V1, rolling the version back.

Single writer in Cosmos, so latent. Replace with a CompareAndSwap loop.
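
A CompareAndSwap loop along those lines could look like the following sketch. The `versionTracker` type is hypothetical; it assumes the field is a `sync/atomic` `Int64` (available since Go 1.19), which matches the Load/Store calls quoted above but is not confirmed by the source.

```go
package main

import (
	"fmt"
	"sync"
	"sync/atomic"
)

// versionTracker is a hypothetical stand-in for the store's latestVersion field.
type versionTracker struct {
	latest atomic.Int64
}

// update raises latest to version if version is higher. The CAS retries when
// another writer changed the value between Load and CompareAndSwap, so the
// stored value can never move backwards.
func (v *versionTracker) update(version int64) {
	for {
		cur := v.latest.Load()
		if version <= cur {
			return
		}
		if v.latest.CompareAndSwap(cur, version) {
			return
		}
	}
}

func main() {
	var v versionTracker
	var wg sync.WaitGroup
	for i := int64(1); i <= 100; i++ {
		wg.Add(1)
		go func(n int64) {
			defer wg.Done()
			v.update(n)
		}(i)
	}
	wg.Wait()
	fmt.Println(v.latest.Load()) // always the maximum submitted version
}
```

The same helper could serve the empty-block path from item 3, which would remove the asymmetry between the two call sites.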

cody-littley

Approved conditional on refactor we discussed, prior to release to production environments.

jewei1997 added 3 commits April 20, 2026 09:29

align rotation boundaries between cache and parquet

b101beb

fix log rotation bug

df14bbc

Merge branch 'main' into fix-merge-results-log-cache

95cc7ef

jewei1997 added the non-app-hash-breaking label Apr 21, 2026

jewei1997 changed the title ~~Fix merge results log cache~~ Align parquet file rotation with cache chunk boundaries Apr 21, 2026

Merge branch 'main' into fix-merge-results-log-cache

df5374f

jewei1997 requested review from Kbhat1, cody-littley and yzang2019 April 21, 2026 14:00

yzang2019 approved these changes Apr 21, 2026

cody-littley requested changes Apr 22, 2026

jewei1997 added 10 commits April 23, 2026 13:01

fix ObserveEmptyBlock allows out-of-order updates of lastSeenBlock

6f1853e

fix SetMaxBlocksPerFile writes Reader.maxBlocksPerFile racily

43a19a9

fix File-name misalignment after lazy init at a non-boundary block

b488418

fix New crash window in WriteReceipts at rotation boundaries

ec3520d

fix empty-block coverage window invariant

345b2c8

fix Race window: cache claims coverage of a block before its receipts are inserted

9f5efc5

Merge branch 'main' into fix-merge-results-log-cache

9e5a39e

fix tests for disable full scan

c83742d

fix trucate active chunk

fd4463a

Merge branch 'main' into fix-merge-results-log-cache

b4b45ea

add NOTE

eae213d

cody-littley approved these changes Apr 24, 2026

jewei1997 added this pull request to the merge queue Apr 27, 2026

Merged via the queue into main with commit da20ae8 Apr 27, 2026
38 checks passed

jewei1997 deleted the fix-merge-results-log-cache branch April 27, 2026 13:13

jewei1997 merged 15 commits into main from fix-merge-results-log-cache
