WAL utility tears itself down after the first error by cody-littley · Pull Request #3006 · sei-protocol/sei-chain

cody-littley · 2026-03-03T14:55:32Z

Describe your changes and provide context

https://linear.app/seilabs/issue/STO-397/address-wal-feedback

Per feedback, the desired behavior of the WAL utility is that it should tear itself down after it encounters the first error.

Testing performed to validate your change

Unit test coverage.

github-actions · 2026-03-03T14:56:31Z

The latest Buf updates on your PR. Results from workflow Buf / buf (pull_request).

Build	Format	Lint	Breaking	Updated (UTC)
`✅ passed`	`✅ passed`	`✅ passed`	`✅ passed`	Mar 3, 2026, 8:26 PM

github-actions · 2026-03-03T14:56:33Z

The latest Buf updates on your PR. Results from workflow Buf / buf (pull_request).

Build	Format	Lint	Breaking	Updated (UTC)
`✅ passed`	`✅ passed`	`✅ passed`	`✅ passed`	Mar 3, 2026, 2:56 PM

codecov · 2026-03-03T15:04:37Z

Codecov Report

❌ Patch coverage is 58.97436% with 32 lines in your changes missing coverage. Please review.
✅ Project coverage is 58.13%. Comparing base (fb21209) to head (8069893).
⚠️ Report is 1 commits behind head on main.

Files with missing lines	Patch %	Lines
sei-db/wal/wal.go	58.97%	25 Missing and 7 partials ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##             main    #3006      +/-   ##
==========================================
- Coverage   58.26%   58.13%   -0.14%     
==========================================
  Files        2108     2113       +5     
  Lines      173664   174000     +336     
==========================================
- Hits       101181   101147      -34     
- Misses      63456    63798     +342     
- Partials     9027     9055      +28

Flag	Coverage Δ
sei-chain-pr	`67.19% <58.97%> (?)`
sei-db	`70.41% <ø> (ø)`

Flags with carried forward coverage won't be shown. Click here to find out more.

Files with missing lines	Coverage Δ
sei-db/wal/wal.go	`68.85% <58.97%> (-2.91%)`	⬇️

... and 95 files with indirect coverage changes

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.
📦 JS Bundle Analysis: Save yourself from yourself by tracking and limiting bundle sizes in JS merges.

blindchaser · 2026-03-03T15:37:07Z

sei-db/wal/wal.go

@@ -341,13 +372,23 @@ func (walLog *WAL[T]) handleTruncate(req *truncateRequest) {
 		err = walLog.log.TruncateBack(req.index)
 	}
 	if err != nil {
-		req.errChan <- fmt.Errorf("failed to truncate: %w", err)
+		err = fmt.Errorf("failed to truncate: %w", err)
+		if strings.Contains(err.Error(), "out of range") {


in case tidwall/wal library changes its error message wording in the future, how about:

if strings.Contains(err.Error(), "out of range") { req.errChan <- fmt.Errorf("failed to truncate: %w", err) return } walLog.reportFatalError(fmt.Errorf("failed to truncate: %w", err), req.errChan) return

I wrapped the error inside the "out of range" block, but I'm not sure if that is what you were asking me to do. New code is below:

if err != nil { err = fmt.Errorf("failed to truncate: %w", err) if strings.Contains(err.Error(), "out of range") { err = fmt.Errorf("out of range truncate error: %w", err) req.errChan <- err return } walLog.reportFatalError(err, req.errChan) return }

blindchaser · 2026-03-03T15:38:42Z

sei-db/wal/wal.go

+	// Store on heap so the pointer remains valid after this function returns.
+	p := new(error)
+	*p = err
+	walLog.asyncError.Store(p)


should we call walLog.cancel() in the reportFatalError()?

Explicitly calling cancel() is not required. When asyncError is set, the loop exits, and immediately after the loop exits the context is cancelled.

for running && walLog.asyncError.Load() == nil { select { case <-walLog.ctx.Done(): running = false case req := <-walLog.writeChan: walLog.handleWrite(req) case req := <-walLog.truncateChan: walLog.handleTruncate(req) case <-pruneChan: walLog.prune() case <-walLog.closeReqChan: running = false } } walLog.cancel()

yzang2019 · 2026-03-03T16:33:56Z

sei-db/wal/wal.go

+	// Store on heap so the pointer remains valid after this function returns.
+	p := new(error)
+	*p = err
+	walLog.asyncError.Store(p)


If we have multiple errors, will the later errors replace the previous one here? And are we OK with that?

Went back and checked, and there was one edge case where this was possible (i.e. during drain() when we are tearing down the system). I fixed this issue, and so now it should never be possible for asyncError to be set more than once.

## Describe your changes and provide context https://linear.app/seilabs/issue/STO-397/address-wal-feedback Per feedback, the desired behavior of the WAL utility is that it should tear itself down after it encounters the first error. ## Testing performed to validate your change Unit test coverage. --------- Co-authored-by: Cody Littley <cody.littley@seinetwork.io>

WAL utility tears itself down after the first error

3ba0eb6

cody-littley requested review from blindchaser and yzang2019 March 3, 2026 14:55

cody-littley self-assigned this Mar 3, 2026

cody-littley added the non-app-hash-breaking label Mar 3, 2026

Tweak how out of range errors are treated

8586eca

blindchaser reviewed Mar 3, 2026

View reviewed changes

blindchaser approved these changes Mar 3, 2026

View reviewed changes

made suggested change

1e1efaf

yzang2019 approved these changes Mar 3, 2026

View reviewed changes

Cody Littley and others added 2 commits March 3, 2026 10:54

Don't overwrite async err

da38e4e

Merge branch 'main' into cody-littley/wal-fail-behavior

c6120e4

cody-littley enabled auto-merge (squash) March 3, 2026 18:47

Merge branch 'main' into cody-littley/wal-fail-behavior

8069893

cody-littley merged commit ed4946d into main Mar 3, 2026
35 checks passed

cody-littley deleted the cody-littley/wal-fail-behavior branch March 3, 2026 20:41

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

WAL utility tears itself down after the first error#3006

WAL utility tears itself down after the first error#3006
cody-littley merged 6 commits intomainfrom
cody-littley/wal-fail-behavior

cody-littley commented Mar 3, 2026

Uh oh!

github-actions bot commented Mar 3, 2026 •

edited

Loading

Uh oh!

github-actions bot commented Mar 3, 2026

Uh oh!

codecov bot commented Mar 3, 2026 •

edited

Loading

Uh oh!

blindchaser Mar 3, 2026

Uh oh!

cody-littley Mar 3, 2026

Uh oh!

blindchaser Mar 3, 2026

Uh oh!

cody-littley Mar 3, 2026

Uh oh!

yzang2019 Mar 3, 2026

Uh oh!

cody-littley Mar 3, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

cody-littley commented Mar 3, 2026

Describe your changes and provide context

Testing performed to validate your change

Uh oh!

github-actions bot commented Mar 3, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

github-actions bot commented Mar 3, 2026

Uh oh!

codecov bot commented Mar 3, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

blindchaser Mar 3, 2026

Choose a reason for hiding this comment

Uh oh!

cody-littley Mar 3, 2026

Choose a reason for hiding this comment

Uh oh!

blindchaser Mar 3, 2026

Choose a reason for hiding this comment

Uh oh!

cody-littley Mar 3, 2026

Choose a reason for hiding this comment

Uh oh!

yzang2019 Mar 3, 2026

Choose a reason for hiding this comment

Uh oh!

cody-littley Mar 3, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

github-actions bot commented Mar 3, 2026 •

edited

Loading

codecov bot commented Mar 3, 2026 •

edited

Loading