refactor: simplify DA worker to prioritize fresh blocks by pthmas · Pull Request #38 · evstack/atlas

pthmas · 2026-03-19T17:08:55Z

Summary

Unified the two-phase DA worker design into a single query that always processes the highest-numbered blocks first. Fresh blocks now get priority over historical backfill, preventing the previous issue where new blocks would be inserted with 0 DA heights but never re-checked until the full backfill completed.

The new SELECT_NEEDS_DA_SQL combines both missing blocks and 0-height blocks in one query ordered by block number DESC. This eliminates phase starvation where phase 2 (pending re-checks) never got RPC budget when phase 1 (historical backfill) consumed the full 100-request quota.

Changes: merged two separate query paths into one, simplified the main run loop from managing two phases to a single process_blocks call, removed ~90 lines of code.

Testing

All existing tests pass. The query correctly identifies both missing and pending blocks, and the upsert operation properly suppresses no-op writes with IS DISTINCT FROM.

Summary by CodeRabbit

Improvements
- Optimized the blockchain data synchronization and indexing process to streamline operations and reduce unnecessary database overhead.

Unified the two-phase DA worker (backfill + pending) into a single query that always processes the highest-numbered blocks needing DA info first. The new SELECT_NEEDS_DA_SQL finds blocks missing from block_da_status OR with 0 heights, ordered DESC so fresh blocks float to the top during historical backfill — preventing starvation of new blocks. This fixes the issue where new blocks arriving during backfill would get inserted with 0 DA heights but phase 2 would never run to re-check them once Celestia included them (because phase 1 always consumed the full budget). Now all blocks are checked by the same priority queue.

coderabbitai · 2026-03-19T17:09:17Z

📝 Walkthrough

Walkthrough

The worker loop was refactored from a two-phase approach to a single unified flow that processes blocks in one step, combining missing status row detection and pending row updates into one SQL query and simplifying writes through an improved UPSERT statement with no-op suppression.

Changes

Cohort / File(s)	Summary
DA Worker Refactoring `backend/crates/atlas-server/src/indexer/da_worker.rs`	Consolidated two-phase worker loop (`backfill_new_blocks`, `update_pending_blocks`) into a single `process_blocks` method with unified SQL query ordered by `blocks.number DESC`. Replaced separate insert and conditional update operations with INSERT ... ON CONFLICT DO UPDATE featuring a `WHERE ... IS DISTINCT FROM ...` clause to suppress no-op updates. Updated cycle logging to report only `processed` count and adjusted test assertions for new query and UPSERT logic.

Estimated code review effort

🎯 4 (Complex) | ⏱️ ~50 minutes

Poem

🐰 One query hops where two once trampled,
The batch now flows with elegant UPSERT sampled,
No phantom writes disturb the night,
Just blocks processed, distinctly right!
hop ✨

🚥 Pre-merge checks | ✅ 3

✅ Passed checks (3 passed)

Check name	Status	Explanation
Description Check	✅ Passed	Check skipped - CodeRabbit’s high-level summary is enabled.
Title check	✅ Passed	The title 'refactor: simplify DA worker to prioritize fresh blocks' directly matches the main objective of unifying a two-phase worker into a single query that prioritizes fresh blocks, accurately summarizing the primary change.
Docstring Coverage	✅ Passed	Docstring coverage is 100.00% which is sufficient. The required threshold is 80.00%.

_{✏️ Tip: You can configure your own custom pre-merge checks in the settings.}

✨ Finishing Touches

📝 Generate docstrings

Create stacked PR
Commit on current branch

🧪 Generate unit tests (beta)

Create PR with unit tests
Commit unit tests in branch pthmas/tianjin-v1

📝 Coding Plan

Generate coding plan for human review comments

Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

coderabbitai

🧹 Nitpick comments (1)

backend/crates/atlas-server/src/indexer/da_worker.rs (1)
259-272: Consider: String-based SQL assertions are brittle.

These tests verify SQL structure via substring matching, which can break on formatting changes even when query semantics remain correct. For now this provides reasonable regression coverage, but consider adding integration tests against a real database (e.g., using sqlx::test) to validate actual query behavior if not already covered elsewhere.
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@backend/crates/atlas-server/src/indexer/da_worker.rs` around lines 259 - 272,
The tests assert SQL structure via brittle substring checks on
SELECT_NEEDS_DA_SQL and UPSERT_DA_STATUS_SQL; instead add integration tests that
execute the statements against a real test database (e.g., using sqlx::test or a
temporary test database) and assert the observed behavior (that the SELECT
returns newest blocks first and filters rows where d.block_number IS NULL or
header_da_height/data_da_height == 0, and that the UPSERT suppresses noop
writes) rather than relying on fragile string matches; implement these
integration tests calling the actual query builders/functions that produce
SELECT_NEEDS_DA_SQL and UPSERT_DA_STATUS_SQL (or normalize/parse the SQL if you
must keep unit tests) so semantics are validated end-to-end.

🤖 Prompt for all review comments with AI agents

Verify each finding against the current code and only fix it if needed.

Nitpick comments:
In `@backend/crates/atlas-server/src/indexer/da_worker.rs`:
- Around line 259-272: The tests assert SQL structure via brittle substring
checks on SELECT_NEEDS_DA_SQL and UPSERT_DA_STATUS_SQL; instead add integration
tests that execute the statements against a real test database (e.g., using
sqlx::test or a temporary test database) and assert the observed behavior (that
the SELECT returns newest blocks first and filters rows where d.block_number IS
NULL or header_da_height/data_da_height == 0, and that the UPSERT suppresses
noop writes) rather than relying on fragile string matches; implement these
integration tests calling the actual query builders/functions that produce
SELECT_NEEDS_DA_SQL and UPSERT_DA_STATUS_SQL (or normalize/parse the SQL if you
must keep unit tests) so semantics are validated end-to-end.

ℹ️ Review info

⚙️ Run configuration

Configuration used: defaults

Review profile: CHILL

Plan: Pro

Run ID: 1bd13e6d-989a-46c4-942a-60f3ed331837

📥 Commits

Reviewing files that changed from the base of the PR and between 2fb2426 and 9e937dd.

📒 Files selected for processing (1)

backend/crates/atlas-server/src/indexer/da_worker.rs

coderabbitai bot reviewed Mar 19, 2026

View reviewed changes

pthmas self-assigned this Mar 19, 2026

tac0turtle approved these changes Mar 19, 2026

View reviewed changes

pthmas merged commit ba5b44e into main Mar 20, 2026
8 checks passed

pthmas deleted the pthmas/tianjin-v1 branch March 20, 2026 08:33

coderabbitai bot mentioned this pull request Mar 20, 2026

perf: stream DA SSE updates as each RPC call completes #39

Open

3 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

refactor: simplify DA worker to prioritize fresh blocks#38

refactor: simplify DA worker to prioritize fresh blocks#38
pthmas merged 1 commit intomainfrom
pthmas/tianjin-v1

pthmas commented Mar 19, 2026 •

edited by coderabbitai bot

Loading

Uh oh!

coderabbitai bot commented Mar 19, 2026 •

edited

Loading

Walkthrough

Changes

Estimated code review effort

Poem

Uh oh!

coderabbitai bot left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

pthmas commented Mar 19, 2026 • edited by coderabbitai bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Testing

Summary by CodeRabbit

Uh oh!

coderabbitai bot commented Mar 19, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Walkthrough

Changes

Estimated code review effort

Poem

Uh oh!

coderabbitai bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

pthmas commented Mar 19, 2026 •

edited by coderabbitai bot

Loading

coderabbitai bot commented Mar 19, 2026 •

edited

Loading