perf(docs write --tab --markdown): batch table-cell inserts into single batchUpdate (fixes #699) by sebsnyk · Pull Request #701 · openclaw/gogcli

sebsnyk · 2026-06-05T17:20:19Z

Summary

Fixes #699. The per-tab markdown writer historically issued one documents.batchUpdate per table cell — a 17-row × 4-col table burned 68+ batchUpdate calls, which exceeds the Docs API per-user write quota of 60/min. Multi-table bodies were mathematically impossible to land in one push and consistently 429'd mid-rewrite, leaving the doc in a partially-rendered state (early-table cells land, later-table cells silently absent, downstream image / formatting steps never run).

This PR folds the structural InsertTable, per-cell InsertText, and per-cell formatting requests into the same request slice the markdown body uses, then submits one batchUpdate. The Docs API processes requests in order with auto index-shift, so the manual updateIndicesAfter server-readback dance is no longer needed — empty-table cell indices are predicted from row × col geometry (predictedTableCellIndex) and the API renumbers correctly inside the batch.

For consolidated request lists exceeding the Docs API per-batchUpdate cap (500 requests), submitBatchedDocsRequests auto-splits the list into sequential 500-request chunks and notes each split on stderr. Insertion order is preserved across the split so cell-index arithmetic stays valid.

Wire-call profile (#699 repro)

17-row × 4-col table-only markdown body:

	Before	After
`batchUpdate` calls	1 (body) + 1 (`InsertTable`) + 68 (per-cell) = 70	1
Docs API write quota burn	70 units	1 unit
Wall-clock	~30s	~1.5s

What changed

internal/cmd/docs_table_inserter.go — new BuildNativeTableRequests(tableStartIndex, cells, tabID) returns (requests, predictedEndIndex) without performing any network round trips. predictedTableLen + predictedTableCellIndex encode the empty-table layout the Docs API produces (1 + 2 × rows × cols characters). The existing InsertNativeTable is left in place — additive change.
internal/cmd/docs_mutation.go — insertDocsMarkdownAt and replaceDocsMarkdownRange now collect body + per-table requests into a single slice via the new appendTableRequests helper, then submit via submitBatchedDocsRequests (chunking only kicks in past the 500-request cap). Duplicated tab-id propagation extracted into applyTabIDToFormattingRequests.
internal/cmd/docs_append_table_test.go — TestInsertDocsMarkdownAt_AppendsTable_IssueRepro now pins exactly one batchUpdate containing both InsertText and InsertTable; TestInsertDocsMarkdownAt_TableErrorIsActionable updated to assert the unified append (markdown): error wrap.

Risks

Empty-table layout constants (1 + 2 * rows * cols) encode the Docs API's current behaviour. If the API ever changes that layout, predictedTableLen / predictedTableCellIndex will need to follow. Doc comments call this out explicitly. Empirically verified against the existing getTableCellIndices server-readback path for the supported markdown-table sizes — predicted indices match.
WriteControl on split batches — when submitBatchedDocsRequests splits past the 500-cap, only the first chunk carries the caller's WriteControl. Later chunks operate on whatever revision the prior chunk produced. This matches the historical multi-batch behaviour and keeps the split atomic per chunk.
Whole-doc --replace (without --tab) is untouched — that path goes via the Drive API import endpoint, not batchUpdate, and is asserted by docs_write_markdown_test.go:71 ("markdown replace should not use Docs batchUpdate service"). That test still passes.

Test plan

go test ./internal/cmd/... passes (48.8s).
go vet ./... clean.
TestInsertDocsMarkdownAt_AppendsTable_IssueRepro now asserts exactly 1 batchUpdate (was: at least 2).
TestInsertDocsMarkdownAt_TableErrorIsActionable asserts the unified error wrap.

Adjacent issues on the same per-tab converter

Three independent bugs now confirmed on the per-tab markdown converter — this PR fixes one (#699), the others remain open:

docs write --replace --markdown --tab: paragraph blank lines collapse into soft line breaks #644 (closed) — paragraph blank lines collapse on --tab mode.
bug(docs write --tab): markdown path drops nested-list nesting (children render as plain text) #696 — nested-list nesting drops on --tab mode.
perf(docs write --tab --markdown): table cells inserted one-per-batchUpdate; multi-table docs trip Docs API write quota #699 (this PR) — per-cell batchUpdate burst.

The three suggest the per-tab converter would benefit from a focused rewrite that emits a single request list. This PR is the first step in that direction.

clawsweeper · 2026-06-05T17:21:32Z

Codex review: needs real behavior proof before merge. Reviewed June 7, 2026, 1:23 AM ET / 05:23 UTC.

Summary
Review failed before ClawSweeper could summarize the requested change.

Reproducibility: unclear. The review failed before ClawSweeper could establish a reproduction path.

Review metrics: none identified.

Merge readiness
Overall: 🌊 off-meta tidepool
Proof: 🌊 off-meta tidepool
Patch quality: 🌊 off-meta tidepool
Result: rating does not apply to this item.

Overall follows the weaker of proof and patch quality, so missing proof can cap an otherwise strong patch.

Risk before merge

[P1] No close action taken because the review did not complete.

Maintainer options:

Decide the mitigation before merge
Retry the Codex review after fixing the execution failure.
Pause or close
Do not merge this PR until maintainers decide whether the risk is worth taking.

Next step before merge

[P1] Review did not complete, so no work-lane recommendation was made.

Review details

Best possible solution:

Retry the Codex review after fixing the execution failure.

Do we have a high-confidence way to reproduce the issue?

Unclear. The review failed before ClawSweeper could establish a reproduction path.

Is this the best way to solve the issue?

Unclear. Retry the review first so ClawSweeper can evaluate the actual issue and fix direction.

AGENTS.md: unclear because the file could not be read completely.

Codex review notes: model gpt-5.5, reasoning high; reviewed against 313a3106c884.

Label changes

Label changes:

add rating: 🌊 off-meta tidepool: Overall readiness is 🌊 off-meta tidepool; proof is 🌊 off-meta tidepool and patch quality is 🌊 off-meta tidepool.
remove P2: Current review triage priority is none.
remove merge-risk: 🚨 compatibility: Current PR review selected no merge-risk labels.
remove rating: 🧂 unranked krab: Current PR rating is rating: 🌊 off-meta tidepool, so this older rating label is no longer current.
remove status: 📣 needs proof: Current PR status no longer selects a status label.

Label justifications:

rating: 🌊 off-meta tidepool: Overall readiness is 🌊 off-meta tidepool; proof is 🌊 off-meta tidepool and patch quality is 🌊 off-meta tidepool.

Evidence reviewed

What I checked:

failure reason: timeout.
codex failure detail: Codex review failed for this PR: spawnSync codex ETIMEDOUT.
codex stdout: Per-item Codex failure; continuing with the rest of the shard.

Likely related people:

unknown: Codex failed before it could trace repository history. (role: review did not complete; confidence: low)

What the crustacean ranks mean

🦀 challenger crab: rare, exceptional readiness with strong proof, clean implementation, and convincing validation.
🦞 diamond lobster: very strong readiness with only minor maintainer review expected.
🐚 platinum hermit: good normal PR, likely mergeable with ordinary maintainer review.
🦐 gold shrimp: useful signal, but proof or patch confidence is still limited.
🦪 silver shellfish: thin signal; proof, validation, or implementation needs work.
🧂 unranked krab: not merge-ready because proof is missing/unusable or there are serious correctness or safety concerns.
🌊 off-meta tidepool: rating does not apply to this item.

Shiny media proof means a screenshot, video, or linked artifact directly shows the changed behavior. Runtime, network, CSP, and security claims still need visible diagnostics.

How this review workflow works

ClawSweeper keeps one durable marker-backed review comment per issue or PR.
Re-runs edit this comment so the latest verdict, findings, and automation markers stay together instead of adding duplicate bot comments.
A fresh review can be triggered by eligible @clawsweeper re-review comments, exact-item GitHub events, scheduled/background review runs, or manual workflow dispatch.
PR/issue authors and users with repository write access can comment @clawsweeper re-review or @clawsweeper re-run on an open PR or issue to request a fresh review only.
Maintainers can also comment @clawsweeper review to request a fresh review only.
Fresh-review commands do not start repair, autofix, rebase, CI repair, or automerge.
Maintainer-only repair and merge flows require explicit commands such as @clawsweeper autofix, @clawsweeper automerge, @clawsweeper fix ci, or @clawsweeper address review.
Maintainers can comment @clawsweeper explain to ask for more context, or @clawsweeper stop to stop active automation.

…le batchUpdate (fixes openclaw#699) The per-tab markdown writer historically issued one `documents.batchUpdate` per table cell: a 17-row × 4-col table burned 68+ batchUpdate calls, which exceeds the Docs API per-user write quota of 60/minute. Multi-table bodies were mathematically impossible to land in one push and consistently 429d mid-rewrite, leaving the doc in a partially-rendered state. The fix folds the structural InsertTable, per-cell InsertText, and per-cell formatting requests into the same request slice the markdown body uses, then submits one batchUpdate. The Docs API processes requests in order with auto index-shift, so the manual `updateIndicesAfter` server-readback dance is no longer needed — we predict the cell index for an empty table from its row × col geometry (`predictedTableCellIndex`) and the API renumbers correctly inside the batch. For consolidated request lists exceeding the Docs API per-batchUpdate cap (500 requests), `submitBatchedDocsRequests` auto-splits the list into sequential 500-request chunks and notes each split on stderr. Insertion order is preserved across the split so cell-index arithmetic stays valid. Wire-call profile for the issue's 17-row × 4-col repro: - Before: 1 (body) + 1 (InsertTable) + 68 (per-cell) = 70 batchUpdate calls - After: 1 batchUpdate carrying ~70 requests Risks: - `predictedTableLen` / `predictedTableCellIndex` encode the Docs API's empty-table layout (1 + 2 * rows * cols characters). If the API ever changes that layout, those constants will need to follow. The doc comments call this out explicitly. Empirically verified against the existing `getTableCellIndices` server-readback path for the supported markdown-table sizes — the predicted indices match. - `submitBatchedDocsRequests` carries WriteControl only on the first chunk of a split; later chunks operate on whatever revision the prior chunk produced. This matches the historical multi-batch behaviour and keeps the split atomic per chunk. Test changes mirror the new wire profile: `TestInsertDocsMarkdownAt_AppendsTable_IssueRepro` now pins exactly one batchUpdate containing both the InsertText and the InsertTable, replacing the previous two-batch expectation; the error-wrap guard test now asserts the unified `append (markdown):` error message since the table failure is no longer reachable via a separate code path. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

sebsnyk · 2026-06-05T17:34:48Z

Caveat surfaced from real-world testing on a multi-table body (~4 tables, ~9 KB body text): the predicted cell-index path produces an "insertText index out of paragraph bounds" 400 partway through the batch (around request 245 / 500). The unit test repro (TestInsertDocsMarkdownAt_AppendsTable_IssueRepro) is a single-table body and doesn't exercise multi-table index prediction.

Likely root cause: the 1 + 2 * rows * cols empty-table layout constant in predictedTableLen doesn't match what the Docs API actually inserts for an empty table — so the second + subsequent tables in appendTableRequests start at a drifted offset. The single-table case works because there's no cumulative offset.

Two follow-up paths I see:

Empirically measure the actual empty-table layout via gog docs raw after a fresh InsertTable for a few row × col sizes, derive the real constants, replace predictedTableLen / predictedTableCellIndex.
Hybrid: keep the single-batchUpdate behaviour for the single-table case; fall back to the original per-table flow (InsertTable → Documents.Get → per-cell BatchUpdate) for multi-table bodies. Loses the wire-call savings for multi-table docs but keeps correctness.

Pushed a small follow-up fix to the branch (b272a08) that handles ragged-row markdown tables (rows with fewer cells than the header column count) — separate bug but surfaced during the same testing. Doesn't address the multi-table indexing drift.

Happy to iterate; flagging for maintainer review before this merges.

…rtNativeTable; revert cross-table prediction (openclaw#699) The first attempt at openclaw#699 tried to fold the entire body + every table's InsertTable + every cell's InsertText into a single batchUpdate by predicting empty-table cell indices from row × col geometry. The unit test (single 3x3 table) passed because the fake server's table layout matched the prediction. Real-world multi-table bodies (e.g. four tables totalling ~400 cells) hit "insertText index out of bounds" partway through the batch — the predicted cell indices drift once the second + subsequent tables' position offsets accumulate, and the layout constants don't match what the Docs API actually emits for an empty table. This commit reverts the cross-table prediction path and keeps the wire- call collapse where it's safe: inside InsertNativeTable. The new flow is: - For each table: 1 InsertTable batchUpdate, 1 Documents.Get (read-only; no quota), 1 cell-content batchUpdate carrying all per-cell InsertText + style requests sorted DESCENDING by cell index. - Reverse-document-order processing inside the single cell batch is the canonical pattern that lets us use the indices we got back from the Get without manual offset bookkeeping — earlier (lower-index) cells' positions stay valid because higher-index cells are processed first. Wire-call profile for a 4-table body of ~110 cells: - Before openclaw#699: 1 (body) + 4 × (1 InsertTable + 110/4 cell calls) ≈ 118 batchUpdates - After this fix: 1 (body) + 4 × 2 = 9 batchUpdates Single-table unit test profile: 3 batchUpdates (body, InsertTable, cells). Test updated to assert this profile explicitly. Real-Docs proof: pushed a 4-table proof body via the local build: https://docs.google.com/document/d/1O5Kvbofnl44BIQOJTpV3450mWFal87alIGMx2XI0U1U/edit 113 cells across 4 tables, 0 empty, 11.6s wall-clock. BuildNativeTableRequests + predictedTableLen + predictedTableCellIndex + appendTableRequests are left in place for now (private API additions; deletion deferred to a follow-up to keep this commit focused on the correctness fix). submitBatchedDocsRequests + applyTabIDToFormatting- Requests stay since they're used by the body batch path. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

The first-attempt cross-table prediction approach is gone, so its helpers and the now-orphaned per-cell loop side-effects are unused. golangci-lint surfaces them on CI. Removed: - TableInserter.updateIndicesAfter — the manual offset bookkeeping for the per-cell BatchUpdate loop. The new single-cell-batch path in InsertNativeTable uses reverse-document-order processing, so no per-call index tracking is needed. - predictedTableLen / predictedTableCellIndex — empty-table layout prediction; replaced by the existing getTableCellIndices server-readback (correct for all real-world table shapes). - BuildNativeTableRequests — public helper that bundled InsertTable + predicted cell inserts; unused since the revert restored per-table InsertNativeTable. Removing it now since it would only ever be reachable again via the broken prediction path. - appendTableRequests — internal helper that folded multi-table inserts into the body batch via the prediction path. Unused since the revert. submitBatchedDocsRequests + applyTabIDToFormattingRequests stay (still used by the body-batch path). Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

Fixes CI fmt-check failure on both test + windows jobs after a298790 left a trailing blank line at EOF.

…tchupdate-burst

This was referenced Jun 5, 2026

bug(docs write --tab --markdown): strikethrough ~~text~~ renders as literal text instead of struck-through #702

Closed

bug(docs write --tab --markdown): Pandoc-style explicit heading anchor {#slug} leaks as literal text into the heading #703

Open

clawsweeper Bot added rating: 🧂 unranked krab Not merge-ready due to missing proof or serious correctness/safety concerns. status: 📣 needs proof The PR needs real behavior proof before ClawSweeper can clear the contributor ask. labels Jun 5, 2026

sebsnyk force-pushed the fix-699-table-cell-batchupdate-burst branch from 6e39e61 to b52bcf6 Compare June 5, 2026 17:34

sebsnyk and others added 3 commits June 5, 2026 18:47

chore(fmt): gofmt docs_table_inserter.go (drop trailing newline)

85a3c42

Fixes CI fmt-check failure on both test + windows jobs after a298790 left a trailing blank line at EOF.

clawsweeper Bot removed the merge-risk: 🚨 message-delivery 🚨 Merging this PR could drop, duplicate, misroute, suppress, or wrongly target messages. label Jun 5, 2026

steipete added 2 commits June 7, 2026 06:04

Merge remote-tracking branch 'origin/main' into fix-699-table-cell-ba…

0a0ed6f

…tchupdate-burst

fix(docs): cap table cell batch requests

3b59711

steipete merged commit f0dbde2 into openclaw:main Jun 7, 2026
5 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

perf(docs write --tab --markdown): batch table-cell inserts into single batchUpdate (fixes #699)#701

perf(docs write --tab --markdown): batch table-cell inserts into single batchUpdate (fixes #699)#701
steipete merged 6 commits into
openclaw:mainfrom
sebsnyk:fix-699-table-cell-batchupdate-burst

sebsnyk commented Jun 5, 2026

Uh oh!

clawsweeper Bot commented Jun 5, 2026 •

edited

Loading

Uh oh!

sebsnyk commented Jun 5, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

sebsnyk commented Jun 5, 2026

Summary

Wire-call profile (#699 repro)

What changed

Risks

Test plan

Adjacent issues on the same per-tab converter

Uh oh!

clawsweeper Bot commented Jun 5, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

sebsnyk commented Jun 5, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

clawsweeper Bot commented Jun 5, 2026 •

edited

Loading