
test(amber): add unit test coverage for NetworkOutputBuffer#4958

Open
aglinxinyuan wants to merge 4 commits into apache:main from
aglinxinyuan:test-network-output-buffer-spec

Conversation

Contributor

@aglinxinyuan aglinxinyuan commented May 5, 2026

What changes were proposed in this PR?

Adds NetworkOutputBufferSpec covering NetworkOutputBuffer (defined in amber/src/main/scala/org/apache/texera/amber/engine/architecture/sendsemantics/partitioners/Partitioner.scala), the per-receiver batched-tuple sender that every concrete Partitioner subclass uses to push data downstream. The class has stateful buffer-and-flush semantics that no existing test pins.

| Surface | Pinned behavior |
| --- | --- |
| Construction | `batchSize` defaults to `ApplicationConfig.defaultDataTransferBatchSize`; `to` / `dataOutputPort` are exposed as immutable accessors; no implicit auto-flush at construction. |
| `addTuple` | No flush below `batchSize`; exact-boundary auto-flush; one `DataFrame` per batch with tuples in insertion order; no cross-batch leakage; data-channel routing to the configured receiver. |
| `flush()` | Non-empty: sends a `DataFrame` and resets the buffer. Empty: no-op (pinned so a regression that sends empty `DataFrame`s breaks here). Sequence numbers increase monotonically across flushes. |
| `flush()` + `sendState` share a sequence stream | `DataFrame` and `StateFrame` go through the same channel and share the gateway's per-channel sequence counter. A regression that side-channels `StateFrame` would produce a non-monotonic stream and fail this test. |
| `sendState` | The pre-flush bookend drains pending tuples first as their own `DataFrame`, then sends the `StateFrame`; the pre-flush is a no-op when nothing is pending; the trailing post-state flush leaves the buffer clean for subsequent `addTuple` calls. |
| `batchSize` edge cases | `batchSize = 1` flushes after every `addTuple`. A `batchSize <= 0` is reachable from the API: the server-side path (`SyncExecutionResource`) accepts `request.workflowSettings` directly with no `>= 1` validation, so zero or negative values reach `NetworkOutputBuffer` from a direct API caller even though the UI restricts the setting. Covered by a paired characterization + `pendingUntilFixed` test: the characterization pins today's lenient `>=`-guard per-tuple-flush behavior so a regression on that path surfaces, while `pendingUntilFixed` pins the desired hardening (rejection at construction). When the hardening lands, both flip and must be updated together. |
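
To make the pinned surface concrete, here is a minimal, self-contained ScalaTest sketch. `ToyOutputBuffer`, `Data`, and `State` are hypothetical stand-ins that mirror the behavior described above; they are not the production `NetworkOutputBuffer` API, whose constructor and payload types differ:

```scala
import scala.collection.mutable
import org.scalatest.flatspec.AnyFlatSpec
import org.scalatest.matchers.should.Matchers

// Toy model of the semantics in the table above; NOT the production class.
sealed trait Payload
final case class Data(tuples: Seq[String]) extends Payload
final case class State(marker: String) extends Payload

class ToyOutputBuffer(batchSize: Int, sink: Payload => Unit) {
  private val buf = mutable.ArrayBuffer.empty[String]
  def addTuple(t: String): Unit = {
    buf += t
    if (buf.size >= batchSize) flush() // exact-boundary auto-flush via a `>=` guard
  }
  def flush(): Unit =
    if (buf.nonEmpty) { sink(Data(buf.toSeq)); buf.clear() } // empty flush is a no-op
  def sendState(marker: String): Unit = {
    flush()             // pre-flush bookend: drain pending tuples first
    sink(State(marker))
    flush()             // trailing flush: leaves the buffer clean (no-op here)
  }
}

class ToyOutputBufferSpec extends AnyFlatSpec with Matchers {
  "addTuple" should "auto-flush exactly at the batchSize boundary" in {
    val sent = mutable.Buffer.empty[Payload]
    val b = new ToyOutputBuffer(batchSize = 3, sent += _)
    b.addTuple("t1"); b.addTuple("t2")
    sent shouldBe empty // below the boundary: nothing sent yet
    b.addTuple("t3")    // boundary hit: one batch, insertion order preserved
    sent shouldBe Seq(Data(Seq("t1", "t2", "t3")))
  }

  "sendState" should "drain pending tuples before sending the state" in {
    val sent = mutable.Buffer.empty[Payload]
    val b = new ToyOutputBuffer(batchSize = 10, sent += _)
    b.addTuple("t1")
    b.sendState("marker")
    sent shouldBe Seq(Data(Seq("t1")), State("marker"))
  }
}
```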

The test wires a real NetworkOutputGateway with a recording handler, so assertions exercise the production codepath end-to-end (sequence-number assignment, channel-id construction, payload-type routing) rather than mocking the gateway.
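
As a sketch of that recording-handler idea (toy types only; the real `NetworkOutputGateway` assigns per-channel sequence numbers internally and its API differs):

```scala
import scala.collection.mutable

// Toy stand-in for the recording-handler wiring: a "gateway" that stamps a
// per-channel sequence number on every outgoing payload and records it, so
// monotonicity can be asserted on the send path itself rather than a mock.
final case class Sent(seq: Long, payload: String)

class ToyGateway {
  private var seq = 0L
  val recorded: mutable.Buffer[Sent] = mutable.Buffer.empty
  def send(payload: String): Unit = { recorded += Sent(seq, payload); seq += 1 }
}

object SequenceStreamSketch extends App {
  val gw = new ToyGateway
  gw.send("DataFrame(t1, t2)")  // tuple batch
  gw.send("StateFrame(marker)") // state shares the same channel
  gw.send("DataFrame(t3)")
  // One shared, strictly increasing sequence stream across payload types:
  assert(gw.recorded.map(_.seq) == Seq(0L, 1L, 2L))
}
```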

No production code changed; this is test-only.

Any related issues, documentation, discussions?

Closes #4957

How was this PR tested?

sbt "WorkflowExecutionService/Test/testOnly org.apache.texera.amber.engine.architecture.sendsemantics.partitioners.NetworkOutputBufferSpec"
# → 16 passing + 1 pendingUntilFixed (the future-hardening contract for batchSize <= 0)

sbt "WorkflowExecutionService/Test/scalafmtCheck"
# → clean

Was this PR authored or co-authored using generative AI tooling?

Generated-by: Claude Code (Claude Opus 4.7)

Pin the per-receiver batched-tuple sender used by every concrete
Partitioner subclass:

- Construction: batchSize defaults to
  ApplicationConfig.defaultDataTransferBatchSize; `to` and
  `dataOutputPort` are exposed as immutable accessors; an empty
  buffer at construction does not implicitly auto-flush.

- addTuple: does not flush below batchSize; auto-flushes exactly at
  the batchSize boundary; produces one DataFrame per batch with
  tuples in insertion order, with no leak across batches; routes
  DataFrames to the configured receiver on the data (non-control)
  channel.

- flush: sends a DataFrame with the buffered tuples and resets the
  buffer when non-empty; is a no-op when called on an empty buffer
  (so a regression that sends an empty DataFrame breaks here);
  assigns monotonically increasing sequence numbers across
  multiple flushes; the same sequence-number stream is shared with
  the StateFrame on the same channel (pin so a regression that
  side-channels StateFrame breaks).

- sendState: pre-flush bookend drains pending tuples in their own
  DataFrame BEFORE the StateFrame; pre-flush is a no-op when nothing
  is pending; trailing post-state flush leaves the buffer clean for
  the next addTuple.

- Edge cases: batchSize=1 flushes after every addTuple; batchSize=0
  collapses to the same behavior because the `>=` guard fires for
  any non-empty buffer (characterized so a future tightening to `>`
  breaks on purpose); see the guard sketch after this list.
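
A minimal illustration of why `batchSize = 0` collapses to per-tuple flushing under the `>=` guard (toy function mirroring the guard described above, not the production code):

```scala
object GuardSketch extends App {
  // Toy guard, same shape as the `>=` comparison the commit describes.
  def shouldFlush(buffered: Int, batchSize: Int): Boolean = buffered >= batchSize

  assert(shouldFlush(buffered = 1, batchSize = 1))  // batchSize = 1: flush on every add
  assert(shouldFlush(buffered = 1, batchSize = 0))  // batchSize = 0: guard always fires
  assert(!shouldFlush(buffered = 1, batchSize = 2)) // normal case: wait for the boundary
}
```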

The test wires a real NetworkOutputGateway with a recording handler,
so the assertions exercise the production codepath end-to-end
(sequence-number assignment, channel-id construction, payload type)
rather than mocking the gateway.

Closes apache#4957
Copilot AI review requested due to automatic review settings May 5, 2026 23:35
@github-actions github-actions Bot added the engine label May 5, 2026
Contributor

Copilot AI left a comment


Pull request overview

This PR adds focused unit coverage for NetworkOutputBuffer, the batching helper used by partitioners to send DataFrame and StateFrame payloads through NetworkOutputGateway. It fits into Amber’s messaging layer by pinning buffering, flush, and sequencing behavior that previously had no direct spec coverage.

Changes:

  • Adds a new NetworkOutputBufferSpec exercising constructor defaults, tuple buffering, auto-flush, and explicit flush().
  • Verifies sendState ordering and sequence-number behavior using a real NetworkOutputGateway with a recording handler.
  • Adds edge-case coverage for small batch sizes, including batchSize = 1 and batchSize = 0.


@Yicong-Huang
Contributor

Let's make each test-related PR a bit larger. This is a trade-off.

In general, we want to keep PRs small so they are easier to review and so a large-scale revert does not conflict with a lot of files.

For PRs that add tests only, the change is usually a new file, so it is less likely to conflict and is easier to review; we can make such PRs slightly bigger to reduce both the pressure of reviewing many PRs and the CI load.

Address Copilot review on apache#4958:

- Drop the `batchSize = 0` characterization-pin. The
  workflow-settings UI restricts data-transfer batch size to `>= 1`
  before the value ever reaches `NetworkOutputBuffer`, so 0 is not
  a reachable production input. Pinning the current `>=` behavior
  would enshrine an implementation detail and block a future
  hardening change that rejects the invalid value at construction
  time. Replace the test with a NOTE block documenting the
  rationale so a future contributor doesn't reflexively re-add it.

- Remove unused `ChannelIdentity` import. The amber module enables
  `-Ywarn-unused:imports` in `amber/build.sbt`, so the unused import
  was avoidable compiler noise.

codecov-commenter commented May 6, 2026

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ Project coverage is 42.23%. Comparing base (6b79896) to head (58acc7c).
⚠️ Report is 4 commits behind head on main.

Additional details and impacted files
```
@@            Coverage Diff            @@
##               main    #4958   +/-   ##
=========================================
  Coverage     42.22%   42.23%
  Complexity     2180     2180
=========================================
  Files           980      980
  Lines         36287    36287
  Branches       3783     3783
=========================================
+ Hits          15321    15324    +3
+ Misses        20038    20034    -4
- Partials        928      929    +1
```
| Flag | Coverage Δ |
| --- | --- |
| amber | 43.14% <ø> (+0.01%) ⬆️ |

Flags with carried-forward coverage won't be shown.


@aglinxinyuan
Contributor Author

> Let's make each test-related PR a bit larger. This is a trade-off. [...]

Will make future test PRs larger.

Contributor

Copilot AI left a comment


Pull request overview

Copilot reviewed 1 out of 1 changed files in this pull request and generated 1 comment.



Address Copilot follow-up on apache#4958: my previous NOTE claimed
batchSize=0 was not reachable from production because the workflow-
settings UI restricts the value to >= 1. That was inaccurate —
SyncExecutionResource accepts request.workflowSettings directly from
the API and the backend forwards workflowSettings.dataTransferBatchSize
into NetworkOutputBuffer with no server-side validation, so a
malicious or buggy API caller can reach batchSize=0 today.

Replace the misleading NOTE with:
- An accurate comment block describing the actual reachability
  (server-side API path, no validation).
- A pendingUntilFixed test asserting the desired hardening:
  construction with batchSize <= 0 should throw
  IllegalArgumentException (e.g. via a require(batchSize > 0, ...)
  guard). Today this test is pending because the constructor
  accepts non-positive values; once the hardening lands the
  assertion will pass and pendingUntilFixed will invert that into
  a deliberate failure forcing the marker to be removed.
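
A sketch of that pendingUntilFixed mechanics, reusing the hypothetical `ToyOutputBuffer` from the sketch in the PR description above (the real spec targets `NetworkOutputBuffer` itself):

```scala
import org.scalatest.flatspec.AnyFlatSpec
import org.scalatest.matchers.should.Matchers

class HardeningSketchSpec extends AnyFlatSpec with Matchers {
  "construction" should "reject batchSize <= 0 once the hardening lands" in {
    pendingUntilFixed {
      // Fails today because the constructor accepts non-positive values,
      // so ScalaTest reports the test as pending. Once a
      // `require(batchSize > 0, ...)` guard is added, the assertion
      // passes and pendingUntilFixed flips the test into a failure,
      // forcing the marker to be removed.
      assertThrows[IllegalArgumentException] {
        new ToyOutputBuffer(batchSize = 0, _ => ())
      }
    }
  }
}
```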
Contributor

Copilot AI left a comment


Pull request overview

Copilot reviewed 1 out of 1 changed files in this pull request and generated 1 comment.



…ngUntilFixed

Address Copilot follow-up on apache#4958: pendingUntilFixed alone left the
reachable-from-API batchSize<=0 path uncovered by a real regression
test. Add a paired characterization test that pins today's lenient
behavior (per-tuple flush via the `>=` guard) for both batchSize=0
and batchSize=-1. The pair pattern matches the deserializeKey
extra-segments coverage in apache#4954: a future hardening that rejects
`<= 0` at construction will break the characterization on purpose AND
flip pendingUntilFixed to passing, forcing both markers to be updated
together — instead of leaving a regression hole today.
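
The characterization half of the pair, again sketched against the hypothetical `ToyOutputBuffer` (and its `Data` payload) from the sketch in the PR description, rather than the production class:

```scala
import scala.collection.mutable

object CharacterizationSketch extends App {
  // Pins today's lenient behavior: for any batchSize <= 0 the `>=` guard
  // fires on every add, so each tuple goes out as its own one-element batch.
  for (bs <- Seq(0, -1)) {
    val sent = mutable.Buffer.empty[Payload]
    val b = new ToyOutputBuffer(batchSize = bs, sent += _)
    b.addTuple("t1"); b.addTuple("t2")
    assert(sent == Seq(Data(Seq("t1")), Data(Seq("t2")))) // per-tuple flush, characterized
  }
}
```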
Contributor

Copilot AI left a comment


Pull request overview

Copilot reviewed 1 out of 1 changed files in this pull request and generated no new comments.



