fix: remature context cancellation during graceful shutdown in PartitionedFanoutWriter by xixipi-lining · Pull Request #1225 · apache/iceberg-go

xixipi-lining · 2026-06-18T10:53:15Z

Problem Statement

Currently, when partitionedFanoutWriter.Write completes successfully, it waits for all upstream producers to finish (fanoutWorkers.Wait()), and then attempts a graceful shutdown by calling writerFactory.closeAll().

However, writerFactory.closeAll() iterates through the RollingDataWriters and calls closeAndWait() on each. The close() method does the following:

func (r *RollingDataWriter) close() {
	r.cancel() // Problem here!
	close(r.recordCh)
}

By immediately calling r.cancel(), it actively cancels the context passed to the background stream goroutine. Since recordCh is buffered (size 64), it may still contain unprocessed RecordBatch items. If the underlying Arrow functions (e.g., compute.SortRecordBatch) or Parquet writers check ctx.Done() during this draining phase, they will abort immediately with a context.Canceled error. This converts what should be a successful, graceful shutdown into an error, and silently drops the remaining buffered records.

Proposed Solution

To support proper channel draining, RollingDataWriter should rely on close(r.recordCh) as the primary signal for the stream goroutine to process the remaining buffered records and exit naturally.

In this PR:

We remove r.cancel() from close().
We move r.cancel() to the end of closeAndWait(), after r.wg.Wait() has returned and all residual batches have been successfully processed and flushed to Parquet.

This ensures that the memory buffer is correctly fully drained, preventing silent data loss or unexpected context.Canceled panics during the finalization phase. The overall timeout/abort mechanism is still safely controlled by the parent context passed into the Write function.

…ionedFanoutWriter

tanmayrauth

Thanks for the PR, Two things before this lands:

Moving cancel() to the end of closeAndWait means it's skipped on the error return at rolling_data_writer.go:427-429, so the context leaks whenever a writer errors. defer r.cancel() near the top keeps the drain-before-cancel ordering (it still runs after wg.Wait()) while always firing.
I don't think removing r.cancel() from close() prevents cancellation during the drain. r.ctx is a child of the errgroup context (rolling_data_writer.go:266, ctx from errgroup.WithContext at partitioned_fanout_writer.go:83), and fanoutWorkers.Wait() at partitioned_fanout_writer.go:187 cancels that errgroup ctx as soon as it returns — before closeAll() runs at :188. So r.ctx is already Done during the drain, with or without this change. Could you share a repro so I can see the mechanism you're fixing?

tanmayrauth · 2026-06-18T14:47:30Z

 		return fmt.Errorf("error in rolling data writer: %w", err)
 	}

+	r.cancel()


Moving cancel() to the end of closeAndWait means it's skipped on the error return, so the context leaks whenever a writer errors. defer r.cancel() near the top keeps the drain-before-cancel ordering (it still runs after wg.Wait()) while always firing.

laskoviymishka

Diagnosis is right, cancelling before recordCh drains is what makes the in-flight Arrow calls (openFileWriter / ToRequestedSchema / SortRecordBatch) fail with context.Canceled and drop buffered records, so moving cancel past wg.Wait() is the right shape.

I'd hold before merge though: pulling cancel() out of close() opens two leaks: the error branch of closeAndWait() now returns without cancelling, and the unpartitionedWrite sites in arrow_utils.go that call close() directly lose the cancel they used to get as a side effect. defer r.cancel() inside the stream goroutine closes both at once.

A regression test for the drain would be needed too. Details inline.

laskoviymishka · 2026-06-21T19:34:53Z

 		return fmt.Errorf("error in rolling data writer: %w", err)
 	}

+	r.cancel()


placing cancel() only on the success return means the error branch above (return fmt.Errorf("error in rolling data writer: %w", err)) now exits without ever cancelling — so every writer that hits a stream error leaks its derived context until the parent is cancelled, which in a long-lived process (compaction daemon, streaming ingest) may be never. closeAll() keeps iterating after the first error, so a batch of failing writers leaks one each.

I'd move this further out: defer r.cancel() inside the stream goroutine. It owns the derived ctx and every caller already wg.Wait()s on it, so cancel fires once when streaming ends on any path — that also covers the unpartitionedWrite sites in arrow_utils.go and closeCurrentWriter in clustered_writer.go, which call close() directly and lose the cancel they used to get from it. A defer r.cancel() here in closeAndWait() only fixes this branch and leaves those two leaking. wdyt?

laskoviymishka · 2026-06-21T19:34:53Z

@@ -429,6 +428,8 @@ func (r *RollingDataWriter) closeAndWait() error {
 		return fmt.Errorf("error in rolling data writer: %w", err)


minor: if cancel() ends up staying here rather than moving into the goroutine, nlreturn will want a blank line before this return too — the new return nil below got one but this branch didn't. CI runs gofumpt + nlreturn, so worth a quick make lint before pushing.

zeroshade

Requesting changes because the current patch still leaves the graceful-drain bug unresolved. PR #1368 already implements the correct fix for this same bug; I recommend consolidating/closing #1225 in favor of #1368, or adopting #1368's approach here. Also, laskoviymishka's earlier change request remains unaddressed and there is no regression test covering this path.

zeroshade · 2026-07-04T17:31:15Z

 		return fmt.Errorf("error in rolling data writer: %w", err)
 	}

+	r.cancel()


Blocking: moving cancel() here does not fix the fanout drain path. RollingDataWriter instances are created with the errgroup.WithContext context in table/partitioned_fanout_writer.go around line 160, and fanoutWorkers.Wait() cancels that context before writerFactory.closeAll() drains writers around line 188. That means closeAndWait() can still drain buffered records under a canceled context and hit context.Canceled in stream() / openFileWriter / ToRequestedSchema / SortRecordBatch. Please decouple the per-writer drain context from the fanout errgroup context on success, and cancel writers only on the abort/error path. This is the approach already implemented in PR #1368; please consolidate with or adopt that fix.

Fix premature context cancellation during graceful shutdown in Partit…

b9c48c7

…ionedFanoutWriter

xixipi-lining requested a review from zeroshade as a code owner June 18, 2026 10:53

tanmayrauth reviewed Jun 18, 2026

View reviewed changes

zeroshade changed the title ~~Fix premature context cancellation during graceful shutdown in PartitionedFanoutWriter~~ fix: remature context cancellation during graceful shutdown in PartitionedFanoutWriter Jun 19, 2026

laskoviymishka requested changes Jun 21, 2026

View reviewed changes

zeroshade mentioned this pull request Jul 4, 2026

fix(table): separate rolling data writer close/abort paths #1368

Open

zeroshade requested changes Jul 4, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

fix: remature context cancellation during graceful shutdown in PartitionedFanoutWriter#1225

fix: remature context cancellation during graceful shutdown in PartitionedFanoutWriter#1225
xixipi-lining wants to merge 1 commit into
apache:mainfrom
xixipi-lining:fix-premature-context-cancellation

xixipi-lining commented Jun 18, 2026

Uh oh!

tanmayrauth left a comment

Uh oh!

tanmayrauth Jun 18, 2026

Uh oh!

laskoviymishka left a comment

Uh oh!

laskoviymishka Jun 21, 2026

Uh oh!

laskoviymishka Jun 21, 2026

Uh oh!

zeroshade left a comment

Uh oh!

zeroshade Jul 4, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

		@@ -429,6 +428,8 @@ func (r *RollingDataWriter) closeAndWait() error {
		return fmt.Errorf("error in rolling data writer: %w", err)

Uh oh!

Conversation

xixipi-lining commented Jun 18, 2026

Problem Statement

Proposed Solution

Uh oh!

tanmayrauth left a comment

Choose a reason for hiding this comment

Uh oh!

tanmayrauth Jun 18, 2026

Choose a reason for hiding this comment

Uh oh!

laskoviymishka left a comment

Choose a reason for hiding this comment

Uh oh!

laskoviymishka Jun 21, 2026

Choose a reason for hiding this comment

Uh oh!

laskoviymishka Jun 21, 2026

Choose a reason for hiding this comment

Uh oh!

zeroshade left a comment

Choose a reason for hiding this comment

Uh oh!

zeroshade Jul 4, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants