[SPARK-56227][CORE] Fix GcmTransportCipher to correctly handle multiple messages per channel by aajisaka · Pull Request #55028 · apache/spark

aajisaka · 2026-03-26T01:43:58Z

What changes were proposed in this pull request?

This fixes three bugs in GcmTransportCipher introduced by SPARK-47172, plus one minor buffer-sharing issue.

Bug 1: DecryptionHandler silently drops every message after the first.

AesGcmHkdfStreaming is a one-shot streaming primitive — each independently encrypted message carries its own random IV and requires a fresh StreamSegmentDecrypter. The DecryptionHandler never reset its per-message state (completed, decrypterInit, expectedLength, segmentNumber, etc.) nor replaced the single final StreamSegmentDecrypter instance between messages. After the first message was decoded, completed = true permanently, and all subsequent messages were silently dropped because both initalizeExpectedLength() and initalizeDecrypter() returned early as no-ops and the inner while loop never ran.

Fix: add resetForNextMessage() which clears all per-message fields and allocates a new StreamSegmentDecrypter; call it after each fully decoded message.
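
The reset pattern can be illustrated without Netty or the real cipher. The following is a minimal sketch, not the patch itself: `OneShotDecrypter` and `ResettableHandlerSketch` are hypothetical stand-ins, the "decryption" is an identity copy, and each call is assumed to carry exactly one whole frame (an 8-byte length followed by the body).

```java
import java.nio.ByteBuffer;
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.List;

// Hypothetical stand-in for a one-shot decrypter: usable for exactly one message.
final class OneShotDecrypter {
    private boolean used;

    byte[] decryptOnce(byte[] body) {
        if (used) {
            throw new IllegalStateException("decrypter already used");
        }
        used = true;
        return body.clone(); // identity placeholder; no real cryptography here
    }
}

// Hypothetical handler; assumes one whole frame per channelRead() call.
final class ResettableHandlerSketch {
    private OneShotDecrypter decrypter = new OneShotDecrypter(); // deliberately not final
    private boolean completed;
    private long expectedLength = -1;
    final List<byte[]> decoded = new ArrayList<>();

    void channelRead(ByteBuffer frame) {
        if (completed) {
            return; // without a reset, every later frame would be dropped here
        }
        expectedLength = frame.getLong(); // 8-byte length prefix
        byte[] body = new byte[(int) expectedLength];
        frame.get(body);
        decoded.add(decrypter.decryptOnce(body));
        completed = true;
        resetForNextMessage();
    }

    // Clears all per-message fields and allocates a fresh decrypter.
    private void resetForNextMessage() {
        completed = false;
        expectedLength = -1;
        decrypter = new OneShotDecrypter();
    }

    // Test helper: builds a frame of 8-byte length followed by the UTF-8 payload.
    static ByteBuffer frame(String payload) {
        byte[] bytes = payload.getBytes(StandardCharsets.UTF_8);
        ByteBuffer buf = ByteBuffer.allocate(8 + bytes.length);
        buf.putLong(bytes.length).put(bytes).flip();
        return buf;
    }
}
```

If the call to `resetForNextMessage()` is removed from this sketch, the second `channelRead()` returns at the `completed` guard and the message is lost without any exception, which mirrors the silent drop described above.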

Bug 2: DecryptionHandler discards bytes from messages batched in the same channelRead() call.

Under shuffle load, TCP coalesces multiple encrypted messages into a single ByteBuf. The original code exited the decryption loop as soon as one message completed and released the buffer — including any trailing bytes belonging to subsequent messages. The next channelRead() then received bytes starting mid-stream of the second message, interpreted them as an 8-byte length header, and threw: IllegalStateException: Invalid expected ciphertext length.

Fix: wrap the decryption logic in an outer loop that continues consuming messages from the same buffer until either the buffer is exhausted or a partial message is encountered. resetForNextMessage() is called inside the loop immediately after each complete message while the buffer is still held.
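
As a rough illustration of the outer loop, the sketch below drains every complete frame from one input buffer and returns only when it hits a partial frame or runs out of bytes. It uses plain `java.nio.ByteBuffer` instead of Netty's `ByteBuf`, skips decryption entirely, and `CoalescedDecoderSketch` with its helpers is a hypothetical name chosen for this example.

```java
import java.nio.ByteBuffer;
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.List;

// Hypothetical decoder for frames of the form: 8-byte length + body.
final class CoalescedDecoderSketch {
    private final ByteBuffer header = ByteBuffer.allocate(8);
    private ByteBuffer body; // null until the length header is complete
    final List<byte[]> decoded = new ArrayList<>();

    void channelRead(ByteBuffer in) {
        // Outer loop: keep consuming messages while the input is still held.
        while (true) {
            if (body == null) {
                copy(in, header);
                if (header.hasRemaining()) {
                    return; // partial header (or input exhausted): wait for more bytes
                }
                header.flip();
                long len = header.getLong();
                if (len < 0 || len > Integer.MAX_VALUE) {
                    throw new IllegalStateException("Invalid expected ciphertext length");
                }
                body = ByteBuffer.allocate((int) len);
            }
            copy(in, body);
            if (body.hasRemaining()) {
                return; // partial message: keep state for the next call
            }
            decoded.add(body.array());
            resetForNextMessage(); // trailing bytes in 'in' now start a new message
        }
    }

    private void resetForNextMessage() {
        header.clear();
        body = null;
    }

    // Copies as many bytes as both buffers allow, advancing both positions.
    private static void copy(ByteBuffer src, ByteBuffer dst) {
        int n = Math.min(src.remaining(), dst.remaining());
        ByteBuffer slice = src.duplicate();
        slice.limit(slice.position() + n);
        dst.put(slice);
        src.position(src.position() + n);
    }

    // Test helper: builds one frame as a byte array.
    static byte[] frame(String payload) {
        byte[] bytes = payload.getBytes(StandardCharsets.UTF_8);
        return ByteBuffer.allocate(8 + bytes.length).putLong(bytes.length).put(bytes).array();
    }
}
```

Feeding two concatenated frames in a single call yields two decoded messages. A trailing fragment of a third frame stays buffered in the decoder's state until the remaining bytes arrive in a later call.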

Bug 3: TCP-fragmented frame header causes IndexOutOfBoundsException.

ByteBuf.readBytes(ByteBuffer dst) requires exactly dst.remaining() bytes to be present and throws IndexOutOfBoundsException if the source is shorter. Under high load, TCP can fragment a GCM message's 24-byte internal header (or 8-byte length prefix) across multiple channelRead() calls. The code incorrectly assumed readBytes would stop early and leave hasRemaining() == true.

Fix: compute toRead = Math.min(readable, dst.remaining()), temporarily narrow dst.limit to position + toRead, call readBytes(dst), then restore limit.
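
The limit-narrowing step can be sketched with the JDK alone. `StrictSource` below is a hypothetical stand-in that only mimics the all-or-nothing contract described above (it is not Netty's `ByteBuf`), and `readAvailable` is an illustrative helper name, not a method from the patch.

```java
import java.nio.ByteBuffer;

// Hypothetical source with the strict contract: fill dst completely or throw.
final class StrictSource {
    private final byte[] data;
    private int readerIndex;

    StrictSource(byte[] data) {
        this.data = data;
    }

    int readableBytes() {
        return data.length - readerIndex;
    }

    void readBytes(ByteBuffer dst) {
        int n = dst.remaining();
        if (n > readableBytes()) {
            throw new IndexOutOfBoundsException(
                "need " + n + " bytes, only " + readableBytes() + " readable");
        }
        dst.put(data, readerIndex, n);
        readerIndex += n;
    }
}

final class BoundedReadSketch {
    // Reads only what is available, leaving dst.hasRemaining() == true on a short read.
    static void readAvailable(StrictSource src, ByteBuffer dst) {
        int toRead = Math.min(src.readableBytes(), dst.remaining());
        int savedLimit = dst.limit();
        dst.limit(dst.position() + toRead); // temporarily narrow the destination
        try {
            src.readBytes(dst);
        } finally {
            dst.limit(savedLimit); // restore so later fragments can fill the rest
        }
    }
}
```

With a 24-byte destination and a 4-byte fragment, a direct `readBytes(dst)` on the strict source throws, while `readAvailable` copies 4 bytes and leaves 20 remaining for the next fragment.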

Bug 4 (minor): EncryptionHandler shares working buffers across concurrent GcmEncryptedMessage instances.

plaintextBuffer and ciphertextBuffer were fields of EncryptionHandler passed into every GcmEncryptedMessage. The constructor's ciphertextBuffer.limit(0) call could corrupt an in-flight message's buffer state if Netty batched writes. Fix: move buffer ownership into GcmEncryptedMessage so each message allocates its own working buffers.
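
A simplified picture of per-message buffer ownership is sketched below. `OwnedBufferMessage` is a hypothetical class, its `transferTo` signature is invented for the example, and the staging step copies bytes unchanged instead of encrypting them. The point is only that a constructor-time `limit(0)` touches a buffer that no other message can see.

```java
import java.io.ByteArrayOutputStream;
import java.nio.ByteBuffer;

// Hypothetical message that owns its working buffer instead of borrowing the handler's.
final class OwnedBufferMessage {
    private final ByteBuffer plaintext;
    private final ByteBuffer ciphertextBuffer; // allocated per message

    OwnedBufferMessage(byte[] payload, int segmentSize) {
        this.plaintext = ByteBuffer.wrap(payload);
        this.ciphertextBuffer = ByteBuffer.allocate(segmentSize);
        // Mark the staging buffer empty; harmless because nobody else holds it.
        this.ciphertextBuffer.limit(0);
    }

    // Writes at most maxBytes to out and returns the count, mimicking a partial transfer.
    int transferTo(ByteArrayOutputStream out, int maxBytes) {
        int written = 0;
        while (written < maxBytes) {
            if (!ciphertextBuffer.hasRemaining()) {
                if (!plaintext.hasRemaining()) {
                    break; // everything has been sent
                }
                // Stage the next segment (identity placeholder for encryption).
                ciphertextBuffer.clear();
                int n = Math.min(plaintext.remaining(), ciphertextBuffer.capacity());
                for (int i = 0; i < n; i++) {
                    ciphertextBuffer.put(plaintext.get());
                }
                ciphertextBuffer.flip();
            }
            out.write(ciphertextBuffer.get());
            written++;
        }
        return written;
    }
}
```

In this sketch, constructing a second message while the first is only partly transferred leaves the first message's staged bytes intact, which is the property the fix aims for.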

Without the above fixes, enabling AES/GCM/NoPadding RPC encryption causes YARN executor containers to fail: the auth handshake succeeds but all post-auth RPC messages are dropped or corrupted, leaving the channel hung until YARN kills the container.

Why are the changes needed?

To successfully run Spark jobs on YARN with spark.network.crypto.cipher="AES/GCM/NoPadding".

Fixes #54999

Does this PR introduce any user-facing change?

No

How was this patch tested?

Added unit tests:

testMultipleMessages: encrypts and decrypts two independent messages through the same handler pair with separate channelRead() calls.
testBatchedMessages: concatenates two ciphertexts into one ByteBuf and delivers them in a single channelRead() call, verifying both are decoded correctly.
testSplitHeader: ciphertext split at byte 12 (8-byte length field + 4 bytes into the 24-byte GCM header) across two channelRead() calls.

Ported these changes to our Spark 3.4.x-based internal branch and ran multiple jobs on a YARN cluster successfully.

Was this patch authored or co-authored using generative AI tooling?

Generated-by: Claude Code (Claude Opus 4.6)

aajisaka · 2026-03-26T04:35:35Z

Converted to draft. We are seeing job failures in a benchmarking test.

…le messages per channel

Three bugs in `GcmTransportCipher` cause failures in production YARN clusters when AES-GCM RPC encryption is enabled (`spark.network.crypto.cipher=AES/GCM/NoPadding`).

**Bug 1: DecryptionHandler is single-use per channel (YARN container launch failure)**

After decoding the first post-auth message, `completed = true` was never reset. `AesGcmHkdfStreaming` is a one-shot streaming primitive: each GCM message carries its own random IV and requires a fresh `StreamSegmentDecrypter`. With `decrypter` declared `final` and all guard flags stuck at their terminal values, every subsequent message on the channel was silently discarded.

Fix: make `decrypter` non-final, add `resetForNextMessage()` that reinstates all per-message state (including a fresh `StreamSegmentDecrypter`), and call it after each successfully decoded message.

**Bug 2: TCP-coalesced messages lost (SparkSQL IllegalStateException)**

When TCP delivers multiple back-to-back GCM messages in a single `channelRead()` call (common under shuffle load), the old code released the `ByteBuf` after decoding the first message, discarding any remaining bytes. The next `channelRead()` then read bytes from the middle of the second message as its length header, producing a negative `long` and throwing `IllegalStateException("Invalid expected ciphertext length")`.

Fix: wrap the decode logic in an outer `while(true)` loop that exhausts all complete messages from the buffer before releasing it; call `resetForNextMessage()` inside the loop between messages.

**Bug 3: TCP-fragmented frame header causes IndexOutOfBoundsException (benchmark)**

`ByteBuf.readBytes(ByteBuffer dst)` requires exactly `dst.remaining()` bytes to be present and throws `IndexOutOfBoundsException` if the source is shorter. Under high load, TCP can fragment a GCM message's 24-byte internal header (or 8-byte length prefix) across multiple `channelRead()` calls. The code incorrectly assumed `readBytes` would stop early and leave `hasRemaining() == true`.

Fix: compute `toRead = Math.min(readable, dst.remaining())`, temporarily narrow `dst.limit` to `position + toRead`, call `readBytes(dst)`, then restore `limit`.

**Bug 4: EncryptionHandler shares mutable buffers across GcmEncryptedMessage instances**

`plaintextBuffer` and `ciphertextBuffer` were `EncryptionHandler` fields reused across all `GcmEncryptedMessage` instances. Under Netty's write pipeline a new message can be constructed (via `write()`) before a prior one's `transferTo()` completes; the new constructor's `ciphertextBuffer.limit(0)` would corrupt the in-flight message's buffer.

Fix: allocate `plaintextBuffer` and `ciphertextBuffer` inside the `GcmEncryptedMessage` constructor so each message owns its own buffers.

- Cache `headerLength` in `DecryptionHandler` to avoid repeated `getHeaderLength()` calls
- Replace `Integer.min()` with `Math.min()` for style consistency

- `testMultipleMessages`: regression for Bug 1: same `DecryptionHandler` decodes two independent messages delivered via separate `channelRead()` calls
- `testBatchedMessages`: regression for Bug 2: two ciphertexts concatenated into one `ByteBuf` and delivered in a single `channelRead()` call
- `testSplitHeader`: regression for Bug 3: ciphertext split at byte 12 (8-byte length field + 4 bytes into the 24-byte GCM header) across two `channelRead()` calls

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

aajisaka · 2026-03-27T09:39:42Z

Our internal benchmark tests passed on a YARN cluster. This patch is ready for review.

aajisaka changed the title ~~Fix GcmTransportCipher to correctly handle multiple messages per channel~~ [SPARK-56227] Fix GcmTransportCipher to correctly handle multiple messages per channel Mar 26, 2026

aajisaka changed the title ~~[SPARK-56227] Fix GcmTransportCipher to correctly handle multiple messages per channel~~ [SPARK-56227][CORE] Fix GcmTransportCipher to correctly handle multiple messages per channel Mar 26, 2026

aajisaka marked this pull request as draft March 26, 2026 04:35

aajisaka force-pushed the fix-rpc-encryption branch from 7b33cf6 to ed42963 Compare March 27, 2026 09:36

aajisaka marked this pull request as ready for review March 27, 2026 09:39

aajisaka wants to merge 1 commit into apache:master from aajisaka:fix-rpc-encryption
