fix: preserve compressed/binary request bodies in proxy handler#619
Conversation
The first-party proxy handler unconditionally read request bodies via h3's readBody(), which decodes binary bytes as UTF-8 text. This corrupted compressed payloads (e.g. PostHog gzip-js) and caused upstream timeouts. When no privacy transforms are needed or the body is binary/compressed, the raw request stream is now piped directly to upstream via getRequestWebStream() — zero buffering, zero re-encoding.
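The detection that gates this passthrough can be sketched as a small predicate. This is an illustrative sketch, not the handler's actual code: it assumes h3-style lowercased header keys, and the helper name `isBinaryRequestBody` is hypothetical. The three conditions (a `content-encoding` header, an `application/octet-stream` content type, a compression query param) follow the description above.

```typescript
// Sketch: decide whether a request body must be passed through as raw bytes
// instead of being read as text. Helper name is illustrative.
function isBinaryRequestBody(
  headers: Record<string, string | undefined>,
  path: string,
): boolean {
  const contentType = (headers['content-type'] || '').toLowerCase()
  // Relative request paths need a base URL to be parsed.
  const compression = new URL(path, 'http://localhost').searchParams.get('compression')
  return Boolean(
    headers['content-encoding'] // already-encoded payload (gzip, br, ...)
    || contentType.includes('octet-stream') // declared binary
    || (compression && /gzip|deflate|br|compress/i.test(compression)), // e.g. PostHog's ?compression=gzip-js
  )
}
```

When this returns true, the handler streams the raw request to upstream rather than calling `readBody()`, so compressed bytes are never decoded as UTF-8.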
Adds posthog-js dependency, fixture page, and provider paths so PostHog is exercised in the E2E error-check sweep with a real browser connection.
Note: Reviews paused. It looks like this branch is under active development. To avoid overwhelming you with review comments due to an influx of new commits, CodeRabbit has automatically paused this review. You can configure this behavior in the review settings.
📝 Walkthrough

Detects binary/compressed request bodies (Content-Encoding present, `application/octet-stream`, or a compression query param) and introduces a passthrough mode that streams the raw request to upstream for write methods. When not in passthrough, the handler reads and may transform bodies (JSON fingerprint stripping, parsing JSON strings, form handling). Adjusts header handling (content-length, x-forwarded-for/anonymization, selected privacy masking for UA/Accept-Language/client hints) and uses duplex/raw stream as the fetch body for passthrough. Adds tests and fixtures (PostHog) and a unit suite validating byte-for-byte preservation of gzip/binary payloads and privacy-related JSON behavior.

Estimated code review effort: 🎯 4 (Complex) | ⏱️ ~45 minutes

🚥 Pre-merge checks: ✅ 4 passed | ❌ 1 failed (1 warning)
Actionable comments posted: 2
🧹 Nitpick comments (1)
test/unit/proxy-handler-binary.test.ts (1)
117-161: Add one more regression case: `compression=gzip-js` with privacy enabled and no `content-encoding`.

Current coverage validates gzip-js only when privacy is off, and privacy-on only when `content-encoding` is present. Adding this combination will lock in the intended behavior for compressed text/plain uploads.

💡 Suggested test case
```diff
+ it('preserves gzip-js body when privacy is enabled without content-encoding', async () => {
+   const payload = JSON.stringify({ event: 'test', ua: 'fingerprint' })
+   const compressed = gzipSync(Buffer.from(payload))
+
+   await fetch(`http://localhost:${proxyPort}/batch?compression=gzip-js`, {
+     method: 'POST',
+     headers: {
+       'content-type': 'text/plain',
+       'x-test-privacy': 'true',
+     },
+     body: compressed,
+   })
+
+   expect(capturedUpstreamBody).not.toBeNull()
+   expect(Buffer.compare(capturedUpstreamBody!, compressed)).toBe(0)
+ })
```

🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed. In `@test/unit/proxy-handler-binary.test.ts` around lines 117 - 161, Add a new test case mirroring "preserves gzip-compressed body sent without content-type" but with privacy enabled: send the same gzip-compressed payload to `http://localhost:${proxyPort}/batch?compression=gzip-js` including header `x-test-privacy: 'true'`, using the same `gzipSync` and `compressed` variables, then assert `capturedUpstreamBody` is not null and that `Buffer.compare(capturedUpstreamBody!, compressed) === 0`; place the test near the other binary/compression tests so it exercises the `compression=gzip-js` + privacy path.
🤖 Prompt for all review comments with AI agents
Verify each finding against the current code and only fix it if needed.
Inline comments:
In `@src/runtime/server/proxy-handler.ts`:
- Around line 232-233: The current truthy checks on rawBody drop valid falsy
payloads (''/0/false); update both occurrences (the if (rawBody) guards around
the object branch and the later check at lines ~312-314) to test explicitly for
undefined (e.g., if (rawBody !== undefined)) so only an absent body is treated
as missing while preserving the existing object branch behavior (the typeof
rawBody === 'object' branch and subsequent handling should remain unchanged).
- Around line 105-113: The current isBinaryBody check in proxy-handler.ts
(computed using getHeaders and content-type) is too narrow and misses cases like
?compression=gzip-js with text/plain, causing privacy-enabled write requests to
still call readBody; update the isBinaryBody logic used before readBody() so it
also treats compressed/encoded bodies as binary: extend the condition in the
isBinaryBody computation (used around getHeaders, contentType, and before
readBody()) to include common compression tokens in content-type (e.g., match
/gzip|br|deflate|compress|x-gzip/ or +gzip/+br suffixes) and to detect a
compression query param on the request URL (e.g.,
URL(event.url).searchParams.has('compression') or
searchParams.get('compression') includes gzip/br), so any of those cases
short-circuit to passing raw bytes instead of calling readBody().
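The first finding above (truthy guards on `rawBody`) can be demonstrated with a tiny comparison. This is an illustrative snippet, not the handler's code; `hasBody` is a hypothetical helper showing the explicit `undefined` check the review asks for.

```typescript
// A truthy guard like `if (rawBody)` treats valid falsy payloads
// ('', 0, false) as missing. Checking explicitly for undefined
// only skips a genuinely absent body.
const truthyGuard = (rawBody: unknown): boolean => Boolean(rawBody)

function hasBody(rawBody: unknown): boolean {
  return rawBody !== undefined // only an absent body counts as missing
}

console.log(truthyGuard(''), hasBody('')) // '' is a real (empty) payload
console.log(truthyGuard(0), hasBody(0))
console.log(hasBody(undefined))
```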
---
Nitpick comments:
In `@test/unit/proxy-handler-binary.test.ts`:
- Around line 117-161: Add a new test case mirroring "preserves gzip-compressed
body sent without content-type" but with privacy enabled: send the same
gzip-compressed payload to
`http://localhost:${proxyPort}/batch?compression=gzip-js` including header
`x-test-privacy: 'true'`, using the same `gzipSync` and `compressed` variables,
then assert `capturedUpstreamBody` is not null and that
`Buffer.compare(capturedUpstreamBody!, compressed) === 0`; place the test near
the other binary/compression tests so it exercises the `compression=gzip-js` +
privacy path.
ℹ️ Review info
Configuration used: Organization UI
Review profile: CHILL
Plan: Pro
📒 Files selected for processing (4)
- src/runtime/server/proxy-handler.ts
- test/e2e/first-party.test.ts
- test/fixtures/first-party/nuxt.config.ts
- test/unit/proxy-handler-binary.test.ts
- Detect ?compression=gzip-js (PostHog) so compressed bodies pass through raw even when privacy is enabled
- Fix truthy checks on rawBody/body to avoid dropping falsy-but-valid payloads
- Add test for gzip-js + privacy enabled scenario
Actionable comments posted: 2
🤖 Prompt for all review comments with AI agents
Verify each finding against the current code and only fix it if needed.
Inline comments:
In `@src/runtime/server/proxy-handler.ts`:
- Around line 219-220: The code treats request bodies as Record<string, unknown>
which causes top-level JSON arrays to be coerced into object-like shapes; update
the handling around the variables body and rawBody and any calls to
stripPayloadFingerprinting so that Array.isArray(rawBody) or Array.isArray(body)
is detected and arrays are preserved (e.g., allow body to be typed as unknown |
unknown[] | Record<string, unknown>) and pass arrays through to
stripPayloadFingerprinting or handle them with a separate array-aware branch
instead of casting to Record<string, unknown>, ensuring
stripPayloadFingerprinting receives the original array shape.
- Around line 111-117: Normalize the Content-Type header before doing
case-sensitive substring checks: change how contentType is derived from
originalHeaders (e.g., const contentType = (originalHeaders['content-type'] ||
'').toLowerCase()) and then use that lowercased value in isBinaryBody and any
other checks (such as the later includes check around line 257) so matches like
'Octet-Stream' or mixed-case types are handled correctly; update all occurrences
where contentType.includes(...) is used to reference the normalized variable.
ℹ️ Review info
Configuration used: Organization UI
Review profile: CHILL
Plan: Pro
📒 Files selected for processing (2)
- src/runtime/server/proxy-handler.ts
- test/unit/proxy-handler-binary.test.ts
🚧 Files skipped from review as they are similar to previous changes (1)
- test/unit/proxy-handler-binary.test.ts
```typescript
const contentType = originalHeaders['content-type'] || ''
const compressionParam = new URL(event.path, 'http://localhost').searchParams.get('compression')
const isBinaryBody = Boolean(
  originalHeaders['content-encoding']
  || contentType.includes('octet-stream')
  || (compressionParam && /gzip|deflate|br|compress/i.test(compressionParam)),
)
```
Normalize Content-Type before matching.
At Line 115 and Line 257, includes(...) checks are case-sensitive. Content-Type is case-insensitive by spec, so mixed-case values can bypass binary/form handling and route through the wrong path.
💡 Suggested fix

```diff
- const contentType = originalHeaders['content-type'] || ''
+ const contentType = originalHeaders['content-type'] || ''
+ const normalizedContentType = contentType.toLowerCase()
@@
- || contentType.includes('octet-stream')
+ || normalizedContentType.includes('octet-stream')
@@
- else if (contentType.includes('application/x-www-form-urlencoded')) {
+ else if (normalizedContentType.includes('application/x-www-form-urlencoded')) {
```

Also applies to: 257-257
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.
In `@src/runtime/server/proxy-handler.ts` around lines 111 - 117, Normalize the
Content-Type header before doing case-sensitive substring checks: change how
contentType is derived from originalHeaders (e.g., const contentType =
(originalHeaders['content-type'] || '').toLowerCase()) and then use that
lowercased value in isBinaryBody and any other checks (such as the later
includes check around line 257) so matches like 'Octet-Stream' or mixed-case
types are handled correctly; update all occurrences where
contentType.includes(...) is used to reference the normalized variable.
PostHog's config.js endpoint returns JSON but the SDK requests it as a script, causing a MIME type error in strict-mode browsers. This is a known third-party behavior, not a proxy rewrite bug.
Actionable comments posted: 1
🤖 Prompt for all review comments with AI agents
Verify each finding against the current code and only fix it if needed.
Inline comments:
In `@test/e2e/first-party.test.ts`:
- Line 947: The test currently unconditionally suppresses the regex "/MIME type
.* is not executable/i", which hides real proxy/content-type regressions; change
the logic that builds the ignored-errors list (where "/MIME type .* is not
executable/i" is added) so that this pattern is only applied for the PostHog
config.js case — detect the PostHog provider or request path (e.g., provider
name 'posthog' or URL ending in "config.js") and add the MIME-ignore only in
that branch; leave all other providers/tests to surface the MIME execution error
normally.
Only suppress the MIME-not-executable error when the URL contains config.js, so real MIME type regressions on other providers still surface.
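That scoping can be sketched as a small predicate over the console error and the requesting URL. This is an illustrative sketch, assuming the test collects (message, url) pairs; `isIgnorableError` is a hypothetical helper, not the actual test code.

```typescript
// Only swallow the MIME-not-executable error for the known PostHog
// config.js case; every other provider still surfaces it as a failure.
const MIME_NOT_EXECUTABLE = /MIME type .* is not executable/i

function isIgnorableError(message: string, url: string): boolean {
  return MIME_NOT_EXECUTABLE.test(message) && url.includes('config.js')
}
```

Any other error message, or the same MIME error from a non-config.js URL, still fails the test.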
Arrays passed to stripPayloadFingerprinting were coerced into objects
({"0": ..., "1": ...}). Now arrays are detected and each element is
stripped individually, preserving the original array shape.
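The array-preserving dispatch described above can be sketched as follows. This is a sketch, not the handler's implementation: `stripObject` stands in for the real per-object fingerprint stripping in `proxy-handler.ts`, and `stripPayload` is a hypothetical name.

```typescript
// Recurse into arrays element by element instead of casting them to
// Record<string, unknown>, which would coerce [a, b] into {"0": a, "1": b}.
function stripPayload(
  body: unknown,
  stripObject: (o: Record<string, unknown>) => Record<string, unknown>,
): unknown {
  if (Array.isArray(body)) {
    // Strip each element individually, preserving the array shape.
    return body.map(item => stripPayload(item, stripObject))
  }
  if (body && typeof body === 'object') {
    return stripObject(body as Record<string, unknown>)
  }
  return body // primitives pass through untouched
}
```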
4xx responses from upstream APIs are expected with fake test API keys (e.g. PostHog returning 404/401 for phc_test). Only 5xx indicates actual proxy infrastructure failures.
Use the same real API key from the basic fixture so upstream returns valid responses. Scope failed proxy request assertions to only check the current provider's paths to avoid cross-provider noise from globally-registered scripts.
src/runtime/server/proxy-handler.ts
Outdated
```typescript
const isBinaryBody = Boolean(
  originalHeaders['content-encoding']
  || contentType.includes('octet-stream')
  || (compressionParam && /gzip|deflate|br|compress/i.test(compressionParam)),
```
They also use base64, see: https://posthog.com/docs/references/posthog-js-1.285.0/types/Compression
- Add base64 to compression query param regex (PostHog uses it)
- Replace placeholder IDs with real ones from playground across plausible, cloudflare, rybbit, fathom, intercom, and crisp
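The widened matcher might look like the following. This is a sketch mirroring the regex shown in the commented-on diff, extended with `base64` per the review note; `isCompressedParam` is an illustrative name, not the handler's actual helper.

```typescript
// Compression tokens a ?compression= query param may carry,
// now including PostHog's base64 mode.
const COMPRESSION_RE = /gzip|deflate|br|compress|base64/i

function isCompressedParam(param: string | null): boolean {
  return param !== null && COMPRESSION_RE.test(param)
}
```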
Every console error, uncaught exception, and failed proxy request is now treated as critical. No noise lists, no per-provider scoping — if it errors, the test fails.
…ider lists
- Use [input, { trigger: 'manual' }] registry format with runtimeConfig
- Remove all noise filtering from test assertions
- Fix Nitro publicAssets leading-dash filename bug
- Add PostHog to error check and bundle coverage provider lists
- Update snapshots
- Reddit pixel ID → a2_ilz4u0kbdr3v
- Rybbit siteId → 874 (was analyticsId, which doesn't exist)
- Umami websiteId → ae15c227-67e8-434a-831f-67e6df88bd6c
- Add pixel-config.reddit.com to reddit proxy config (CORS fix)
- Update page components to match config keys
🔗 Linked issue
Resolves #618
❓ Type of change
📚 Description
The first-party proxy handler unconditionally read request bodies via h3's `readBody()`, which decodes binary bytes as UTF-8 text. This corrupted compressed payloads (e.g. PostHog with `compression=gzip-js`) and caused repeated upstream timeouts (504).

When no privacy transforms are needed (`anyPrivacy=false`) or the body is binary/compressed (`content-encoding` header or `application/octet-stream`), the raw request stream is now piped directly to upstream via `getRequestWebStream()` — zero buffering, zero re-encoding. Privacy stripping still uses `readBody()` only for text bodies that actually need transforms.

Also adds PostHog to the first-party E2E test fixture and a new unit test suite (`proxy-handler-binary.test.ts`) covering gzip, binary, and passthrough scenarios.