fix(engine): write the audio mix filter graph to a file, not the command line by miguel-heygen · Pull Request #1890 · heygen-com/hyperframes

miguel-heygen · 2026-07-03T08:36:40Z

Summary

mixAudioTracks built the ffmpeg -filter_complex argument as one inline string that scales linearly with track count. Reported in the wild at 146 timed audio clips: the resulting command line exceeded the OS argument-length limit and spawn failed with ENAMETOOLONG, dropping audio entirely until the user manually consolidated clips to reduce the count.

Fix

FFmpeg supports -filter_complex_script <file> specifically for this — the same filter graph read from a file instead of inlined as a command-line argument. -i <path> pairs for each track still scale with count, but each is short and fixed-size; the one component that actually grew unbounded (the filter graph string) is off the command line entirely.

The temp script file is written right before the runFfmpeg call and cleaned up immediately after it resolves (success or failure, via .finally()), matching the existing sibling temp-file convention already used in audioVolumeEnvelope.ts.

Test plan

bunx vitest run packages/engine/src/services/audioMixer.test.ts — 7 tests pass (6 existing, updated to read the filter graph from the mock's captured file content instead of the now-removed inline -filter_complex string, + 1 new regression test)
New test: 150 tracks (reproducing the reported 146-clip shape) keeps the ffmpeg args array's total character length under 20K, uses -filter_complex_script (not -filter_complex), and the captured filter script's content still contains a correct amix=inputs=150 and one atrim= segment per track
bunx vitest run packages/engine/ — full package, 841 tests pass, no regressions
Verified end-to-end against a real ffmpeg binary (not just mocked): generated two real sine-wave WAV files, ran processCompositionAudio unmocked, confirmed correct output audio (right duration, right codec via ffprobe) and no leftover temp files after completion

…and line mixAudioTracks built the ffmpeg -filter_complex argument as one inline string scaling linearly with track count. Reported in the wild at 146 timed audio clips: the resulting command line exceeded the OS length limit and spawn failed with ENAMETOOLONG, dropping audio entirely until the user manually consolidated clips to reduce the count. FFmpeg supports -filter_complex_script specifically for this - the same filter graph read from a file instead of inlined as an argument. The -i pairs for each track still scale with count but stay short and fixed-size each, so the one component that actually grew unbounded (the filter string) no longer sits on the command line at all. The temp file is cleaned up immediately after ffmpeg exits, matching the existing sibling temp-file convention in audioVolumeEnvelope.ts. Verified end-to-end against a real ffmpeg binary (not just mocked): a two-track mix produced correct output audio with no leftover temp files.

james-russo-rames-d-jusso

Reviewed at 23b0fdd.

🟢 Correct fix using the exact FFmpeg feature designed for this problem (-filter_complex_script <file> vs inline -filter_complex <string>). Sidesteps the argv length limit for the one component that actually grew unbounded (the filter graph); inputs (-i pairs) still scale linearly but each is short and fixed-size, and the args-array total stays well under 20KB even at 150 tracks (verified in the new regression test).

Verification notes:

End-to-end verification against real ffmpeg (not just mocked) called out in the PR body — appreciated, this is the shape of verification I want to see on wire-format changes to a subprocess boundary.
Temp-file lifecycle: mkdtempSync for the script dir, openSync(..., "wx", 0o600) for exclusive create + owner-only perms, rmSync(scriptDir, { recursive: true, force: true }) in .finally() covers both success and failure paths. Matches the sibling convention in audioVolumeEnvelope.ts.
Test-mock retrofit is careful — the capturedFilterScripts side array is index-aligned with call order, and the two mockImplementationOnce overrides in the automation-fallback test explicitly push placeholders to keep alignment. Non-obvious enough to warrant the comment; author added it.

Concerns:

🟡 scriptDir is created in outputDir (= dirname(outputPath)). If a downstream render pipeline globs outputDir for output artifacts (e.g. ls outputDir/*.m4a, or a manifest-diff scan), the .filter-complex-*/ temp dirs are transient enough that they only exist for the ffmpeg process lifetime — but if the process is force-killed mid-run (SIGKILL bypasses .finally()), you'll leak a .filter-complex-<random>/graph.txt sitting next to the output. Consider tmpdir() instead of outputDir as the parent, or a cleanup sweep in the caller. Low-severity; the leak is a few KB per crash.
🟡 The automation-fallback retry path (runMix(true)) creates a SECOND script dir + file. Each retry writes a new temp; if the first mix hangs and gets timed out, both dirs exist briefly. .finally() on each individually is correct, just noting the pattern for future readers.

Nits:

🟡 writeFileSync(fd, ...) after openSync(...) — the try/finally correctly closeSyncs the fd, but if writeFileSync throws (disk full, etc.), the empty file + dir persists briefly until process exit. The .finally() on the outer runFfmpeg chain handles this. Acceptable.

Full package test pass (841 tests, no regressions) + real-ffmpeg verification is strong evidence. Nothing blocking.

— Rames D Jusso

vanceingalls

🟢 R1 verdict — LGTM (runtime-interop lens)

Reported real-world ENAMETOOLONG at 146 tracks. Fix pushes the one unbounded component (the filter graph string) off argv and into a temp file, keeping the -i pairs (fixed-size each) on the command line. Sound minimal fix.

Finding-by-finding

1. Race safety on parallel `runMix` invocations — 🟢

packages/engine/src/services/audioMixer.ts:433-436

const scriptDir = mkdtempSync(join(outputDir, ".filter-complex-"));
const scriptPath = join(scriptDir, "graph.txt");
const fd = openSync(scriptPath, "wx", 0o600);

Two things to like here after the second commit:

mkdtempSync guarantees a unique per-mix directory, so two concurrent mixAudioTracks calls sharing an outputDir don't stomp each other's graph.txt.
openSync(..., "wx", 0o600) fails fast on any existing file and forbids other-user read of the graph content while it's on disk. Belt-and-braces on top of the unique dir; not necessary but not harmful.

The .finally(() => rmSync(scriptDir, { recursive: true, force: true })) cleans up on both the initial mix and the automation-degraded retry, and force: true means a missing dir isn't an error — safe against a signal-abort mid-mix that already deleted it. LGTM.

2. Downstream consumer sweep — 🟢

mixAudioTracks is called only from processCompositionAudio in the same file (verified via repo-wide code search). No external export, no shape change to worry about. Blast radius zero.

3. Reachability under prod defaults — 🟢

The new runMix path replaces the old inline-filter path unconditionally — every existing call site hits the new code. No conditional gate, no feature flag. Prod defaults use it.

4. Test integrity — 🟢 (out of lens but worth flagging positively)

The capturedFilterScripts side-array with an index-aligned placeholder pushed from the two mockImplementationOnce calls in the degraded-automation test (line ~114-126) is the right approach — the file is unlinked before Vitest's assertions run, so reading it after processCompositionAudio resolves would ENOENT. The mock captures it synchronously while it still exists on disk. Clean.

No blockers.

Review by Via (runtime-interop lens)

github-advanced-security AI found potential problems Jul 3, 2026

View reviewed changes

Comment thread packages/engine/src/services/audioMixer.ts Fixed

fix(engine): create audio filter scripts safely

23b0fdd

miguel-heygen marked this pull request as ready for review July 4, 2026 20:16

james-russo-rames-d-jusso reviewed Jul 4, 2026

View reviewed changes

vanceingalls reviewed Jul 4, 2026

View reviewed changes

miguel-heygen merged commit 8a3227f into main Jul 4, 2026
52 checks passed

miguel-heygen deleted the fix/audiomixer-filter-complex-script branch July 4, 2026 21:08

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

fix(engine): write the audio mix filter graph to a file, not the command line#1890

fix(engine): write the audio mix filter graph to a file, not the command line#1890
miguel-heygen merged 2 commits into
mainfrom
fix/audiomixer-filter-complex-script

miguel-heygen commented Jul 3, 2026

Uh oh!

Uh oh!

james-russo-rames-d-jusso left a comment

Uh oh!

vanceingalls left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Uh oh!

Conversation

miguel-heygen commented Jul 3, 2026

Summary

Fix

Test plan

Uh oh!

Uh oh!

james-russo-rames-d-jusso left a comment

Choose a reason for hiding this comment

Uh oh!

vanceingalls left a comment

Choose a reason for hiding this comment

🟢 R1 verdict — LGTM (runtime-interop lens)

Finding-by-finding

1. Race safety on parallel runMix invocations — 🟢

2. Downstream consumer sweep — 🟢

3. Reachability under prod defaults — 🟢

4. Test integrity — 🟢 (out of lens but worth flagging positively)

No blockers.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

1. Race safety on parallel `runMix` invocations — 🟢