rpc: avoid ForkJoinPool compensation in send polling loop by jkschneider · Pull Request #7617 · openrewrite/rewrite

jkschneider · 2026-05-09T19:43:51Z

What's changed?

RewriteRpc.send's polling loop now uses future.getNow(null) + Thread.sleep(1ms) instead of future.get(checkIntervalMs, TimeUnit.MILLISECONDS). The liveness check is decoupled from the polling cadence and fires every 500ms.
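A minimal sketch of the loop shape described here, not the actual RewriteRpc code. The class name PollingSend, the await signature, the processAlive supplier, and the overall timeout parameter are assumptions made for illustration; only getNow(null), the 1ms sleep, and the 500ms liveness interval come from this PR.

```java
import java.util.concurrent.CompletableFuture;
import java.util.concurrent.TimeUnit;
import java.util.function.BooleanSupplier;

// Hypothetical helper; names and signature are illustrative only.
final class PollingSend {
    static final long LIVENESS_INTERVAL_NANOS = TimeUnit.MILLISECONDS.toNanos(500);

    static <T> T await(CompletableFuture<T> future, BooleanSupplier processAlive, long timeoutMs)
            throws InterruptedException {
        long start = System.nanoTime();
        long lastLivenessCheck = start;
        long timeoutNanos = TimeUnit.MILLISECONDS.toNanos(timeoutMs);
        while (true) {
            // getNow never blocks: it returns the fallback (null) while the future is incomplete.
            // This sketch therefore assumes a real response is never null.
            T response = future.getNow(null);
            if (response != null) {
                return response;
            }
            long now = System.nanoTime();
            // The liveness check runs on its own 500ms cadence, independent of the 1ms poll.
            if (now - lastLivenessCheck >= LIVENESS_INTERVAL_NANOS) {
                if (!processAlive.getAsBoolean()) {
                    throw new IllegalStateException("rpc process exited before responding");
                }
                lastLivenessCheck = now;
            }
            if (now - start >= timeoutNanos) {
                throw new IllegalStateException("timed out waiting for rpc response");
            }
            // Plain sleep: no ForkJoinPool.ManagedBlocker involved.
            Thread.sleep(1);
        }
    }
}
```

The trade-off is up to roughly 1ms of added latency per response in exchange for never entering ForkJoinPool.managedBlock from a worker thread.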

What's your motivation?

future.get(long, TimeUnit) from a ForkJoinPool worker goes through CompletableFuture.Signaller.block, which is a ForkJoinPool.ManagedBlocker. ManagedBlocker is the explicit hook FJP watches for and reacts to by spawning a compensation worker — a fresh thread added to keep parallelism while the original is parked.
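One way to observe this locally is a small probe that parks a single ForkJoinPool worker and samples the pool size while it waits. This is a sketch under assumptions: CompensationProbe and its method are hypothetical names, and the exact compensation policy differs between JDK versions, so the timed-get number is printed rather than asserted.

```java
import java.util.concurrent.CompletableFuture;
import java.util.concurrent.ForkJoinPool;
import java.util.concurrent.Future;
import java.util.concurrent.TimeUnit;

// Hypothetical probe, not part of the PR.
final class CompensationProbe {
    /** Runs one task on a parallelism-1 pool and samples the pool size while that task waits. */
    static int poolSizeWhileWaiting(boolean useTimedGet) throws Exception {
        ForkJoinPool pool = new ForkJoinPool(1);
        CompletableFuture<String> response = new CompletableFuture<>();
        try {
            Future<?> task = pool.submit(() -> {
                try {
                    if (useTimedGet) {
                        // From a pool worker, this wait goes through ForkJoinPool.managedBlock.
                        response.get(5, TimeUnit.SECONDS);
                    } else {
                        // Non-blocking check plus a plain sleep.
                        while (response.getNow(null) == null) {
                            Thread.sleep(1);
                        }
                    }
                } catch (Exception e) {
                    throw new IllegalStateException(e);
                }
            });
            Thread.sleep(200); // give the worker time to start waiting
            int size = pool.getPoolSize();
            response.complete("done");
            task.get(5, TimeUnit.SECONDS);
            return size;
        } finally {
            pool.shutdownNow();
        }
    }

    public static void main(String[] args) throws Exception {
        System.out.println("timed get:      pool size = " + poolSizeWhileWaiting(true));
        System.out.println("getNow + sleep: pool size = " + poolSizeWhileWaiting(false));
    }
}
```

With sleep polling the pool should stay at its single worker; any extra worker in the timed-get case is the compensation thread.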

That compensation worker can pick up other queued work (recipe load, printer invocations, etc.) and call PythonRewriteRpc.getOrStart() / JavaScriptRewriteRpc.getOrStart() / etc., spawning a fresh OS rpc subprocess into its ThreadLocal. When FJP later terminates the idle compensation worker, its TL is GC'd — but the OS process is independent of the JVM heap and survives. The dispatching worker's RunTask.execute() finally only runs shutdownCurrent() on the dispatching thread's TL, never on the dead compensation worker.
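The leak mechanism can be reduced to a few lines. This sketch uses a stand-in FakeRpcProcess with an alive counter instead of a real subprocess; the class, the counter, and the simplified getOrStart / shutdownCurrent bodies are assumptions for illustration and only mirror the names used in this PR.

```java
import java.util.concurrent.atomic.AtomicInteger;

// Illustrative model of the leak; not the real RewriteRpc classes.
final class ThreadLocalLeakSketch {
    static final AtomicInteger ALIVE = new AtomicInteger();

    /** Stand-in for an OS subprocess: it stays "alive" until close() is called explicitly. */
    static final class FakeRpcProcess implements AutoCloseable {
        FakeRpcProcess() { ALIVE.incrementAndGet(); }
        @Override public void close() { ALIVE.decrementAndGet(); }
    }

    static final ThreadLocal<FakeRpcProcess> CURRENT = new ThreadLocal<>();

    static FakeRpcProcess getOrStart() {
        FakeRpcProcess p = CURRENT.get();
        if (p == null) {
            p = new FakeRpcProcess();
            CURRENT.set(p);
        }
        return p;
    }

    /** Only ever sees the calling thread's slot. */
    static void shutdownCurrent() {
        FakeRpcProcess p = CURRENT.get();
        if (p != null) {
            p.close();
            CURRENT.remove();
        }
    }

    /** Returns how many fake processes are left alive after one simulated run. */
    static int leakedAfterRun() throws InterruptedException {
        int before = ALIVE.get();
        getOrStart(); // the dispatching thread starts its rpc
        Thread compensation = new Thread(ThreadLocalLeakSketch::getOrStart);
        compensation.start(); // a second thread starts its own rpc
        compensation.join();  // ...and then terminates without cleanup
        shutdownCurrent();    // the finally block only reaches the dispatching thread's rpc
        return ALIVE.get() - before;
    }
}
```

The dead thread's ThreadLocal entry becomes unreachable, but nothing ever calls close() on it, which is the position the real OS process is left in.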

On a moderne-cli mod run against ~448 Python repos, this accumulated 127+ alive python rpc processes per long-running JVM, with each leaked rpc carrying a different past repo's log path in argv. Each compensation-worker spawn = one leaked OS process. The same mechanism applies to JS / C# / Go rpcs.

Thread.sleep blocks the calling thread directly rather than through ManagedBlocker, so FJP doesn't compensate. Parallelism temporarily drops by one while a worker is in rpc.send, which is exactly the resource-bounded behavior --parallel is supposed to mean.

Verified locally with a 20-repo Python LST sample running UpgradeToPython314 at --parallel=14:

Before: spawn count >> repo count under load (compensation amplification)
After: spawn count = repo count, alive rpc count ≤ --parallel, no compensation worker thread names in jstack, final_alive=0 after run.
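The PR does not say how the alive counts were collected. As one option, they could be sampled from inside the JVM with ProcessHandle (Java 9+), assuming the rpc subprocesses are direct children of the JVM. AliveChildren and its methods are hypothetical names.

```java
// Hypothetical helper for sampling child-process counts; requires Java 9+.
final class AliveChildren {
    /** Number of direct child processes of this JVM that are still alive. */
    static long count() {
        return ProcessHandle.current().children()
                .filter(ProcessHandle::isAlive)
                .count();
    }

    /** Same, restricted to children whose executable path contains the given fragment. */
    static long countMatching(String commandFragment) {
        return ProcessHandle.current().children()
                .filter(ProcessHandle::isAlive)
                .filter(h -> h.info().command()
                        .map(c -> c.contains(commandFragment))
                        .orElse(false))
                .count();
    }
}
```

Sampling countMatching("python") periodically during a run and once at the end would give the alive and final_alive figures under that assumption.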
Pairs with rpc: kill child processes at JVM exit via shutdown hook #7616
#7616 added a JVM shutdown hook on RewriteRpcProcess to kill child processes at JVM exit, which catches survivors at process termination. This PR prevents the leak from happening intra-run by removing its trigger.

Checklist

I've read and applied the recipe conventions and best practices
I've used the IntelliJ IDEA auto-formatter on affected files

The send polling loop called future.get(checkIntervalMs, TimeUnit.MILLISECONDS), which from a ForkJoinPool worker goes through CompletableFuture.Signaller.block, a ForkJoinPool.ManagedBlocker. ManagedBlocker is the explicit hook FJP watches for to spawn a compensation worker (a fresh thread added to keep parallelism while the original is parked).

The compensation worker can pick up other queued recipe-scheduler work and call PythonRewriteRpc.getOrStart() (e.g. from a printer or the lazy LazyRecipeBundleResolver supplier), spawning a fresh OS python rpc into its ThreadLocal. When FJP later terminates the idle compensation worker, its TL is GC'd, but the OS python process is independent of the JVM heap and survives. RunTask.execute()'s finally only runs shutdownCurrent on the dispatching worker, never on the dead compensation worker. Each compensation-worker spawn leaked one OS process; long-running recipe-worker JVMs accumulated 100+ alive rpcs intra-run.

Replace future.get(timeout) with future.getNow(null) + Thread.sleep(1ms) polling. Thread.sleep blocks the calling thread directly rather than through ManagedBlocker, so FJP doesn't compensate. Parallelism temporarily drops by one until the rpc response arrives, which is exactly the resource-bounded behavior we want from --parallel. The liveness check is decoupled to fire every 500ms, preserving the existing failure-detection cadence.

Pairs with #7616 (JVM-exit shutdown hook): #7616 catches survivors at JVM exit; this prevents the leak from happening intra-run.

github-project-automation Bot added this to OpenRewrite May 9, 2026

github-project-automation Bot moved this to In Progress in OpenRewrite May 9, 2026

moderne-meeseeks Bot assigned jkschneider May 9, 2026

jkschneider merged commit ccba1ca into main May 9, 2026
1 check passed

jkschneider deleted the rpc-avoid-fjp-compensation branch May 9, 2026 19:44

github-project-automation Bot moved this from In Progress to Done in OpenRewrite May 9, 2026
