Calling disconnect causes processes spawned by cluster module to exit too early #27679

bimbiltu · 2019-05-13T18:16:13Z

Version: 12.2.0 but could also reproduce on 10.15.3
Platform: Darwin 18.5.0 Darwin Kernel Version 18.5.0: Mon Mar 11 20:40:32 PDT 2019; root:xnu-4903.251.3~3/RELEASE_X86_64 x86_64
Subsystem:

Consider the following script:

const cluster = require('cluster');
const childProcess = require('child_process');

const useCluster = false;
const isMaster = useCluster ? cluster.isMaster : !process.argv.includes('worker');

if (isMaster) {
  if (useCluster) {
    cluster.fork(__filename);
  } else {
    childProcess.fork(__filename, ['worker']);
  }
} else {
  setTimeout(() => console.log('hi from worker'), 1000);
  process.disconnect();
}

When useCluster is true node exits immediately and nothing is printed to the console. When useCluster is false you see "hi from worker" logged after 1s and then node will exit which is the expected behavior. The documentation makes it sound like calling process.disconnect should only close the IPC channel between the master and worker process and should not cause either to exit early if there is still work to do.

The text was updated successfully, but these errors were encountered:

sam-github · 2019-05-13T19:36:02Z

Sorry, I ran out of time to look at this, but what I suspect is happening is that the master does not expect to see the IPC pipe closed directly, without the worker sending a message saying that it is going to do so, considers this to be unexpected, and kills the worker. I had trouble proving that, though, more research would be required.

Note that using process.worker.disconnect() instead of process.disconnect() gives the behaviour you expect. If you just want a workaround, doing (process.worker ? process.worker.disconnect : process.disconnect)() would allow more symetricality between cluster workers and forked child processes.

bimbiltu · 2019-05-15T16:20:59Z

@sam-github process.worker is undefined for me when using both cluster and child_process. I tried this with node 10.15.3 as well as 12.2.0 on macOS mojave if that makes a difference. I've just switched to using child_process for now in my code since we were really only using the cluster module for its automatic debug port incrementing

sam-github · 2019-05-15T20:33:17Z

Apologies, I meant cluster.worker.disconnect.

The cluster modules is not very flexible, if its not being used for exactly the intended use-case (identical net or http servers), its features easily become misfeatures, so using child_process is probably a better idea for you.

ledbit · 2020-08-13T20:34:41Z

Just ran into a variation of this issue, in our case the master process exits and we want to perform some proper graceful shutdown on the child processes. Happy to contribute, but want to validate that I have the correct approach first. I believe the root cause of both issues (master process going away and process.disconnect) is in this listener added during cluster's worker setup.

The approach that I think would solve this issue would be to add an option (similar to exitedOnDisconnect) that would control whether the child process should exit on disconnect. Can someone please chime in on whether this is the right approach?

drewswanner · 2020-10-13T03:34:27Z

Is anyone working on this I would be interested in looking into it. I may need some guidance but I am willing to give it a shot.

SeanMcCord · 2021-05-18T00:41:07Z

Based on the documentation I believe process.disconnect is working correctly and as expected. @sam-github makes a good point about the cluster module flexibility. Using the cluster module effectively requires the usage of worker.disconnect to handle graceful shutdown*. This has been surprising behavior to people since 2016[1][2]. I'm not convinced that a change to either the cluster or the child_process modules are needed.

The 2017 issue[2] suggested updating the documentation to make this behavior clear; however, the documentation does not appear to have been changed. I've created a PR for updating the documentation #38713

[1] #13671
[2] https://stackoverflow.com/a/40846737

You could hack your way around this by manually setting worker.exitedAfterDisconnect to true before calling process.disconnect, but obviously that is gross.

oyyd added the cluster Issues and PRs related to the cluster subsystem. label May 14, 2019

jasnell added the help wanted Issues that need assistance from volunteers or PRs that need help to proceed. label Jun 26, 2020

SeanMcCord mentioned this issue May 18, 2021

add clarification to worker.disconnect() #38713

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Calling disconnect causes processes spawned by cluster module to exit too early #27679

Calling disconnect causes processes spawned by cluster module to exit too early #27679

bimbiltu commented May 13, 2019 •

edited

sam-github commented May 13, 2019

bimbiltu commented May 15, 2019

sam-github commented May 15, 2019

ledbit commented Aug 13, 2020

drewswanner commented Oct 13, 2020

SeanMcCord commented May 18, 2021

Calling disconnect causes processes spawned by cluster module to exit too early #27679

Calling disconnect causes processes spawned by cluster module to exit too early #27679

Comments

bimbiltu commented May 13, 2019 • edited

sam-github commented May 13, 2019

bimbiltu commented May 15, 2019

sam-github commented May 15, 2019

ledbit commented Aug 13, 2020

drewswanner commented Oct 13, 2020

SeanMcCord commented May 18, 2021

bimbiltu commented May 13, 2019 •

edited