cluster:RR - Added public API pausing/unpausing a worker #10369
Conversation
We have the following two use cases:

1. Run offline GC. We have a script that coordinates offline GC using the pause and unpause APIs: the master stops distributing new requests to a paused worker once we call cluster.pause(worker), the paused worker forces a full GC after draining all of its in-flight requests, and the worker resumes serving new requests after the master unpauses it with cluster.unpause(worker).

2. Respawn a worker after it has served X requests. The master should not distribute requests to the freshly spawned worker until it has primed its cache, which can take a few seconds, so we need to pause the worker while the cache is being primed.

This has enabled us to reduce long-tail latency. There has been prior discussion about how best to implement this feature (#7695).
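A minimal sketch of how the proposed API could drive the offline-GC use case. The cluster.pause()/cluster.unpause() calls are the API this PR proposes; the message names and the --expose-gc-based GC trigger are illustrative assumptions, not part of the patch:

```js
const cluster = require('cluster');
const http = require('http');

if (cluster.isMaster) {
  const worker = cluster.fork();

  worker.on('message', (msg) => {
    // Hypothetical message name: the worker reports that it has
    // drained its in-flight requests and finished a full GC.
    if (msg === 'gc-done') cluster.unpause(worker);
  });

  // Proposed API: stop routing new connections to this worker.
  setTimeout(() => {
    cluster.pause(worker);
    worker.send('drain-and-gc'); // hypothetical trigger message
  }, 60 * 1000);
} else {
  http.createServer((req, res) => {
    res.end('ok');
  }).listen(8000);

  process.on('message', (msg) => {
    if (msg === 'drain-and-gc') {
      // Illustrative only: a real worker would wait for in-flight
      // requests to drain before forcing GC (requires --expose-gc).
      if (global.gc) global.gc();
      process.send('gc-done');
    }
  });
}
```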
I'm not really a fan of this approach. I'd prefer allowing the user to define the scheduling policy, which I believe was discussed in #7695.
Colin - Thanks for your quick response. I think in the previous discussion we tried to avoid exposing the RR handler to the public API. I was thinking an explicit pause/unpause API makes it more useful for various use cases. How can a user-defined scheduling policy be used without exposing the RR handler to the public API?
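For illustration only, here is one hypothetical shape a user-defined scheduling policy could take, under which pause/unpause reduces to filtering. cluster.setScheduler() does not exist, and nothing like it was agreed on in #7695:

```js
const cluster = require('cluster');

const paused = new Set();
let next = 0;

// Hypothetical hook: the master would call this function to pick
// the worker that should receive the next incoming connection.
cluster.setScheduler((workers) => {
  // Round-robin over the workers that are not paused.
  const eligible = workers.filter((w) => !paused.has(w.id));
  return eligible[next++ % eligible.length];
});

// pause/unpause then become simple set operations:
const pause = (worker) => paused.add(worker.id);
const unpause = (worker) => paused.delete(worker.id);
```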
@cjihrig FYI: we (at Yahoo) also tried a different approach using sockets, https://github.com/bengl/toor, but it doesn't work if the keepAlive option is on.
/cc @bengl
@Yemanu I opened an issue on
@bengl one of the issues with toor is that http-shutdown closes all idle keepAlive connections, so it causes additional connection overhead.
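For context, a rough sketch of why a socket-level approach interacts badly with keep-alive; this is illustrative and not toor's actual implementation:

```js
const http = require('http');

const server = http.createServer((req, res) => res.end('ok'));
server.listen(8000);

function pauseViaSocket() {
  // server.close() only stops the worker from accepting *new*
  // connections. Established keep-alive sockets stay open, so
  // clients can still send new requests to the "paused" worker.
  server.close();

  // Destroying the idle keep-alive sockets (as http-shutdown does)
  // truly stops traffic, but forces clients to reconnect later,
  // which is the extra connection overhead mentioned above.
}
```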
Toor also uses the shimmer module, and we need to fully understand the implications of using shimmer like that for anything other than debugging :)
Any updates on this one?
This has been superseded by #11546. I'll close it out.