
[#3699] Nio|EpollEventLoopGroup.shutdownGracefully() needs to gracefully close connections #3706

Closed

Conversation

normanmaurer
Member


Motivation:

Currently, when calling Nio|EpollEventLoopGroup.shutdownGracefully(), all active Channels are closed right away. This is not what should happen for a graceful shutdown.

Modifications:

Only close Channels after the quiet period is complete.

Result:

Correct behavior when calling shutdownGracefully()

@normanmaurer
Member Author

@trustin please check

@normanmaurer
Member Author

@Scottmitch also please check

@normanmaurer normanmaurer self-assigned this Apr 29, 2015
@normanmaurer normanmaurer added this to the 4.0.28.Final milestone Apr 29, 2015
@trustin
Member

trustin commented Apr 30, 2015

TL;DR - We don't need to fix this because it is working as intended.

The intention of closeAll() is to make sure all channels are closed when an event loop is shutting down, so that the channels do not produce new events anymore. If we moved closeAll() to after confirmShutdown() returns true, the channel events related to the closure might not be processed correctly (e.g. rejected), because an event loop will not accept new tasks once confirmShutdown() returns true.

#3699 says the connections are closed before in-flight messages are handled. Actually, that's intended behavior. Why? If we waited until all incoming messages were handled, shutdown would always take as much time as the grace period, and we still would not be able to ensure that all messages are handled; i.e. we would have solved nothing.

I think it's better for a user to communicate with the remote peers to schedule the shutdown, and to call shutdownGracefully() once the application is ready to terminate the event loop.

/cc @fantayeneh

@normanmaurer
Member Author

@trustin true ... let me close as won't fix...

@fantayeneh

@trustin I am still confused about the shutdownGracefully() documentation then. Can you please shed some light on the purpose of the quietPeriod param? If closeAll() happens right away, why the need for the quiet period?

The current doc says as follows.

Unlike shutdown(), graceful shutdown ensures that no tasks are submitted for 'the quiet period'
(usually a couple seconds) before it shuts itself down. If a task is submitted during the quiet
period, it is guaranteed to be accepted and the quiet period will start over.
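
For reference, the quiet period and the hard timeout are the two arguments of the three-argument overload of shutdownGracefully(); a minimal usage sketch, where the values and the workerGroup name are illustrative:

// quietPeriod = 2s, hard timeout = 15s (illustrative values);
// workerGroup is an existing Nio/EpollEventLoopGroup
io.netty.util.concurrent.Future<?> terminated =
        workerGroup.shutdownGracefully(2, 15, java.util.concurrent.TimeUnit.SECONDS);
terminated.syncUninterruptibly(); // block until the group has fully terminated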

Thanks

@pyr

pyr commented Nov 4, 2022

To support graceful shutdown of applications, the usual workflow is as follows:

  1. Stop listening for new requests
  2. Wait for ongoing tasks to be cleared from the event loop (finish serving in-flight requests)
  3. Stop related executors and shut down service

This makes blue-green style deployments easy to put together. If I am reading @trustin's comment correctly, .shutdownGracefully does not allow for step 2 to happen, since it will cancel ongoing work on the event loop.

Is my understanding correct, and if so, what would be the recommended way to gracefully stop a Netty-based proxy which may have in-flight requests waiting for responses from downstream services before shutting down?

cc @normanmaurer for insight (for context: we are trying to get clj-commons/aleph to do the right thing when closing down)

@pyr

pyr commented Nov 8, 2022

Following up here in case someone else lands on this issue. It seems that the best way to approach this is to keep track of connections with a ChannelGroup (by adding another ChannelHandler to the pipeline which acts during channelActive and channelInactive). After closing the listening channel, ChannelGroup::newCloseFuture can be called and waited upon; once it completes, the rest of the shutdown procedure can be carried out.
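
A minimal sketch of that approach, assuming the tracker handler is added to every accepted channel's pipeline; ConnectionTracker, drainAndShutdown, and the bootstrap variables are illustrative names, not Netty API:

import io.netty.channel.Channel;
import io.netty.channel.ChannelHandler;
import io.netty.channel.ChannelHandlerContext;
import io.netty.channel.ChannelInboundHandlerAdapter;
import io.netty.channel.EventLoopGroup;
import io.netty.channel.group.ChannelGroup;
import io.netty.channel.group.DefaultChannelGroup;
import io.netty.util.concurrent.GlobalEventExecutor;

// Shared handler that records every accepted connection in a ChannelGroup.
@ChannelHandler.Sharable
public class ConnectionTracker extends ChannelInboundHandlerAdapter {
    private final ChannelGroup connections = new DefaultChannelGroup(GlobalEventExecutor.INSTANCE);

    @Override
    public void channelActive(ChannelHandlerContext ctx) {
        connections.add(ctx.channel()); // a closed channel is removed from the group automatically
        ctx.fireChannelActive();
    }

    // Shutdown sequence; serverChannel and the event loop groups come from the server bootstrap.
    public void drainAndShutdown(Channel serverChannel,
                                 EventLoopGroup boss, EventLoopGroup workers) throws InterruptedException {
        serverChannel.close().sync();        // 1. stop accepting new connections
        connections.newCloseFuture().sync(); // 2. wait for in-flight connections to finish and close
        boss.shutdownGracefully();           // 3. only now stop the event loops
        workers.shutdownGracefully();
    }
}

Because the listening channel is closed before newCloseFuture() is requested, no new connections can join the group afterwards, so the future covers exactly the connections that were in flight.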
