Introduce "eventloop" style API to better handle netlink event storms #499

crosser · 2018-05-08T13:06:50Z

Please consider this proposal.

In mass network topology updates, thousands of netlink packets lead to excessive resource usage (one thread per event use much memory and results in slow processing) and hanging process (recovery from failed IPDB commit is slow, and monitoring thread is prone to die without the master thread noticing).

This change allows the user to process incoming netlink messages in an event loop (in the main thread, or in any user-initiated thread(s)), and netlink socket receive errors are passed via the same queue, and re-raised in the master thread. This allows the user program to notice that netlink events where lost and take appropriate action.

The last commit of three allows to specify the size of netlink socket send and receive buffers, and thus control the "storm sensitivity" of the system.

Sidenote: on Linux, system wide max size of socket receive and send buffers is controlled with
/proc/sys/net/core/rmem_max, it may be worth to mention in the documentation.

svinota · 2018-05-09T12:43:27Z

@celebdor Antoni, pls take a look at the PR

Event queue interface is an alternative to "post" callbacks. While "post" callbacks are executed in a separate thread each, event queue interface follows the "eventloop" metaphor, i.e. each netlink event received in the monitoring thread is put in the queue from which it can be subsequently fetched by calling `nextmsg()` generator function in the main thread (or in any other thread(s) started by the user). In the event of packet storm, it is much nicer to the resources than creating a thread for each received message. Signed-off-by: Eugene Crosser <crosser@average.org>

IPDB reinitialization is rather fragile, and sometimes hangs for a long time, and prevents release() from finishing. When event queue is used, errors in the monitor thread are reported to the user via event queue, so the user can take care of cleanup and reinitialization. Signed-off-by: Eugene Crosser <crosser@average.org>

Introduce keyword arguments to IPRoute() and IPDB() constructors to specify netlink socket send and receive buffer sizes. Defaults are 1048576, 1048576 (one megabyte). Useful for defining strategy for handling netlink packet storms: specify smaller size when early bailout is desired. When used in conjunction with eventqueue, exception will be raised in the main thread when netlink events are lost, allowing for user-controlled recovery procedure. Signed-off-by: Eugene Crosser <crosser@average.org>

Eugene Crosser added 3 commits May 14, 2018 15:41

crosser force-pushed the eventloop branch from 509723d to 2d2055b Compare May 14, 2018 13:59

svinota merged commit ef01d3f into svinota:master May 14, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Introduce "eventloop" style API to better handle netlink event storms #499

Introduce "eventloop" style API to better handle netlink event storms #499

crosser commented May 8, 2018

svinota commented May 9, 2018

Introduce "eventloop" style API to better handle netlink event storms #499

Introduce "eventloop" style API to better handle netlink event storms #499

Conversation

crosser commented May 8, 2018

svinota commented May 9, 2018