do not always retry write to pipe if we get blocking errors #519

etamme · 2015-05-21T13:33:06Z

If you have a RabbitMQ event subscription, the event_rabbitmq module will shm_alloc rmq events and write pointers to the structs to the pipe.

If the node you are connected to goes down, the pipe starts to fill. When it reaches its max capacity (65535 bytes, or approximately 2700 events based on the 8-byte pointer), the write call will return EAGAIN, turning the while loop into an infinite loop until pointers start getting pulled off the pipe. This causes massive CPU consumption and blocks any process that generates an event for event_rabbitmq.
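The failure mode described above can be reproduced in isolation. The sketch below is not taken from event_rabbitmq; `set_nonblocking` and `fill_pipe` are hypothetical helpers. It assumes a non-blocking pipe that nobody is reading from and writes pointer-sized items until `write` fails, which on a full pipe should be with `EAGAIN`. How many items fit depends on the system's pipe capacity.

```c
#include <errno.h>
#include <fcntl.h>
#include <stddef.h>
#include <unistd.h>

/* Put a file descriptor into non-blocking mode; returns 0 on success. */
static int set_nonblocking(int fd)
{
    int flags = fcntl(fd, F_GETFL, 0);
    if (flags < 0)
        return -1;
    return fcntl(fd, F_SETFL, flags | O_NONBLOCK);
}

/*
 * Write pointer-sized items to wfd until write() stops accepting them,
 * while no reader drains the pipe.
 * Returns the number of items queued and stores the failing errno in *err.
 */
static long fill_pipe(int wfd, int *err)
{
    void *item = NULL;  /* stand-in for a pointer to a shm-allocated event */
    long queued = 0;

    errno = 0;
    while (write(wfd, &item, sizeof(item)) == (ssize_t)sizeof(item))
        queued++;
    *err = errno;       /* expected: EAGAIN once the pipe is full */
    return queued;
}
```

Once `fill_pipe` returns, every further `write` on that descriptor keeps failing with `EAGAIN` until a reader frees space, so a loop that retries unconditionally on `EAGAIN` spins for as long as the consumer is stalled.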

Attempts to publish to RabbitMQ time out after 3 minutes, based on the default system TCP timeout, so only one event every 3 minutes will be pulled from the pipe while the AMQP node is down.

You can reproduce this issue by setting up a proxy with an event_rabbitmq subscription, adding an iptables rule to block access to the AMQP node, and sending traffic to the proxy that generates events until you hit the max pipe size (~2703 event pointers).

This commit changes the logic to simply retry the write 3 times, then abort.
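A bounded retry of this kind might look like the sketch below. It is not the code from this PR or from the version later committed to master; `write_bounded` and `PIPE_WRITE_RETRIES` are hypothetical names, and reading "3 times" as three attempts in total is an assumption.

```c
#include <errno.h>
#include <stddef.h>
#include <unistd.h>

/* Hypothetical limit: at most three write attempts in total. */
#define PIPE_WRITE_RETRIES 3

/*
 * Try to write len bytes to fd, retrying only on transient errors
 * (EINTR, EAGAIN, EWOULDBLOCK) and only a bounded number of times.
 * Returns 0 if the whole buffer was written, -1 otherwise.
 * A short write is treated as failure; for pointer-sized items on a
 * pipe this should not happen, since such writes are all-or-nothing.
 */
static int write_bounded(int fd, const void *buf, size_t len)
{
    int attempts = 0;
    ssize_t rc;

    do {
        rc = write(fd, buf, len);
        if (rc == (ssize_t)len)
            return 0;
    } while (rc < 0 &&
             (errno == EINTR || errno == EAGAIN || errno == EWOULDBLOCK) &&
             ++attempts < PIPE_WRITE_RETRIES);

    return -1;  /* caller decides what to do with the undelivered item */
}
```

When `write_bounded` returns -1, the caller still owns the item it tried to enqueue, so in a design like the one described here it would need to release the shm-allocated event rather than leak it.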

…auses the proxy to lock up and consume cpu

josephfrazier · 2015-05-22T14:46:01Z

~~Note that event_xmlrpc has the same problem as well:~~

opensips/modules/event_xmlrpc/xmlrpc_send.c, line 94 in 6f5dde4:

`} while ((rc < 0 && (IS_ERR(EINTR)||IS_ERR(EAGAIN)||IS_ERR(EWOULDBLOCK)))`

EDIT: Whoops, didn't read closely enough. The parentheses in the above example are arranged differently, so retries is honored regardless of error code.

razvancrainea · 2015-06-03T09:44:52Z

The PR was committed to the master branch, with a few changes. If everything is OK now, let me know so I can backport it and close the PR.

Thanks,
Răzvan

razvancrainea · 2015-06-03T15:48:28Z

Backported to 2.1. Closing the ticket.

do not always retry write to pipe if we get blocking errors as this c…

32b4340

…auses the proxy to lock up and consume cpu

razvancrainea self-assigned this May 22, 2015

razvancrainea closed this Jun 3, 2015
