"ERROR:pika.adapters.base_connection:Socket Error on fd 8: 104" #266

vgoklani · 2013-01-29T23:52:46Z

Hi,

Could someone please explain this error and how to handle it. It looks like the socket is closing, but it's not clear how to address this. Is there an exception I should handle?

I am running the latest release build of both RabbitMQ and Pika.

Thanks,

Vishal

atatsu · 2013-01-30T01:07:40Z

How are you using your channels? Are you using the same channel to do everything? As in declaring queues/exchanges, setting up bindings, consuming queues?

vgoklani · 2013-01-30T01:13:19Z

Yes, I am using the default channel for everything. (presumably this is bad)

My base code is here: https://gist.github.com/4669710

I followed the RabbitMQ tutorial. Could you please point me to a better example.

My callback function is passed in, and the messages are a list of JSON objects that are produced by the producer and consumed by the consumer.

Thanks!

On Jan 29, 2013, at 8:07 PM, Nathan Lundquist notifications@github.com wrote:

How are you using your channels? Are you using the same channel to do everything? As in declaring queues/exchanges, setting up bindings, consuming queues?

—
Reply to this email directly or view it on GitHub.

gmr · 2013-01-30T01:15:40Z

Can you please paste the full traceback?

On Tuesday, January 29, 2013 at 8:13 PM, Vishal Goklani wrote:

Yes, I am using the default channel for everything. (presumably this is bad)

My base code is here: https://gist.github.com/4669710

I followed the RabbitMQ tutorial. Could you please point me to a better example.

My callback function is passed in, and the messages are a list of JSON objects that are produced by the producer and consumed by the consumer.

Thanks!

On Jan 29, 2013, at 8:07 PM, Nathan Lundquist <notifications@github.com (mailto:notifications@github.com)> wrote:

How are you using your channels? Are you using the same channel to do everything? As in declaring queues/exchanges, setting up bindings, consuming queues?

—
Reply to this email directly or view it on GitHub.

—
Reply to this email directly or view it on GitHub (#266 (comment)).

vgoklani · 2013-01-30T01:21:51Z

There is no full traceback from pika, I only see these messages:

ERROR:pika.adapters.base_connection:Socket Error on fd 4: 104

Are you referring to the RabbitMQ logs?

On Jan 29, 2013, at 8:07 PM, Nathan Lundquist notifications@github.com wrote:

How are you using your channels? Are you using the same channel to do everything? As in declaring queues/exchanges, setting up bindings, consuming queues?

—
Reply to this email directly or view it on GitHub.

gmr · 2013-01-30T01:33:42Z

No, I am referring to the python traceback when the exception is raised in your application. That error tells us the problem but not where it is occurring or why. The Python traceback when the app breaks is more useful.

On Tuesday, January 29, 2013 at 8:22 PM, Vishal Goklani wrote:

There is no full traceback from pika, I only see these messages:

ERROR:pika.adapters.base_connection:Socket Error on fd 4: 104

Are you referring to the RabbitMQ logs?

On Jan 29, 2013, at 8:07 PM, Nathan Lundquist <notifications@github.com (mailto:notifications@github.com)> wrote:

How are you using your channels? Are you using the same channel to do everything? As in declaring queues/exchanges, setting up bindings, consuming queues?

—
Reply to this email directly or view it on GitHub.

—
Reply to this email directly or view it on GitHub (#266 (comment)).

atatsu · 2013-01-30T01:43:17Z

From my experience you can save yourself a lot of headaches if you use a new channel for each new operation you wish to perform. The number of channel's you can have open at one time is really only limited by the resources of the box running RabbitMQ. We have thousands of active channels and the box we're running Rabbit on still has plenty to give.

There are a number of reasons to not use the same channel for multiple operations (i.e. queue_declare, exchange_declare, basic_consume) IMO. Say for example you've instructed a channel to consume ten different queues. Now pretend there's a channel error, you tried declaring a queue that already existed with different properties. RabbitMQ will close the channel. Your ten consumers are no longer consuming. You'll have to handle that. Setup all ten consumers again, assuming you know which ten were impacted.

Another problem you run into when reusing the same channel is RabbitMQ expecting a certain response from the channel and your code trying to do something else too soon resulting in an unexpected response on your end which can lead to RabbitMQ closing your connection (which I'm guessing is the problem you're experiencing).

In order to reuse the same channel you have to setup callbacks to know when Rabbit is finished doing its thing. So if you want to call channel.queue_declare five times consecutively you need to call it once, and then in the callback you can call it again. Rinse and repeat three more times.

Personally I find it easier to just use a separate channel for everything and then just close it when it isn't needed anymore. Does this make sense?

vgoklani · 2013-01-30T03:28:50Z

I created a try / catch block:

try:
queue.publish(messages)
except Exception as e:
print e.message

and got this: 'ConnectionClosed' object has no attribute 'messages'. I don't see any other traceback.

So it's throwing a ConnectionClosed exception. The solution would be to catch the exception and just open another connection. The bigger question is why is the connection closing, and is the correct solution to simply open another connection (?)

This is a rough sketch of my code:

open connection to queue
call producer and put messages into queue
sleep for 20min
goto 2

But it seems now that the connection is closing after step 3.

Is it possible that the 20min delay is closing the connection?

Thanks for all the help!

Best,

Vishal

On Jan 29, 2013, at 8:33 PM, Gavin M. Roy notifications@github.com wrote:

No, I am referring to the python traceback when the exception is raised in your application. That error tells us the problem but not where it is occurring or why. The Python traceback when the app breaks is more useful.

On Tuesday, January 29, 2013 at 8:22 PM, Vishal Goklani wrote:

There is no full traceback from pika, I only see these messages:

ERROR:pika.adapters.base_connection:Socket Error on fd 4: 104

Are you referring to the RabbitMQ logs?

On Jan 29, 2013, at 8:07 PM, Nathan Lundquist <notifications@github.com (mailto:notifications@github.com)> wrote:

How are you using your channels? Are you using the same channel to do everything? As in declaring queues/exchanges, setting up bindings, consuming queues?

—
Reply to this email directly or view it on GitHub.

—
Reply to this email directly or view it on GitHub (#266 (comment)).

—
Reply to this email directly or view it on GitHub.

gmr · 2013-01-30T03:33:59Z

If you're not getting the error message from:

try:
    # foo do whatever here
catch ConnectionClosed as error:
    print 'Connection was closed due to: %s' % error

Then the RabbitMQ logs are the next place to look.

vgoklani · 2013-01-30T04:56:54Z

is it normal to get messages like this (from the RabbitMQ log):

=WARNING REPORT==== 30-Jan-2013::03:52:16 ===
closing AMQP connection <0.466.0> (10.169.2.187:35609 -> 10.169.2.187:5672):
connection_closed_abruptly

=INFO REPORT==== 30-Jan-2013::03:52:21 ===
accepting AMQP connection <0.515.0> (10.169.2.187:35624 -> 10.169.2.187:5672)

=INFO REPORT==== 30-Jan-2013::03:52:38 ===
accepting AMQP connection <0.527.0> (10.169.2.187:35630 -> 10.169.2.187:5672)

=ERROR REPORT==== 30-Jan-2013::04:32:21 ===
closing AMQP connection <0.515.0> (10.169.2.187:35624 -> 10.169.2.187:5672):
{heartbeat_timeout,running}

=INFO REPORT==== 30-Jan-2013::04:34:32 ===
accepting AMQP connection <0.802.0> (10.169.2.187:37799 -> 10.169.2.187:5672)

How would I debug this further?

On Jan 29, 2013, at 10:34 PM, Gavin M. Roy notifications@github.com wrote:

If you're not getting the error message from:

try:

foo do whatever here

catch ConnectionClosed as error:
print 'Connection was closed due to: %s' % error

Then the RabbitMQ logs are the next place to look.

On Tue, Jan 29, 2013 at 10:28 PM, Vishal Goklani
notifications@github.comwrote:

I created a try / catch block:

try:
queue.publish(messages)
except Exception as e:
print e.message

and got this: 'ConnectionClosed' object has no attribute 'messages'. I
don't see any other traceback.

So it's throwing a ConnectionClosed exception. The solution would be to
catch the exception and just open another connection. The bigger question
is why is the connection closing, and is the correct solution to simply
open another connection (?)

This is a rough sketch of my code:

open connection to queue

call producer and put messages into queue

sleep for 20min

goto 2

But it seems now that the connection is closing after step 3.

Is it possible that the 20min delay is closing the connection?

Thanks for all the help!

Best,

Vishal

On Jan 29, 2013, at 8:33 PM, Gavin M. Roy notifications@github.com
wrote:

No, I am referring to the python traceback when the exception is raised
in your application. That error tells us the problem but not where it is
occurring or why. The Python traceback when the app breaks is more useful.

On Tuesday, January 29, 2013 at 8:22 PM, Vishal Goklani wrote:

There is no full traceback from pika, I only see these messages:

ERROR:pika.adapters.base_connection:Socket Error on fd 4: 104

Are you referring to the RabbitMQ logs?

On Jan 29, 2013, at 8:07 PM, Nathan Lundquist <
notifications@github.com (mailto:notifications@github.com)> wrote:

How are you using your channels? Are you using the same channel to
do everything? As in declaring queues/exchanges, setting up bindings,
consuming queues?

—
Reply to this email directly or view it on GitHub.

—
Reply to this email directly or view it on GitHub (
#266 (comment)).

—
Reply to this email directly or view it on GitHub.

—
Reply to this email directly or view it on GitHubhttps://github.com//issues/266#issuecomment-12872498.

Gavin M. Roy
Chief Technology Officer

http://www.meetme.com/
100 Union Square Drive
New Hope, PA 18938
p. +1.215.862.1162 x263
f. +1.215.862.0465

https://www.facebook.com/pages/MeetMe/21931227129
https://twitter.com/meetme
http://www.youtube.com/user/MeetMeVideos

The public market leader in social discovery. (NYSE MKT: MEET)
—
Reply to this email directly or view it on GitHub.

gmr · 2013-01-30T06:05:55Z

It appears that you are blocking in your consumer for longer than Pika has to respond to timeouts. The telling line is:

=ERROR REPORT==== 30-Jan-2013::04:32:21 ===
closing AMQP connection <0.515.0> (10.169.2.187:35624 -> 10.169.2.187:5672):
{heartbeat_timeout,running}

You can either turn them off or make them much longer. In pika it's heartbeat_interval=0 to turn them off or heartbeat_interval={seconds} to set how many seconds you want them run. My guess is your consumer is blocking in Python processing for a fair amount of time if this is happening.

Are you using time.sleep or any such thing?

vgoklani · 2013-01-30T12:36:29Z

yes, I am using time.sleep to take 30min breaks in-between requests - time.sleep(60 * 30).

What's the proper way of pausing the producer/consumer, should I switch to connection.sleep(60*30)?

Thanks again for all the help!

On Jan 30, 2013, at 1:06 AM, Gavin M. Roy notifications@github.com wrote:

It appears that you are blocking in your consumer for longer than Pika has to respond to timeouts. The telling line is:

=ERROR REPORT==== 30-Jan-2013::04:32:21 ===
closing AMQP connection <0.515.0> (10.169.2.187:35624 -> 10.169.2.187:5672):
{heartbeat_timeout,running}

You can either turn them off or make them much longer. In pika it's heartbeat_interval=0 to turn them off or heartbeat_interval={seconds} to set how many seconds you want them run. My guess is your consumer is blocking in Python processing for a fair amount of time if this is happening.

Are you using time.sleep or any such thing?
—
Reply to this email directly or view it on GitHub.

gmr · 2013-01-30T16:45:41Z

Use BlockingConnection.sleep() instead of time.sleep():

https://pika.readthedocs.org/en/0.9.9/connecting.html#pika.adapters.blocking_connection.BlockingConnection.sleep

gmr closed this as completed Mar 13, 2013

levinas mentioned this issue Jul 31, 2014

solve socket error (pika/rabbitmq) levinas/assembly#1

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

"ERROR:pika.adapters.base_connection:Socket Error on fd 8: 104" #266

"ERROR:pika.adapters.base_connection:Socket Error on fd 8: 104" #266

vgoklani commented Jan 29, 2013

atatsu commented Jan 30, 2013

vgoklani commented Jan 30, 2013

gmr commented Jan 30, 2013

vgoklani commented Jan 30, 2013

gmr commented Jan 30, 2013

atatsu commented Jan 30, 2013

vgoklani commented Jan 30, 2013

gmr commented Jan 30, 2013

vgoklani commented Jan 30, 2013

foo do whatever here

gmr commented Jan 30, 2013

vgoklani commented Jan 30, 2013

gmr commented Jan 30, 2013

"ERROR:pika.adapters.base_connection:Socket Error on fd 8: 104" #266

"ERROR:pika.adapters.base_connection:Socket Error on fd 8: 104" #266

Comments

vgoklani commented Jan 29, 2013

atatsu commented Jan 30, 2013

vgoklani commented Jan 30, 2013

gmr commented Jan 30, 2013

vgoklani commented Jan 30, 2013

gmr commented Jan 30, 2013

atatsu commented Jan 30, 2013

vgoklani commented Jan 30, 2013

gmr commented Jan 30, 2013

vgoklani commented Jan 30, 2013

foo do whatever here

gmr commented Jan 30, 2013

vgoklani commented Jan 30, 2013

gmr commented Jan 30, 2013