heap invariant compromised in connsched #119
Comments
@gbarr wrote
and
|
We reverted to a previous version of beanstalkd and we haven't seen this bug since (over 24 hours). We were seeing it multiple times per day before, so it appears to only affect 1.5. If I can convince ops, we will try 1.6 and see if it exists there. |
@chadkouse if you can reproduce it easily, it would be extremely helpful I can help if you need instructions on how to do that. |
Yeah, tell me how to do that and I'll see if I can get it done. On Wednesday, May 23, 2012 at 5:02 AM, Keith Rarick wrote:
|
Before starting the beanstalkd process, run If you run it on Mac OS X, the core file will be in /cores; if you're |
Here's a core dump from 1.5 - http://dl.dropbox.com/u/32251821/core We have put 1.6 on one of our nodes now and will report back if we get a crash there as well (with core dump) Let me know when you've got that file so I can delete it from my dropbox. |
Hi Keith, Attached is the core dump file from the 1.6 crash. Hope this helps. On Thu, May 31, 2012 at 11:34 AM, Chad Kouse chad.kouse@gmail.com wrote:
Tyler Yosick |
Whoops. Looks like replying to one of the emails made me comment as Chad somehow... Keith: How can I get this 1.6 core dump file to you? |
@chadkouse I have the core file now. Can you also send @tyosick You can email it to kr@xph.us. Please also send |
@tyosick works with us so I'll let him send both binaries. |
@kr, just to confirm, you need both the 1.5 and 1.6 binaries (located in /usr/local/bin/beanstalkd)? |
@tyosick both would be best, though either one would probably be sufficient.
|
Thanks for the core file and binary. Loading them into gdb
|
I'm not sure what this means - does it mean there's a bug in beanstalkd or a flaw in our setup? |
Just had similar crash in my production environment - running ver 1.5 Jul 11 15:16:01 server04 kernel: [32980446.960264] beanstalkd[18214] general protection ip:4095ab sp:7fffcff4b2c0 error:0 in beanstalkd[400000+12000] Any recommendations or time to fix would be great. |
kr any recommendations here? We're stuck on 1.4.6 until we can solve this unfortunately. |
I have been able to reproduce a crash. Now it's just a matter of time. You're right in sticking to 1.4.6 until this is fixed. Can I get you to run a test build of the bug fix when it's ready? |
Yeah we're willing to help test. Chad Kouse On Friday, August 24, 2012 at 10:04 PM, Keith Rarick wrote:
|
We've now been seeing this happen about once a day. We're still using 1.5, and it's not really straightforward to roll back. My question is: if we have persistence turned on, are we losing any messages when this crash occurs? It would be really helpful to know this, to help prioritize the need to roll back to 1.4.6. |
@chadkouse @nitinahuja could you please run this build https://s3.amazonaws.com/krheroku/beanstalkd
If it works well I'll make a release. |
@nitinahuja The potential for losing jobs depends on how often |
@chadkouse @nitinahuja did either of you get a chance to test out the recent fixes? |
We are loading it onto our machines tomorrow. Will report back in a couple of days (or sooner if you didn't fix the bug :) ) Chad Kouse On Monday, September 3, 2012 at 3:30 PM, Keith Rarick wrote:
|
@chadkouse can I assume you've had no problems so far? |
Yea, no problems so far Chad Kouse On Monday, September 10, 2012 at 4:07 AM, Keith Rarick wrote:
|
Excellent. I consider this fixed. |
Awesome, thanks. We loaded it on a few more machines, so we will let you know of any hiccups we come across. Chad Kouse On Monday, September 10, 2012 at 2:33 PM, Keith Rarick wrote:
|
I am facing the same issue on CentOS 6.3 with Beanstalkd 1.8: Dec 13 13:02:35 vm7-d2 kernel: beanstalkd[8701] general protection ip:40200a sp:7fff6c65fa10 error:0 in beanstalkd[400000+11000] Beanstalkd 1.4 was running fine without any issue, so I am reverting back to 1.4. |
Function connsched was modifying field tickat while a Conn
was still inside the heap. Wacky hijinks ensued.
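To illustrate the class of bug described above, here is a minimal Python sketch. It is not beanstalkd's C code: the entries are plain `[tickat, conn_id]` lists, and the helpers `is_min_heap`, `reschedule_unsafe`, and `reschedule_safe` are hypothetical names made up for this example. It only shows why changing a sort key while the item is still in a heap can break the heap invariant, and one common way to avoid that (remove, update, reinsert). The actual fix in beanstalkd may differ.

```python
import heapq


def is_min_heap(heap):
    """Return True if every parent sorts no later than its children."""
    return all(heap[(i - 1) // 2] <= heap[i] for i in range(1, len(heap)))


def reschedule_unsafe(heap, entry, new_tickat):
    """Buggy pattern: change the sort key while the entry is still in the heap."""
    entry[0] = new_tickat  # the heap is never told that the key changed


def reschedule_safe(heap, entry, new_tickat):
    """Safer pattern: take the entry out, change the key, then put it back."""
    heap.remove(entry)      # O(n) removal; acceptable for a sketch
    heapq.heapify(heap)     # restore heap order after the removal
    entry[0] = new_tickat   # safe now: the entry is outside the heap
    heapq.heappush(heap, entry)


def demo():
    """Run both patterns on the same data and report whether the invariant holds."""
    unsafe = [[10, "a"], [20, "b"], [30, "c"]]
    heapq.heapify(unsafe)
    reschedule_unsafe(unsafe, unsafe[0], 99)

    safe = [[10, "a"], [20, "b"], [30, "c"]]
    heapq.heapify(safe)
    reschedule_safe(safe, safe[0], 99)

    return is_min_heap(unsafe), is_min_heap(safe)
```

In this sketch the unsafe variant leaves the entry with the largest key sitting at the root, so later pops would return items in the wrong order. The safe variant keeps the ordering intact at the cost of an extra remove and insert.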
Original report follows.
Reported in version 1.5:
https://groups.google.com/d/topic/beanstalk-talk/3uRiRBonVuE