Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

segfault in connsoonestjob at conn.c:166 #160

Closed
kr opened this issue Dec 14, 2012 · 12 comments
Closed

segfault in connsoonestjob at conn.c:166 #160

kr opened this issue Dec 14, 2012 · 12 comments
Labels
WaitingForInfo Issue is not actionable because of missing required information, which needs to be provided.

Comments

@kr
Copy link
Member

kr commented Dec 14, 2012

Reported by @aqibsm in #119:

I am facing the same issue on CentOS 6.3 Beanstalkd 1.8

Dec 13 13:02:35 vm7-d2 kernel: beanstalkd[8701] general protection ip:40200a sp:7fff6c65fa10 error:0 in beanstalkd[400000+11000]
Dec 13 14:59:17 vm7-d2 kernel: beanstalkd[9993] general protection ip:40200a sp:7fff6c6f2fc0 error:0 in beanstalkd[400000+11000]

Beanstalkd 1.4 was running fine without any issue so reverting back 1.4

@kr
Copy link
Member Author

kr commented Dec 14, 2012

Can you reproduce the error? If so, would you mind compiling
beanstalkd with

make clean
make CFLAGS='-O0 -g'

and producing a stack trace?

@aqibsm
Copy link

aqibsm commented Jan 24, 2013

Hi Kr,

I could not proceed for further testing due to workload.

Could you please guide how to see general protection error as I am starting
beanstalkd from init script.

Thanks.

On 14 December 2012 13:29, Keith Rarick notifications@github.com wrote:

Can you reproduce the error? If so, would you mind compiling
beanstalkd with

make clean
make CFLAGS='-O0 -g'

and producing a stack trace?


Reply to this email directly or view it on GitHubhttps://github.com//issues/160#issuecomment-11368642.

Aaqib Salman Malik | Technical Manager

MOBIZONE PAKISTAN – An ARPUPLUS Company
Algeria | Morocco | Tunis | Egypt | Sudan | Jordan |
Palestine | Lebanon | Iraq | KSA | Yemen | UAE| Pakistan | Bangladesh |
Italy | Canada | USA |
Mob: +92.300.852.3010 | Tel: +92.51.285.6581/2 | Fax: +92.51.285.6580 - EXT
221
Email: aaqib.malik@mobizone.com.pk | Web: www.mobizone.com.pk

@aqibsm
Copy link

aqibsm commented Jan 29, 2013

Hi Kr,

I managed to get core dump.

GNU gdb (GDB) Red Hat Enterprise Linux (7.2-56.el6)
Copyright (C) 2010 Free Software Foundation, Inc.
License GPLv3+: GNU GPL version 3 or later <http://gnu.org/licenses/gpl.html

This is free software: you are free to change and redistribute it.
There is NO WARRANTY, to the extent permitted by law. Type "show copying"
and "show warranty" for details.
This GDB was configured as "x86_64-redhat-linux-gnu".
For bug reporting instructions, please see:
http://www.gnu.org/software/gdb/bugs/...
Reading symbols from /usr/local/bin/beanstalkd...done.
[New Thread 17286]
Missing separate debuginfo for
Try: yum --disablerepo='' --enablerepo='-debug*' install
/usr/lib/debug/.build-id/8e/312e8752e924c26341440ec3a032bc0e20cba3
Reading symbols from /lib64/libc.so.6...(no debugging symbols found)...done.
Loaded symbols for /lib64/libc.so.6
Reading symbols from /lib64/ld-linux-x86-64.so.2...(no debugging symbols
found)...done.
Loaded symbols for /lib64/ld-linux-x86-64.so.2
Reading symbols from /lib64/libnss_files.so.2...(no debugging symbols
found)...done.
Loaded symbols for /lib64/libnss_files.so.2
Core was generated by `/usr/local/bin/beanstalkd -l 0.0.0.0 -p 11300 -u
root -b /var/lib/beanstalkd/bi'.
Program terminated with signal 11, Segmentation fault.
#0 0x000000000040200a in connsoonestjob (c=0x14df3090) at conn.c:166
166 if (j->r.deadline_at <= (soonest ? : j)->r.deadline_at)
soonest = j;
Missing separate debuginfos, use: debuginfo-install
glibc-2.12-1.80.el6_3.6.x86_64

On 24 January 2013 10:26, Aaqib Salman Malik aqibsm@gmail.com wrote:

Hi Kr,

I could not proceed for further testing due to workload.

Could you please guide how to see general protection error as I am
starting beanstalkd from init script.

Thanks.

On 14 December 2012 13:29, Keith Rarick notifications@github.com wrote:

Can you reproduce the error? If so, would you mind compiling
beanstalkd with

make clean
make CFLAGS='-O0 -g'

and producing a stack trace?


Reply to this email directly or view it on GitHubhttps://github.com//issues/160#issuecomment-11368642.

Aaqib Salman Malik | Technical Manager

MOBIZONE PAKISTAN – An ARPUPLUS Company
Algeria | Morocco | Tunis | Egypt | Sudan | Jordan |
Palestine | Lebanon | Iraq | KSA | Yemen | UAE| Pakistan | Bangladesh |
Italy | Canada | USA |
Mob: +92.300.852.3010 | Tel: +92.51.285.6581/2 | Fax: +92.51.285.6580 -
EXT 221
Email: aaqib.malik@mobizone.com.pk | Web: www.mobizone.com.pk

Aaqib Salman Malik | Technical Manager

MOBIZONE PAKISTAN – An ARPUPLUS Company
Algeria | Morocco | Tunis | Egypt | Sudan | Jordan |
Palestine | Lebanon | Iraq | KSA | Yemen | UAE| Pakistan | Bangladesh |
Italy | Canada | USA |
Mob: +92.300.852.3010 | Tel: +92.51.285.6581/2 | Fax: +92.51.285.6580 - EXT
221
Email: aaqib.malik@mobizone.com.pk | Web: www.mobizone.com.pk

@JensRantil
Copy link
Contributor

@kr Did you have any idea on this?

@thorro
Copy link

thorro commented Nov 17, 2017

We're getting the same error on v1.10:

Program terminated with signal SIGSEGV, Segmentation fault. #0 0x00000000004021c0 in connsoonestjob (c=0x64d0f40) at conn.c:166 166 if (j->r.deadline_at <= (soonest ? : j)->r.deadline_at) soonest = j;

Anyone found the cause of this? @kr ?

@jhammer
Copy link

jhammer commented Apr 9, 2019

We have just hit this, as well:

Apr 8 17:46:37 db01 kernel: [56880694.274429] traps: beanstalkd[1048] general protection ip:40241c sp:7ffd78a06a20 error:0 in beanstalkd[400000+e000]

@ysmolski ysmolski added Help Wanted NeedsInvestigation Someone must examine and confirm this is a valid issue and not a duplicate of an existing one. and removed needs-label labels Jun 26, 2019
@ysmolski
Copy link
Member

@jhammer, what version do you run? Can you produce the stack trace?

@ysmolski ysmolski changed the title general protection error segfault in connsoonestjob at conn.c:166 Jul 1, 2019
@jhammer
Copy link

jhammer commented Jul 3, 2019

@ysmolsky version 1.9. Sorry, I do not have the stack trace. Seem pretty rare in our case. Only hit the issue once after literally years of uptime.

@thorro
Copy link

thorro commented Jul 4, 2019

There are a few stacktraces lying around here. We hit it once in a while. Lately not much, since we've moved majority of our workload to another queue manager. It seems the number of crashes correlate with workload.

@ysmolski
Copy link
Member

ysmolski commented Jul 4, 2019

@thorro could you paste those stacktraces here? Did you move to another queue manager because beanstalkd failed to handle the load?

@thorro
Copy link

thorro commented Jul 4, 2019

@ysmolsky we moved mainly because of the crashes. Link to bt:

#328 (comment)

Also, because sometimes binlog files got out of hand. Meaning, when there were a lot of jobs, binglog size grew, which is normal, but when the jobs got processed and deleted, binlog size stayed the same, it didn't shrink. But that's another issue.

@ysmolski
Copy link
Member

Without a core dump it's next to impossible to fix this kind of error. If anyone can share core dump (that will contain everything that is inside of beanstalkd), I would be glad to look at and try to fix this.
How to get a core dump for a segfault on Linux

@ysmolski ysmolski added WaitingForInfo Issue is not actionable because of missing required information, which needs to be provided. and removed Help Wanted NeedsInvestigation Someone must examine and confirm this is a valid issue and not a duplicate of an existing one. labels Aug 17, 2019
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
WaitingForInfo Issue is not actionable because of missing required information, which needs to be provided.
Projects
None yet
Development

No branches or pull requests

6 participants