EOS crash in XrdSecProtocolgsi::Authenticate #146

Closed
ljanyst opened this issue Oct 10, 2014 · 7 comments

@ljanyst
Contributor

ljanyst commented Oct 10, 2014

error: received signal 11:
/usr/lib64/libXrdEosMgm.so(xrdmgmofs_stacktrace(int)+0x44)[0x2b64a43ad9b4]
/lib64/libc.so.6[0x36bc4302d0]
/usr/lib64/libXrdSecgsi.so(XrdSecProtocolgsi::Authenticate(XrdSecBuffer*, XrdSecBuffer**, XrdOucErrInfo*)+0x58)[0x2b64a3c23a78]
/usr//bin/xrootd(XrdXrootdProtocol::do_Auth()+0x123)[0x426233]
/usr/lib64/libXrdUtils.so.1(XrdLink::DoIt()+0x30)[0x362144f5a0]
/usr/lib64/libXrdUtils.so.1(XrdScheduler::Run()+0xd6)[0x3621453916]
/usr/lib64/libXrdUtils.so.1(XrdStartWorking(void*)+0x9)[0x3621453aa9]
/usr/lib64/libXrdUtils.so.1(XrdSysThread_Xeq+0x3f)[0x362142019f]
/lib64/libpthread.so.0[0x36bcc0683d]
/lib64/libc.so.6(clone+0x6d)[0x36bc4d526d]
#########################################################################
# stack trace exec=xrootd pid=26523 what='thread apply all bt'
#########################################################################
[..]
# warning: failed to parse the thread responsible for signal [0 0 4294967295]n#########################################################################
141010 09:44:24 17651 Scalla is starting. . .

The problem seems to be at this line, with hs containing garbage: https://github.com/xrootd/xrootd/blob/v3.3.6-4.CERN/src/XrdSecgsi/XrdSecProtocolgsi.cc#L1648

Reference: https://its.cern.ch/jira/browse/EOS-1086

@abh3
Member

abh3 commented Dec 3, 2014

Is this still a problem?

@ljanyst
Contributor Author

ljanyst commented Dec 3, 2014

We hit it only once so far, so it's not urgent, but the issue has not been solved.

@abh3
Member

abh3 commented Dec 3, 2014

Hopefully, we can get a core file when it happens again. Even a stack trace with line numbers would be useful.

@ljanyst
Contributor Author

ljanyst commented Dec 3, 2014

You can derive it from the offsets. It happens here: https://github.com/xrootd/xrootd/blob/v3.3.6-4.CERN/src/XrdSecgsi/XrdSecProtocolgsi.cc#L1648
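
As a rough illustration of working from the offsets, the sketch below pulls the module, symbol and offset out of frames in the `module(symbol+0xNN)[0xADDR]` form used in the trace above. The names `FRAME_RE`, `Frame` and `parse_frame` are hypothetical helpers, not part of xrootd or EOS. Mapping the resulting symbol and offset to a source line still needs debug information for exactly the same build (for example via gdb's `info line *ADDRESS`), which is assumed to be available.

```python
import re
from typing import NamedTuple, Optional

# Frame layout: /path/lib.so(symbol+0x58)[0x2b64a3c23a78]
# The "+" is treated as optional because it is sometimes lost when a
# trace is copied around; frames without a symbol part are accepted too.
FRAME_RE = re.compile(
    r"^(?P<module>[^()\[\]]+)"
    r"(?:\((?P<symbol>.*?)\s*\+?\s*(?P<offset>0x[0-9a-fA-F]+)\))?"
    r"\[(?P<addr>0x[0-9a-fA-F]+)\]$"
)


class Frame(NamedTuple):
    module: str
    symbol: Optional[str]   # None when the frame has no symbol part
    offset: Optional[int]   # byte offset from the start of the symbol
    address: int            # absolute return address


def parse_frame(line: str) -> Optional[Frame]:
    """Return the parsed frame, or None if the line is not a frame."""
    m = FRAME_RE.match(line.strip())
    if m is None:
        return None
    offset = m.group("offset")
    return Frame(
        module=m.group("module"),
        symbol=m.group("symbol"),
        offset=int(offset, 16) if offset else None,
        address=int(m.group("addr"), 16),
    )


if __name__ == "__main__":
    frame = parse_frame(
        "/usr/lib64/libXrdSecgsi.so(XrdSecProtocolgsi::Authenticate("
        "XrdSecBuffer*, XrdSecBuffer**, XrdOucErrInfo*)+0x58)[0x2b64a3c23a78]"
    )
    print(frame.module, frame.symbol, hex(frame.offset))
```

The symbol and offset of the crashing frame are the pieces that identify the instruction inside `Authenticate`; the absolute address on its own is less useful because the library load address can differ between runs.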

@abh3
Member

abh3 commented Dec 3, 2014

Yes, I saw that. So, obviously the hs pointer is wrong. Here a core file would help.

@ljanyst
Contributor Author

ljanyst commented Dec 3, 2014

It may be a race. This is the EOS namespace, so the core is >80GB.

@abh3
Member

abh3 commented Jul 11, 2015

Apparently, there have been no additional crashes since this report. So, I am closing it. We can reopen if this occurs again.

abh3 closed this as completed Jul 11, 2015