New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Full symbols in a libfabric stack trace? #7939
Comments
Sometimes I find I need to explicitly set CFLAGS="-g -O0" to fully enable the gdb-able build. |
I'll give that a try. Thanks. |
That's weird. What is the output of |
however, the backtrace still looks like this:
|
Looks like a glibc backtrace - you can resolve this with addr2line or eu-addr2line. I had created a script for our project to automate that. Or use libbacktrace, which provides auto resolved lines (unless debug symbols are stripped). |
@aakefbs - I still can't figure out why abort() didn't produce the function names, but addr2line worked perfectly. Thanks. |
CFLAGS = -g -O0 -Wall -Wundef -Wpointer-arith -Wextra -Wno-unused-parameter -Wno-sign -compare -Wno-missing-field-initializers -fstack-protector-strong -fvisibility=hidde n -Wall -Wundef -Wpointer-arith
Michael Heinz
End-to-End Network Software Engineer
***@***.******@***.***>
From: Jianxin Xiong ***@***.***>
Sent: Wednesday, August 10, 2022 11:34 AM
To: ofiwg/libfabric ***@***.***>
Cc: Heinz, Michael ***@***.***>; Author ***@***.***>
Subject: Re: [ofiwg/libfabric] Full symbols in a libfabric stack trace? (Issue #7939)
That's weird. What is the output of grep ^CFLAGS Makefile?
—
Reply to this email directly, view it on GitHub<#7939 (comment)>, or unsubscribe<https://github.com/notifications/unsubscribe-auth/ACTE6PS3PLTPTBUXE7MQR7LVYPDVVANCNFSM56E2XWJQ>.
You are receiving this because you authored the thread.Message ID: ***@***.***>
|
Hey, guys,
I’m trying to track down what expresses as a PSM3 error report but which I suspect is a NCCL bug. To do that I’m trying to get a symbolic stack trace of the executable when I call abort() inside PSM3 – but simply adding –enable-debug to the libfabric configure doesn’t seem to work.
Any ideas? The current configure I'm using is:
./autogen.sh && ./configure --prefix=${HOME} --enable-debug --with-cuda=/usr/local/cuda-11.6 --enable-cuda-dlopen --enable-only --enable-psm3
The text was updated successfully, but these errors were encountered: