session dbus-daemon crashed (SIGABRT) in libnss-systemd #15859

pabs3 · 2020-05-20T02:40:38Z

systemd version the issue has been seen with

245.5-2

Used distribution

Debian bullseye

Unexpected behaviour you saw

session dbus-daemon crashed (SIGABRT) in libnss-systemd

Steps to reproduce the problem

I don't know how to reproduce the problem but from my systemd journal it appears to be associated with something running as root using the su command to my user. I think this is the needrestart package using notify-send to switch to my user and notify me of processes needing a restart but I am not sure.

This might fix systemd#15859, a bug which I find very puzzling.

poettering · 2020-06-02T08:47:54Z

This is very puzzling. I prepped a possible fix in #16041. But I am not sure if it actually fixes anything, but it's the only thing that remotely makes sense to me.

We see EBADF on fclose() of an open_memstream() FILE*, and I am not sure how that possibly could ever happen...

Does this happen regularly for you?

pabs3 · 2020-06-02T08:54:45Z

It happens intermittently and I'm not sure how to trigger it. If you would like I can apply the patch locally and see if it fixes the issue, but I'm not sure how to tell the difference between the patch fixing the issue and the conditions to trigger the issue not occurring.

…

-- bye, pabs https://wiki.debian.org/PaulWise

poettering · 2020-06-02T09:44:45Z

do you have any special NSS setup btw? ldap or so? lots of users/groups or so?

If the issue doesn't pop up with the patch applied anymore we should probably close this and assume it fixed until it pops up again and then reopen, or so?

pabs3 · 2020-06-02T12:56:22Z

No special NSS setup, just a standalone desktop system. Two real users and 73 system users for daemons etc. I'll apply the patch tomorrow and report back at the end of the month if there have been any dbus-daemon crashes or not.

…

-- bye, pabs https://wiki.debian.org/PaulWise

This might fix #15859, a bug which I find very puzzling.

pabs3 · 2020-06-03T02:28:43Z

Applied the patch to my local system, will report any issues I see.

…

-- bye, pabs https://bonedaddy.net/pabs3/

pabs3 · 2020-07-17T02:24:13Z

Unfortunately I just got another pair of crashes with Debian systemd 245.6-2 with the patch cherry-picked on top. Attached the backtraces: https://github.com/systemd/systemd/files/4935409/crashes.txt

poettering · 2020-07-21T08:15:10Z

Does the version you tested include 75f6d5d?

mbiebl · 2020-07-21T10:55:35Z

Does the version you tested include 75f6d5d?

I assume so, given the comment "...with the patch cherry-picked on top."

poettering · 2020-07-21T11:27:07Z

did you reboot after patching/rebuilding/installing systemd? NSS modules remain pinned in running processes... only way to update them safely is to reboot?

pabs3 · 2020-07-22T03:15:35Z

The patch was included in the systemd I was testing. The upgrade to the patched version occurred 2020-07-08 14:58:06 The crashes occurred after a boot at 2020-07-17 09:34:50 The crash occurred 2020-07-17 10:00:33 Looking at my systemd journal log, the crash appears to be associated with one of my cron jobs. All of my cron jobs have special environment variables set to be able to identify their processes. Looking at the environment variables in the dbus-daemon core dumps, it appears to be one that invokes `nm-online -q`. In addition to the special environment variables, my cron jobs set DISPLAY=:0 which IIRC was required to make evolution address-export and other things requiring dbus work in cron. Since adding DISPLAY=:0 I have switched to Wayland but I didn't yet add WAYLAND_DISPLAY=wayland-0 to my cron jobs. So perhaps the nm-online failed to contact the session dbus-daemon (although it seems to work most of the time) and started a new dbus-daemon, which didn't like the environment it was in and passed incorrect things to libnss-systemd?

…

-- bye, pabs https://bonedaddy.net/pabs3/

keszybz · 2020-07-22T10:15:24Z

It is possible that the crash is caused by memory corruption in some other part of the code. I looked at the code involved and don't see anything obvious either. I guess we'll need to wait and see if other people hit this.

keszybz · 2020-07-27T10:20:04Z

https://bugzilla.redhat.com/show_bug.cgi?id=1823038 is another case.

keszybz · 2020-07-28T14:16:36Z

@fweimer, @codonell maybe you could take a look? The code seems correct, but when we do fclose() on the stream allocated with open_memstream(), we get EBADF.

codonell · 2020-07-29T20:01:06Z

@keszybz The storage backing the FILE* is allocated by malloc by __open_memstream() and so is easily susceptible to buffer overflows from nearby chunks. In general it looks like you only use open_memstream_unlocked() from src/basic/fileio.c, and so any failure to coordinate by the callers could result in corruption. I looked over the code in src/basic/fd-util.c and I don't see anything immediately wrong. These cases are hard to track down :-(

keszybz · 2020-07-31T15:53:31Z

In general it looks like you only use open_memstream_unlocked() from src/basic/fileio.c, and so any failure to coordinate by the callers could result in corruption.

There is always exactly one caller — the memstream object is never passed outside of the originating function. (In the whole codebase there is one exception in dbus introspection code, but that's code path is not touched here.) So there is no question of coordination, afaict.

keszybz · 2020-08-01T10:01:08Z

fclose may need to allocate space for the terminating NUL byte.

But can it return EBADF in that case? We only check that the errno we got is not EBADF.

fweimer · 2020-08-02T07:14:11Z

No, you won't get EBADF in that case, and the allocation during fclose will not happen anyway because of the previous fflush call, which is what actually allocates.

pabs3 · 2020-08-04T03:19:10Z

FTR: I got another pair of crashes with libnss-systemd 245.7-1 from Debian bullseye, AFAICT this version includes the patch from above. I'm assuming that the backtrace isn't going to be interesting but if it is please let me know before it is auto-deleted in a week's time.

…

-- bye, pabs https://bonedaddy.net/pabs3/

keszybz · 2020-08-04T11:05:35Z

I think we need to go over the glibc code with a fine comb and figure out in what circumstances it can return EBADF. Maybe EBADF is a legitimate return value for memstreams?

fweimer · 2020-09-14T07:06:59Z

@keszybz I rather suspect this is the consequence of unrelated memory corruption (but I could be wrong).

pabs3 · 2020-10-11T01:03:00Z

FTR: I got another pair of crashes with libnss-systemd 246.6-1 from Debian bullseye. I'm assuming that the backtrace isn't going to be interesting but if it is please let me know before it is auto-deleted in a week's time.

This might fix systemd#15859, a bug which I find very puzzling. (cherry picked from commit 75f6d5d)

poettering · 2023-06-05T17:11:03Z

Is this still reproducible with current versions of systemd/glibc? If not, let's close this

pabs3 · 2023-06-06T02:35:00Z

The dbus-daemon crash appears to be fixed for some time now, not seeing it with systemd 252.6-1 and glibc 2.36-9 from Debian bookworm.

…

-- bye, pabs https://bonedaddy.net/pabs3/

yuwata · 2023-06-06T04:43:15Z

Thanks. Then, let's close this.

poettering added bug 🐛 Programming errors, that need preferential fixing nss labels Jun 2, 2020

poettering added a commit to poettering/systemd that referenced this issue Jun 2, 2020

fd-util: be more careful with fclose() errnos

d00bdc4

This might fix systemd#15859, a bug which I find very puzzling.

poettering mentioned this issue Jun 2, 2020

fd-util: be more careful with fclose() errnos #16041

Merged

poettering added the needs-reporter-feedback ❓ There's an unanswered question, the reporter needs to answer label Jun 2, 2020

poettering closed this as completed in #16041 Jun 2, 2020

poettering added a commit that referenced this issue Jun 2, 2020

fd-util: be more careful with fclose() errnos

75f6d5d

This might fix #15859, a bug which I find very puzzling.

mbiebl reopened this Jul 20, 2020

mrc0mmand removed the needs-reporter-feedback ❓ There's an unanswered question, the reporter needs to answer label Jul 20, 2020

vbatts pushed a commit to kinvolk/systemd that referenced this issue Nov 12, 2020

fd-util: be more careful with fclose() errnos

bbe45b5

This might fix systemd#15859, a bug which I find very puzzling. (cherry picked from commit 75f6d5d)

vbatts pushed a commit to kinvolk/systemd that referenced this issue Nov 12, 2020

fd-util: be more careful with fclose() errnos

121a53a

This might fix systemd#15859, a bug which I find very puzzling. (cherry picked from commit 75f6d5d)

yuwata closed this as completed Jun 6, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

session dbus-daemon crashed (SIGABRT) in libnss-systemd #15859

session dbus-daemon crashed (SIGABRT) in libnss-systemd #15859

pabs3 commented May 20, 2020

poettering commented Jun 2, 2020

pabs3 commented Jun 2, 2020 via email

poettering commented Jun 2, 2020

pabs3 commented Jun 2, 2020 via email

pabs3 commented Jun 3, 2020 via email

pabs3 commented Jul 17, 2020 via email •

edited

poettering commented Jul 21, 2020

mbiebl commented Jul 21, 2020

poettering commented Jul 21, 2020

pabs3 commented Jul 22, 2020 via email

keszybz commented Jul 22, 2020

keszybz commented Jul 27, 2020

keszybz commented Jul 28, 2020

codonell commented Jul 29, 2020

keszybz commented Jul 31, 2020

keszybz commented Aug 1, 2020

fweimer commented Aug 2, 2020

pabs3 commented Aug 4, 2020 via email

keszybz commented Aug 4, 2020

fweimer commented Sep 14, 2020

pabs3 commented Oct 11, 2020

poettering commented Jun 5, 2023

pabs3 commented Jun 6, 2023 via email

yuwata commented Jun 6, 2023

session dbus-daemon crashed (SIGABRT) in libnss-systemd #15859

session dbus-daemon crashed (SIGABRT) in libnss-systemd #15859

Comments

pabs3 commented May 20, 2020

poettering commented Jun 2, 2020

pabs3 commented Jun 2, 2020 via email

poettering commented Jun 2, 2020

pabs3 commented Jun 2, 2020 via email

pabs3 commented Jun 3, 2020 via email

pabs3 commented Jul 17, 2020 via email • edited

poettering commented Jul 21, 2020

mbiebl commented Jul 21, 2020

poettering commented Jul 21, 2020

pabs3 commented Jul 22, 2020 via email

keszybz commented Jul 22, 2020

keszybz commented Jul 27, 2020

keszybz commented Jul 28, 2020

codonell commented Jul 29, 2020

keszybz commented Jul 31, 2020

keszybz commented Aug 1, 2020

fweimer commented Aug 2, 2020

pabs3 commented Aug 4, 2020 via email

keszybz commented Aug 4, 2020

fweimer commented Sep 14, 2020

pabs3 commented Oct 11, 2020

poettering commented Jun 5, 2023

pabs3 commented Jun 6, 2023 via email

yuwata commented Jun 6, 2023

pabs3 commented Jul 17, 2020 via email •

edited