Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Opening steam with running google chrome randomly hang/crash whole system #9793

Open
Pharmasolin opened this issue Jul 4, 2023 · 15 comments

Comments

@Pharmasolin
Copy link

Pharmasolin commented Jul 4, 2023

Your system information

  • Steam client version (build number or date): 1688428930 3 july
  • Distribution (e.g. Ubuntu): Ubuntu 22.04.2 LTS
  • Opted into Steam client beta?: [Yes/No] Yes
  • Have you checked for system updates?: [Yes/No] Yes (all is up to date, with latest stable kernel 6.4 and mesa from kisak ppa)
  • Steam Logs: steam-logs.tar.gz
  • GPU: AMD

Please describe your issue in as much detail as possible:

I noticed that from time to time when I have opened google chrome and I open steam in few moments I will see popup that steam/google chrome is not responding, and if I will kill any of the crashed apps - whole system will hang and crash. I will need to power off and power on my laptop.

Steps for reproducing this issue:

  1. No clear steps, I have opened google chrome, then open steam and randomly this steam app startup will crash my whole system. This started when steam switched to a new UI.

--
I will re-generate logs again after my OS will crash again and I will reboot PC, may be those logs will be more helpful.
https://gist.github.com/Pharmasolin/43290fc327cf54b7b64a04c19dff8c52

@kisak-valve
Copy link
Member

Hello @Pharmasolin, in general, no OpenGL application including Steam should be capable of taking down the whole system and symptoms like this usually hint towards a video driver issue. It might be worthwhile to also mention this issue to your video driver vendor.

@Pharmasolin
Copy link
Author

Pharmasolin commented Jul 4, 2023

@kisak-valve I'm afraid that's the issue with the gnome/mutter version on LTS Ubuntu. Because the desktop hangs I can't do anything but music from Spotify is playing.

And there was a bug with chromium apps in gnome. Please let this issue be open for a while, maybe someone will add more context.

@Pharmasolin
Copy link
Author

Pharmasolin commented Jul 5, 2023

This is the output from journalctl -b -1 | grep -v 'slack.desktop\[1712\]' (log where I turned on steam, I hid mentions of the slack in the log, because there are a lot of info/warnings from it there). Basically I had turned on google chrome, slack and steam (all chromium based apps?) Not sure if this bug related to steam itself and where to report. If someone will read this issue, please let me know where I should report this one.

лип 04 23:00:45 denys-Legion-5-15ACH6A steam.desktop[72282]: ExecuteSteamURL: "steam://open/steam://settings/"
лип 04 23:00:56 denys-Legion-5-15ACH6A systemd[1092]: vte-spawn-2fe99b25-4477-41e6-b715-6a96aaf3296d.scope: Consumed 14.748s CPU time.
лип 04 23:02:15 denys-Legion-5-15ACH6A dbus-daemon[1135]: [session uid=1000 pid=1135] Activating service name='org.gnome.Nautilus' requested by ':1.34' (uid=1000 pid=1292 comm="/usr/bin/gnome-shell ")
лип 04 23:02:15 denys-Legion-5-15ACH6A dbus-daemon[1135]: [session uid=1000 pid=1135] Successfully activated service 'org.gnome.Nautilus'
лип 04 23:02:15 denys-Legion-5-15ACH6A org.gnome.Nautilus[80663]: Initializing nautilus-dropbox 2020.03.04
лип 04 23:02:15 denys-Legion-5-15ACH6A dbus-daemon[688]: [system] Activating via systemd: service name='org.freedesktop.hostname1' unit='dbus-org.freedesktop.hostname1.service' requested by ':1.259' (uid=1000 pid=80663 comm="/usr/bin/nautilus --gapplication-service ")
лип 04 23:02:15 denys-Legion-5-15ACH6A systemd[1]: Starting Hostname Service...
лип 04 23:02:15 denys-Legion-5-15ACH6A nautilus[80663]: Called "net usershare info" but it failed: Failed to execute child process “net” (No such file or directory)
лип 04 23:02:15 denys-Legion-5-15ACH6A dbus-daemon[688]: [system] Successfully activated service 'org.freedesktop.hostname1'
лип 04 23:02:15 denys-Legion-5-15ACH6A systemd[1]: Started Hostname Service.
лип 04 23:02:38 denys-Legion-5-15ACH6A nautilus[80663]: Called "net usershare info" but it failed: Failed to execute child process “net” (No such file or directory)
лип 04 23:02:45 denys-Legion-5-15ACH6A systemd[1]: systemd-hostnamed.service: Deactivated successfully.
лип 04 23:02:54 denys-Legion-5-15ACH6A gnome-shell[1292]: meta_window_set_stack_position_no_sync: assertion 'window->stack_position >= 0' failed
лип 04 23:04:34 denys-Legion-5-15ACH6A systemd[1092]: Started Application launched by gnome-shell.
лип 04 23:04:34 denys-Legion-5-15ACH6A gnome-keyring-daemon[1131]: asked to register item /org/freedesktop/secrets/collection/login/2, but it's already registered
лип 04 23:04:34 denys-Legion-5-15ACH6A google-chrome.desktop[80892]: libva error: vaGetDriverNameByIndex() failed with unknown libva error, driver_name = (null)
лип 04 23:04:50 denys-Legion-5-15ACH6A systemd[1092]: Started Application launched by gnome-shell.
лип 04 23:04:51 denys-Legion-5-15ACH6A gnome-system-monitor.desktop[81374]: glibtop(c=81374): [WARNING] statvfs '/run/user/1000/doc' failed: Operation not permitted
лип 04 23:04:51 denys-Legion-5-15ACH6A gnome-system-monitor.desktop[81374]: glibtop(c=81374): [WARNING] statvfs '/run/user/1000/doc' failed: Operation not permitted
лип 04 23:04:51 denys-Legion-5-15ACH6A gnome-system-monitor.desktop[81374]: glibtop(c=81374): [WARNING] statvfs '/run/user/1000/doc' failed: Operation not permitted
лип 04 23:04:51 denys-Legion-5-15ACH6A gnome-system-monitor.desktop[81374]: glibtop(c=81374): [WARNING] statvfs '/run/user/1000/doc' failed: Operation not permitted
лип 04 23:04:51 denys-Legion-5-15ACH6A gnome-shell[1292]: Window manager warning: Ping serial 36632114 was reused for window W452, previous use was for window 0x2000010.
лип 04 23:04:54 denys-Legion-5-15ACH6A gnome-shell[1292]: Window manager warning: Overwriting existing binding of keysym 37 with keysym 37 (keycode 10).
лип 04 23:04:54 denys-Legion-5-15ACH6A gnome-shell[1292]: Window manager warning: Overwriting existing binding of keysym 38 with keysym 38 (keycode 11).
лип 04 23:04:54 denys-Legion-5-15ACH6A gnome-shell[1292]: Window manager warning: Overwriting existing binding of keysym 31 with keysym 31 (keycode a).
лип 04 23:04:54 denys-Legion-5-15ACH6A gnome-shell[1292]: Window manager warning: Overwriting existing binding of keysym 32 with keysym 32 (keycode b).
лип 04 23:04:54 denys-Legion-5-15ACH6A gnome-shell[1292]: Window manager warning: Overwriting existing binding of keysym 34 with keysym 34 (keycode d).
лип 04 23:04:54 denys-Legion-5-15ACH6A gnome-shell[1292]: Window manager warning: Overwriting existing binding of keysym 35 with keysym 35 (keycode e).
лип 04 23:04:54 denys-Legion-5-15ACH6A gnome-shell[1292]: Window manager warning: Overwriting existing binding of keysym 36 with keysym 36 (keycode f).
лип 04 23:04:54 denys-Legion-5-15ACH6A gnome-shell[1292]: Window manager warning: Overwriting existing binding of keysym 33 with keysym 33 (keycode c).
лип 04 23:04:54 denys-Legion-5-15ACH6A gnome-shell[1292]: Window manager warning: Overwriting existing binding of keysym 39 with keysym 39 (keycode 12).
лип 04 23:04:54 denys-Legion-5-15ACH6A gnome-system-mo[81374]: gtk_widget_event: assertion 'WIDGET_REALIZED_FOR_EVENT (widget, event)' failed
лип 04 23:05:00 denys-Legion-5-15ACH6A google-chrome.desktop[80892]: [80932:80939:0704/230500.951511:ERROR:ssl_client_socket_impl.cc(978)] handshake failed; returned -1, SSL error code 1, net_error -3
лип 04 23:05:00 denys-Legion-5-15ACH6A systemd[1092]: app-gnome-google\x2dchrome-80886.scope: Consumed 17.072s CPU time.
лип 04 23:05:02 denys-Legion-5-15ACH6A kernel: GpuWatchdog[72412]: segfault at 0 ip 00007fd3f5d929a6 sp 00007fd3ea9fd370 error 6 in libcef.so[7fd3f18ef000+7770000] likely on CPU 14 (core 7, socket 0)
лип 04 23:05:02 denys-Legion-5-15ACH6A kernel: Code: 89 de e8 0d ef 6e ff 80 7d cf 00 79 09 48 8b 7d b8 e8 4e 66 2c 03 41 8b 84 24 e0 00 00 00 89 45 b8 48 8d 7d b8 e8 5a d3 b5 fb <c7> 04 25 00 00 00 00 37 13 00 00 48 83 c4 38 5b 41 5c 41 5d 41 5e
лип 04 23:05:15 denys-Legion-5-15ACH6A gnome-shell[1292]: Window manager warning: Ping serial 36655684 was reused for window 0x4000003, previous use was for window W452.
лип 04 23:05:16 denys-Legion-5-15ACH6A gnome-system-monitor.desktop[81374]: glibtop(c=81374): [WARNING] statvfs '/run/user/1000/doc' failed: Operation not permitted

@Pharmasolin
Copy link
Author

Pharmasolin commented Jul 5, 2023

ChatGPT:

The line GpuWatchdog[72412]: segfault at 0 ip 00007fd3f5d929a6 sp 00007fd3ea9fd370 error 6 in libcef.so[7fd3f18ef000+7770000] indicates that there was a segmentation fault (segfault) within the libcef.so library. This library is part of the Chromium Embedded Framework (CEF), which is used by many applications to embed web content.

In this case, the log mentions "GpuWatchdog", which is a process that watches for GPU (Graphics Processing Unit) tasks that are taking too long. If it detects a long-running GPU task, it terminates it to prevent the entire system from becoming unresponsive.

The segfault means that the GpuWatchdog process tried to access memory that it wasn't allowed to, which is often a sign of a software bug. It could be a bug in the GPU drivers, in the Chromium Embedded Framework, or in the way the application is using them.

If you're seeing this error repeatedly, it might be worth trying a few things:

Updating your system: Make sure your system is fully updated. This includes the operating system, the application, and any drivers. If it's a known bug, there's a chance it's been fixed in a newer version.

Switching to a different GPU driver: If you're using proprietary GPU drivers, you could try switching to the open-source ones, or vice versa.

Disabling hardware acceleration: Some applications offer an option to disable hardware acceleration, which can sometimes work around issues like this.

Reporting the bug: If none of the above helps, consider reporting the bug to the developers of the application or the GPU driver. Make sure to provide them with as much information as possible, including the full error message and the circumstances under which it occurs.

@melynx
Copy link

melynx commented Jul 16, 2023

I'm actually having the same issue here. Running Ubuntu 23.04 with a thunderbolt 3 eGPU enclosure with a Radeon Vega 64. Running Chrome using "integrated graphics" with steam running seem to cause the random crash + lockup as described. However, it seems like by running Chrome using the "dedicated gpu" (DRI PRIME offloading) seems to not result in the random crash, at least for the limited time I'm doing it. (There is a bug that causes Steam to crash loop if it is run using PRIME)

@calindan2013
Copy link

calindan2013 commented Jul 25, 2023

same issue here, Ryzen 5900X + Radeon 6900XT, Debian 12.1, GNOME 43.6, running Chrome + Steam in background and other apps, I suddenly get a hard freeze of the desktop, keyboard becomes unresponsive and mouse lights go off, no way of switching to a terminal or doing anything except keeping power button pressed to shut down. Tried all suggested option in the BIOS for hard freezes involving AMD processors, but keeps happening every week:

root@localhost:~# tac /var/log/syslog | grep "libcef"
2023-07-25T01:17:45.406198+01:00 localhost kernel: [14382.710339] GpuWatchdog[14867]: segfault at 0 ip 00007f87bf1929a6 sp 00007f87b62a68f0 error 6 in libcef.so[7f87bacef000+7770000] likely on CPU 8 (core 10, socket 0)
2023-07-24T21:22:45.254209+01:00 localhost kernel: [ 282.915405] GpuWatchdog[11950]: segfault at 0 ip 00007f87bf1929a6 sp 00007f87b62a68f0 error 6 in libcef.so[7f87bacef000+7770000] likely on CPU 13 (core 1, socket 0)
2023-07-24T11:16:43.093311+01:00 localhost kernel: [ 1088.729831] GpuWatchdog[15198]: segfault at 0 ip 00007f7330f929a6 sp 00007f732804e8f0 error 6 in libcef.so[7f732caef000+7770000] likely on CPU 15 (core 3, socket 0)
2023-07-23T10:25:17.987090+01:00 localhost kernel: [ 523.786541] GpuWatchdog[11285]: segfault at 0 ip 00007fae835929a6 sp 00007fae7a5508f0 error 6 in libcef.so[7fae7f0ef000+7770000] likely on CPU 1 (core 1, socket 0)
2023-07-23T01:32:17.746643+01:00 localhost kernel: [ 9338.927904] GpuWatchdog[351214]: segfault at 0 ip 00007f24a11929a6 sp 00007f24982248f0 error 6 in libcef.so[7f249ccef000+7770000] likely on CPU 21 (core 11, socket 0)

@Pharmasolin
Copy link
Author

@calindan2013 fyi, disabling hardware acceleration in steam settings fixes this issue.

@calindan2013
Copy link

yes that worked thanks. still, this situation should not be allowed to degenerate into a complete system freeze by the kernel and/or its video hardware modules.

@Pharmasolin
Copy link
Author

yes that worked thanks. still, this situation should not be allowed to degenerate into a complete system freeze by the kernel and/or its video hardware modules.

I don't know how to report this bug up in a chain and whose bug it is. CEF, mesa (radeonsi(?)), or gnome (mutter?)

@user1-github
Copy link

user1-github commented Aug 20, 2023

After 3 days of headache and lots of googling, trying to figure out why does this happen to me on Debian 12.1 while Steam Flatpak is downloading games, I've come to the conclusion that this is caused by yet another rare bug in Chromium (which is unfortunately full of those). Either that, or the way the specific app utilizes Chromium (For those who don't know, Steam client embeds Chromium for webpages and web uis). I mean I've seen people had this exact same GpuWatchdog segfault with electron and other Chromium apps as well and in those cases it also caused a full gui freeze. This also seem to happen on any gpu vendor / driver on Linux, both open source and proprietary and on any DE, with both X and Wayland. So the solution to this is either Chromium devs should fix this, or maybe Steam can have some workaround to prevent this issue.

Since I didn't experience this before the big Steam ui update, I think it might have something to do with the fact that the Steam client now makes heavier use of web uis.

@user1-github
Copy link

Hello @Pharmasolin, in general, no OpenGL application including Steam should be capable of taking down the whole system and symptoms like this usually hint towards a video driver issue. It might be worthwhile to also mention this issue to your video driver vendor.

@kisak-valve But they can. See my comment above. The root cause of the issue is most likely a Chromium bug. The fact that it takes the whole gui down is a flaw in the Linux gui stack (this is one of the few areas where Windows is ahead of Linux, because just like this issue, incorrect graphics api usage on Windows also doesn't take the Windows gui down ((unless it's a bug in the kernel driver))).

@zerocrates
Copy link

I've been having this issue since the new UI also (Ubuntu 23.04, on Intel integrated graphics).

I'm using Wayland, so there's a wrinkle: this doesn't take down the whole system UI, just X programs. This means that the problem is recoverable: killing Xwayland (it has to be a kill -9, the regular TERM signal won't do anything) will kill Steam and anything else using X that was running at the time, but they can then just be restarted.

My main workaround has been to have the actual Steam UI open less. The hang will happen quite often if Steam is just sitting in the background all the time, but more or less will never happen if Steam's actual windows are closed and it's just a notification icon. But the risk remains that the moment you do go to briefly interact with it will cause the hang.

@dsalt
Copy link

dsalt commented Nov 14, 2023

The same crash occasionally happens here. There's no desktop hang, though. (Linux 6.5.9, Mesa 23.2.1, RX 6600 XT; Devuan daedalus.)

@lologarithm
Copy link

lologarithm commented Nov 30, 2023

I am having the same issue, I am unsure if its from steam, discord, spotify, or any number of other apps that might use chromium.

hardware: amd 5950x, radeon 6900XT,
versions: Linux 6.3.13, kisak-mesa 23.2.1

example log:

[30705.371465] GpuWatchdog[646929]: segfault at 0 ip 00007f88ba792bc6 sp 00007f88af5fd430 error 6 in libcef.so[7f88b62ef000+7770000] likely on CPU 13 (core 13, socket 0)

Willing to do any debugging or grabbing of logs if it helps someone.

@KenSharp
Copy link

KenSharp commented Dec 8, 2023

This affects more than just Steam. Users of the Electron framework could help by working together.

https://bugs.launchpad.net/ubuntu/+source/linux-signed-hwe-6.2/+bug/2045951

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

9 participants