New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
mythbackend hangs on startup with kernel 6.3.7 and above, and is not killable. #761
Comments
After my Fedora 37 machine upgraded to kernel 6.3.7 I started having the backend service fail on start. I'd been investigating mariadb with help from the Mythtv forums as I noticed messages like this from it's log:
However I cannot fault the database at all. Also once mythbackend service starts it cannot be stopped, getting stuck in a 'D' state - also an indicator that the problem may be kernel related. FWIW I don't use KDE, I'm using LXDE.
|
Hopefully this bug is redundant and a kernel fix is coming, as commented by John in the mailing list: https://lists.mythtv.org/pipermail/mythtv-users/2023-June/412038.html |
And I received this link via the elrepo team: https://bugzilla.kernel.org/show_bug.cgi?id=217566 with an expectation that Fedora packages will be available later this week. |
After I had restored the rpmfusion nvidia akmod packages erased while aiming to prevent unwanted kernel updates, mythbackend appears to be happy with the 6.3.9 release-candidate kernel: uname -r The mythbackend is a recent master with PR 752. It calls itself v34-Pre-287-78171a7dcf Note that Gary's comment on the mythtv users list suggests that this is probably not the final dvb-related kernel fix. |
I realize this problem is caused by the kernel but would it be possible for mythbackend to handle situations like this gracefully? It's a pain that you have to reboot in order to kill it. |
This issue appears to have been fixed with the release of kernel 6.3.9, on my fedora 37 aand 'el7' boxes. https://mirrors.edge.kernel.org/pub/linux/kernel/v6.x/ChangeLog-6.3.9 Fix applied: Revert "media: dvb-core: Fix use-after-free on race condition at dvb_frontend" |
@Jpilk how did you get kernel 6.3.9 on Fedora 37? Are you running beta packages for Fedora? My system updated to 6.3.8 which also has the problem and now is having issues (possibly NVIDIA driver related) going back to booting 6.3.6 which leaves me with no working kernel. Thanks! |
I don't usually run test kernels. The one I have now came as an update of the one from here. https://bugzilla.kernel.org/show_bug.cgi?id=217566#c13 I haven't yet seen 6.3.9 in the fedora updates repo. [john@HPFed ~]$ uname -rsvp |
kernel 6.3.9 is currently in the test-updates repo (it has not yet pushed to updates as it has not completed the entire testing criteria) for Fedora. You can use something of the form
to get the kernel from the updates-testing repo now. You can also pull the rpms directly from the fedora koji build system, or use the command suggested on the bodhi kernel update gating pages for your release (or just do your own mock build from the fedora sources, as usual). |
Thanks @garybuhrmaster I was eventually able to get kernel 6.3.6 working again, however it seems there's flickering in new recordings. Specifically the picture flickers a few pixels up and down (on the vertical axis only) rapidly during all recordings. It could be something specific to my setup but could you please keep an eye out for recording quality losses or glitches when testing new kernels? EDIT: A cold boot of my system again seems to have resolved it. I guess it was a playback problem and probably unrelated to this issue. |
It looks to me as if the current holdup is with ryzen in F38. |
The package-manager ikon just offered a new test kernel for Fedora 37. mythbackend looks fine with it. The repo list was from https://copr.fedorainfracloud.org/coprs/g/kernel-vanilla/stable-rc/ [john@HPFed ~]$ uname -a |
And 6.4.0 seems ok too, in this build from elrepo :-) [john@HP_Box ~]$ uname -a 2023-06-27 10:09:56.844823 C mythbackend version: HEAD [v32.0-749472ba33] www.mythtv.org |
I'm confused about the Fedora release process, but again packagekit has installed an update which is working fine for me. And that included a successful DVB-T/T2 rescan. [john@HPFed ~]$ uname -a 2023-06-29 10:49:40.938983 C mythbackend version: HEAD [v34-Pre-45de13278b] www.mythtv.org |
A dnf upgrade today has installed kernel-6.4.1-0.rc3.20230630gt94976aa9.758.vanilla.fc37.x86_64 This has failed to complete booting. Booting pauses at 'Terminate Plymouth Boot Screen', then fails to start abrtd.service and hangs citing Dependency failures for ABRT kernel log watcher, kernel panic detection, and others. I waited for what seemed a long time without any change. Then the long-press on the power button. REISUB said all except S is disabled. Rebooting 6.3.10-250.vanilla got MythTV working again, but only after going through the same steps, including the Dependency failure warnings. Eventually it moved on. |
This system has an nvidia card using the rpmfusion 470xx drivers. The build by akmods failed after kernel installation and before the attempted reboot. 'sudo akmods --force' under 6.3.10 also fails to build for 6.4.1 now. |
Give this repo a try. I've been using it for years instead of rpmfusion: https://negativo17.org/nvidia-driver/ |
Thanks for the suggestion :-) I suspect that rpmfusion is ok on 6.3.x and I'll try staying with that for now. They had a hiccup on 6.2 -> 6.3 and maybe 6.3 -> 6.4 is similar. |
I have installed kernel-6.3.10-100.fc37 as shown at https://bodhi.fedoraproject.org/updates/?search=kernel using the one-time command line quoted in its link there. It's working fine for me with the rpmfusion 470xx nvidia driver in another 4-core x86_64 HP box. Problems are still being reported for the fc38 version, mostly Ryzen-related, and both kernels are still marked as 'testing', but this seems to me likely to be the current best near-mainstream option. |
That was yesterday, on a system untouched for several weeks. On trying to do it with the HPFed system above, dnf is showing some strange akmods/kernel related dependencies, and the intended updates-testing kernel doesn't boot. Still on 6.3.10 vanilla... |
https://bodhi.fedoraproject.org/updates/?packages=kernel [john@HPFed ~]$ uname -rsvp has now reached the fedora updates-testing repo and I have MythTV running as normal under it. Initially the nvidia driver was not built, but the build was successful after installing (again) the appropriate packages as described in the rpmfusion nvida howto. Then 'sudo akmods --force'. I usually watch build progress with atop, and this time it was different, seeing the CMD as akmods rather than the individual stages (make, cc1, depmod etc) when run automatically.. But it works, and reboots seem fine. sudo dnf upgrade --enablerepo=updates-testing --refresh --advisory=FEDORA-2023-83d5a4c7ea ## gets this kernel |
The fc38 version of 6.3.11 has been pushed to stable because it includes a security fix (StackRot (CVE-2023-3269)), although there are still problems with amd-Ryzen hardware. 6.3.11 in fc37 still looks good to me on amd-free systems. |
Apparently 6.3.11 for fc37 didn't include the StackRot fix, but 6.3.12 does. MythTV seems fine with it. |
This thread looks related, too. OpenSuSE Leap 15.4. http://lists.mythtv.org/pipermail/mythtv-users/2023-July/412183.html and a firmware download failure ( not in MythTV ) in 6.3.11-200.fc38.x86_64 |
kernel 6.4.4 is now in the Fedora 37 updates stable repo, and an updated nvidia 470xx driver from rpmfusion. MythTV, recent master, seems happy with these. Linux 6.4.4-100.fc37.x86_64 #1 SMP PREEMPT_DYNAMIC Wed Jul 19 17:06:05 UTC 2023 x86_64 |
I do not have the power to close this issue, but given the upstream kernels have been fixed, I recommend this issue be closed. |
Thanks Gary. |
Platform:el7 with elrepo mainline kernel; Fedora 37; also reported with Debian/Testing
MythTV version:32 fixes as of Jan 2023; current master
Package version: rpms built from gtb script
Component: mythbackend
What steps will reproduce the bug?
Start mythbackend from a kde konsole. as I normally do.
How often does it reproduce? Is there a required condition?
Every time, with kernal 6.3.7 and above. Works as expected with 6.3.6
What is the expected behaviour?
What do you see instead?
[root@HP_Box john]# journalctl -S -8h | grep -v dracut | grep -A 10 "task mythbackend:"
Jun 15 09:34:44 HP_Box kernel: INFO: task mythbackend:2889 blocked for more than 122 seconds.
Jun 15 09:34:44 HP_Box kernel: Tainted: G E 6.3.8-1.el7.elrepo.x86_64 #1
Jun 15 09:34:44 HP_Box kernel: "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
Jun 15 09:34:44 HP_Box kernel: task:mythbackend state:D stack:0 pid:2889 ppid:1 flags:0x00004004
Jun 15 09:34:44 HP_Box kernel: Call Trace:
Jun 15 09:34:44 HP_Box kernel:
Jun 15 09:34:44 HP_Box kernel: __schedule+0x357/0x9f0
Reported on the users list. See this thread. http://lists.mythtv.org/pipermail/mythtv-users/2023-June/412025.html
Additional information
The text was updated successfully, but these errors were encountered: