New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
4.11.4 not working on R7 370 (GCN 1.0) #8
Comments
Okay this is a severe issue, it works with my R9 380 though. But the 4.11.4+ kernel was build directly before a couple of commits at the amd-staging kernel were made. So I merged the latest status, built and uploaded again. Can you try again if this fixes the issue? |
Unfortunately that did not fix it. I had not tested the latest 4.9 previously but just did and 4.9.31+ kernel boots just fine. However, I forgot to mention previously that since 4.11.3+ Vulkan apps hang my GPU. This happens with the 4.9.31+ kernel as well. 4.11.1 worked just fine. I've attached my dmesg log that I got during a hang from Mad Max Vulkan version (which I got via SSH). |
Sorry to hear that but thanks for testing. We should avoid that the issues get too messy: Please state the precise kernel version where the problem started like: The Vulkan problem sounds like it stopped working with a different kernel version, hence it is a different reggression. We should handle this in a seperate issue. If I got this right, I would ask you to open a new issue for that problem. I will report these issues upstream later today. |
Thanks for getting back so quick. When I get the chance I'll test out 4.11.2+ and 4.11.1+ again to verify where the Vulkan issue starts and create a new issue. Unfortunately the latest Mesa from the Padoka PPA has broken radeonsi for me in general (radeonsi_dri interface seems to be having issues). I'll wait a bit for that PPA to update and if its still broken or takes a while I roll back to the stable Mesa branch and test everything out. |
New kernel, new chance? Can you test again with 4.11.5+? |
Unfortunately amdgpu is still crashing on 4.11.5+. However, I have now tested with 4.11.2+ and can confirm not only does it boot just fine but Vulkan as well as OpenGL apps are working just fine. I also tested 4.9.32+ and everything is working good there as well. It may have been a radv issue that caused issues with Vulkan previously as my Mesa drivers have been updated, though I was not expecting the userspace drivers to cause a full GPU crash. I could test the 4.9.31+ version again to confirm if Mesa was the cause if that would be useful and report a separate issue? I'll update the original report with the new info. FYI I did notice some errors in dmesg with 4.11.2 that I don't think I've seen before:
Everything was working OK though. I've not seen that warning with 4.9.32+. |
Basically there is no new code from amd-staging side in the 4.9 kernel since 4.9.28+. In this very special case it would still be interesting because in 4.9.32 something was changed in drivers/gpu/drm/amd/amdgpu/ci_dpm.c (still I would be surprised). I expect this to be a radv issue. If you can confirm that, I will only report your Xorg crash since 4.11.4+ upstream. |
Sorry for taking so long. I tested 4.9.31+ and its working fine. Seems it was something related to the Mesa/libdrm/radv changes. I've now tested 4.11.8+ and its still not working for me using the amdgpu driver. |
Thanks for your endurance and detailed testing. I reported the 4.11.X+ kernel issue upstream to the AMD graphics developers mailing list. Hopefully they find the reason for the regression. If you are working on Ubuntu you can also try the new kernel variant with Ubuntu additions. E.g. the vanilla based kernel didn't play well with the apparmor tools. |
Got feedback from the AMD developers - they would like to have a dmesg log after boot/crash. If it was just a conflict with the radeon module this should be obsolete now because I don't build the radeon module in the kernels anymore. |
Hi, first of all thank you for this work - I would really like to avoid installing the amdgpupro driver. So, I have tried your 4.9.38 and 4.11.11+ kernels on 16.04 but unfortunately with no success. Now, I am not so familiar with this kind of testing/modifications. If I needed to open a new issue for this, just let me know, but it kind of looks similar to the main issue in this thread. Visible sign of the issue: Xorg does not start up. I can open a console but not log in via the graphical user interface. Black screen with cursor in upper left hand corner for both 4.9.38 and 4.11.11+ When I start up my system with 4.9.0-040900-generic #201612111631 kernel, I have no graphics issue (and no sound). On the Xorg.0.log with the 4.11.11+ , I see the following lines with (EE):
If you need more info, please let me know what. I can do testing if you let me know what to do. EDIT: after reading some of the comments above, I also generated a 4.11.3+ kernel by going through your older commits and that worked with my 16.04. I also have a 17.04 on this machine and will try that tomorrow. |
Looks like I missed an important change that caused all the hassle. There is a new way to switch between radeon and amdgpu and it is not via blacklisting anymore. The GCN 1.0 and GCN 1.1 have to add "radeon.si_support=0 radeon.cik_support=0 amdgpu.si_support=1 amdgpu.cik_support=1" to their kernel boot line (e.g. via GRUB_CMDLINE_LINUX_DEFAULT in /etc/default/grub). Updated the Readme for this additional "feature". I assume this should fix this issue and close it. If there are still problems please open a fresh issue with Xorg and dmesg logs. |
Previous kernel versions up to and including 4.11.3+ would all boot into Xorg just fine but since 4.11.4+ sddm keeps crashing and Xorg log shows issues with loading the amdgpu module. Booting the same kernel using the "radeon" kernel module works just fine. All 4.9 series are working fine.
I've attached my Xorg log. Let me know if I can provide any other debugging information.
Xorg-amdgpu.0.txt
For clarity:
GPU: AMD Radeon R7 370 (XFX Xtreme Black Edition OC)
Good kernels:
Tested broken kernels:
The text was updated successfully, but these errors were encountered: