Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[GVT-d][KBL-NUC]System print call trace"drm_mode_config_cleanup" and "kernel NULL pointer" after run "echo 0000:00:02.0 > /sys/bus/pci/devices/0000:00:02.0/driver/unbind" #502

Closed
TerrenceXu opened this issue Mar 4, 2019 · 42 comments
Assignees
Labels

Comments

@TerrenceXu
Copy link

TerrenceXu commented Mar 4, 2019

Please omit irrelevant data and trim the bug report template to only those parts that make sense!

Describe the bug
System print call trace"drm_mode_config_cleanup" and "kernel NULL pointer" after try to unbind Intel Graphic card by "echo 0000:00:02.0 > /sys/bus/pci/devices/0000:00:02.0/driver/unbind".

To Reproduce
Steps to reproduce the behavior:

  1. Boot up host with "intel_iommu=on" ;
  2. Run "modprobe vfio" & "modprobe vfio_pci";
  3. Run "echo 0000:00:02.0 > /sys/bus/pci/devices/0000:00:02.0/driver/unbind".

Expected behavior
Unbind "00:02.0" well, GVT-d can work.

Screenshots
N/A

Environment (please complete the following information):

  • Clear Linux OS version: 27600
  • Bundles: It including all needed package for GVT.

Additional context

[ 33.379322] initcall crypto_cmac_module_init+0x0/0x1000 [cmac] returned 0 after 1 usecs
[ 33.580884] IPv6: ADDRCONF(NETDEV_UP): wlp58s0: link is not ready
[ 91.660085] calling vfio_virqfd_init+0x0/0x1000 [vfio_virqfd] @ 1058
[ 91.660160] initcall vfio_virqfd_init+0x0/0x1000 [vfio_virqfd] returned 0 after 69 usecs
[ 91.661516] calling vfio_pci_init+0x0/0x15b [vfio_pci] @ 1058
[ 91.661536] initcall vfio_pci_init+0x0/0x15b [vfio_pci] returned 0 after 16 usecs
[ 121.896096] Console: switching to colour dummy device 80x25
[ 121.963772] WARNING: CPU: 1 PID: 1033 at drivers/gpu/drm/drm_mode_config.c:443 drm_mode_config_cleanup+0x287/0x2e0
[ 121.963774] Modules linked in: vfio_pci vfio_virqfd cmac rfcomm bnep fuse snd_soc_skl snd_soc_hdac_hda snd_hda_ext_core snd_soc_skl_ipc snd_soc_sst_ipc snd_soc_sst_dsp snd_soc_acpi_intel_match snd_soc_acpi snd_soc_core snd_compress iwlmvm snd_pcm_dmaengine mac80211 ac97_bus snd_hda_codec_hdmi iwlwifi snd_hda_codec_realtek rtsx_pci_ms snd_hda_codec_generic btusb btrtl btbcm btintel bluetooth snd_hda_intel ecdh_generic rtsx_pci_sdmmc snd_hda_codec memstick cfg80211 snd_hda_core rfkill intel_wmi_thunderbolt rtsx_pci wmi_bmof snd_hwdep wmi snd_pcm e1000e mei_me snd soundcore mei i2c_i801 pcc_cpufreq intel_pch_thermal
[ 121.963793] CPU: 1 PID: 1033 Comm: bash Not tainted 4.20.6-690.native #1
[ 121.963794] Hardware name: Intel Corporation NUC7i7BNH/NUC7i7BNB, BIOS BNKBL357.86A.0050.2017.0816.2002 08/16/2017
[ 121.963795] RIP: 0010:drm_mode_config_cleanup+0x287/0x2e0
[ 121.963797] Code: 02 00 00 e8 cb 77 44 00 49 8b 04 24 49 39 c4 75 5f 48 83 c4 38 31 c0 5b 41 5c 41 5d 41 5e 41 5f 5d 89 c2 89 c1 89 c6 89 c7 c3 <0f> 0b 48 8d 75 a8 4c 89 e7 48 c7 c3 af ca 62 b7 e8 a4 85 ff ff eb
[ 121.963797] RSP: 0018:ffff9bd30465bc58 EFLAGS: 00010297
[ 121.963798] RAX: ffff897f19ae6018 RBX: ffff897f196c8368 RCX: 0000000000000000
[ 121.963799] RDX: ffff897f196c8330 RSI: 0000000000000000 RDI: 0000000000000000
[ 121.963800] RBP: ffff9bd30465bcb8 R08: 0000000000000000 R09: 0000000000000000
[ 121.963800] R10: 0000000000000000 R11: 0000000000000000 R12: ffff897f196c8000
[ 121.963801] R13: ffff897f196c8370 R14: ffffffffb7916890 R15: 0000000000000060
[ 121.963802] FS: 00007f1ae1a10740(0000) GS:ffff897f1ea80000(0000) knlGS:0000000000000000
[ 121.963803] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[ 121.963804] CR2: 0000563a9d57f5d8 CR3: 0000000875896005 CR4: 00000000003606e0
[ 121.963804] Call Trace:
[ 121.963807] ? mutex_lock+0xd/0x30
[ 121.963810] intel_modeset_cleanup+0xb2/0x120
[ 121.963812] i915_driver_unload+0x9f/0x150
[ 121.963814] i915_pci_remove+0x1e/0x40
[ 121.963816] pci_device_remove+0x39/0xc0
[ 121.963818] device_release_driver_internal+0x1b5/0x270
[ 121.963819] device_release_driver+0xd/0x20
[ 121.963821] unbind_store+0xb8/0x160
[ 121.963822] drv_attr_store+0x22/0x50
[ 121.963824] sysfs_kf_write+0x36/0x60
[ 121.963825] kernfs_fop_write+0x131/0x1b0
[ 121.963827] __vfs_write+0x26/0x170
[ 121.963829] ? _cond_resched+0x14/0x30
[ 121.963830] vfs_write+0xaa/0x1b0
[ 121.963831] ksys_write+0x41/0xa0
[ 121.963832] __x64_sys_write+0x15/0x20
[ 121.963834] do_syscall_64+0x55/0x1b0
[ 121.963835] entry_SYSCALL_64_after_hwframe+0x44/0xa9
[ 121.963836] RIP: 0033:0x7f1ae1b2030b
[ 121.963838] Code: 66 2e 0f 1f 84 00 00 00 00 00 0f 1f 44 00 00 f3 0f 1e fa 48 8d 05 f5 33 0e 00 8b 00 85 c0 75 17 b8 01 00 00 00 c5 fc 77 0f 05 <48> 3d 00 f0 ff ff 77 55 c3 0f 1f 40 00 41 54 49 89 d4 55 48 89 f5
[ 121.963838] RSP: 002b:00007ffd8468e298 EFLAGS: 00000246 ORIG_RAX: 0000000000000001
[ 121.963839] RAX: ffffffffffffffda RBX: 000000000000000d RCX: 00007f1ae1b2030b
[ 121.963840] RDX: 000000000000000d RSI: 00000000014cb140 RDI: 0000000000000001
[ 121.963841] RBP: 00000000014cb140 R08: 000000000000000a R09: 00007f1ae1a10740
[ 121.963841] R10: 00000000004c5517 R11: 0000000000000246 R12: 00007f1ae1bff760
[ 121.963842] R13: 000000000000000d R14: 00007f1ae1bfb340 R15: 00007f1ae1bfa740
[ 121.963843] --[ end trace 5678d77f99e40df9 ]--
[ 121.963845] [drm:drm_mode_config_cleanup] ERROR connector DP-1 leaked!
[ 121.964334] WARNING: CPU: 1 PID: 1033 at drivers/gpu/drm/drm_mode_config.c:477 drm_mode_config_cleanup+0x2c6/0x2e0
[ 121.964334] Modules linked in: vfio_pci vfio_virqfd cmac rfcomm bnep fuse snd_soc_skl snd_soc_hdac_hda snd_hda_ext_core snd_soc_skl_ipc snd_soc_sst_ipc snd_soc_sst_dsp snd_soc_acpi_intel_match snd_soc_acpi snd_soc_core snd_compress iwlmvm snd_pcm_dmaengine mac80211 ac97_bus snd_hda_codec_hdmi iwlwifi snd_hda_codec_realtek rtsx_pci_ms snd_hda_codec_generic btusb btrtl btbcm btintel bluetooth snd_hda_intel ecdh_generic rtsx_pci_sdmmc snd_hda_codec memstick cfg80211 snd_hda_core rfkill intel_wmi_thunderbolt rtsx_pci wmi_bmof snd_hwdep wmi snd_pcm e1000e mei_me snd soundcore mei i2c_i801 pcc_cpufreq intel_pch_thermal
[ 121.964347] CPU: 1 PID: 1033 Comm: bash Tainted: G W 4.20.6-690.native #1
[ 121.964347] Hardware name: Intel Corporation NUC7i7BNH/NUC7i7BNB, BIOS BNKBL357.86A.0050.2017.0816.2002 08/16/2017
[ 121.964348] RIP: 0010:drm_mode_config_cleanup+0x2c6/0x2e0
[ 121.964349] Code: eb 0c 48 8b 70 48 48 89 df e8 66 f4 ff ff 48 8d 7d a8 e8 8d 8b ff ff 48 85 c0 75 e6 48 8d 7d a8 e8 1f 8c ff ff e9 e1 fd ff ff <0f> 0b e9 e9 fe ff ff 0f 0b eb 9d 66 66 2e 0f 1f 84 00 00 00 00 00
[ 121.964350] RSP: 0018:ffff9bd30465bc58 EFLAGS: 00010206
[ 121.964351] RAX: ffff897f1622fc08 RBX: ffff897f196c8438 RCX: ffff897f196c8308
[ 121.964352] RDX: ffff897f196c8460 RSI: 0000000000000000 RDI: ffff897f196c8438
[ 121.964352] RBP: ffff9bd30465bcb8 R08: 0000000000000000 R09: 0000000000000000
[ 121.964353] R10: 0000000000000000 R11: 0000000000000000 R12: ffff897f196c8000
[ 121.964353] R13: ffff897f196c8460 R14: ffffffffb7916890 R15: 0000000000000060
[ 121.964355] FS: 00007f1ae1a10740(0000) GS:ffff897f1ea80000(0000) knlGS:0000000000000000
[ 121.964355] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[ 121.964356] CR2: 0000563a9d57f5d8 CR3: 0000000875896005 CR4: 00000000003606e0
[ 121.964356] Call Trace:
[ 121.964358] ? mutex_lock+0xd/0x30
[ 121.964360] intel_modeset_cleanup+0xb2/0x120
[ 121.964361] i915_driver_unload+0x9f/0x150
[ 121.964362] i915_pci_remove+0x1e/0x40
[ 121.964364] pci_device_remove+0x39/0xc0
[ 121.964365] device_release_driver_internal+0x1b5/0x270
[ 121.964366] device_release_driver+0xd/0x20
[ 121.964367] unbind_store+0xb8/0x160
[ 121.964369] drv_attr_store+0x22/0x50
[ 121.964370] sysfs_kf_write+0x36/0x60
[ 121.964371] kernfs_fop_write+0x131/0x1b0
[ 121.964372] __vfs_write+0x26/0x170
[ 121.964374] ? _cond_resched+0x14/0x30
[ 121.964375] vfs_write+0xaa/0x1b0
[ 121.964376] ksys_write+0x41/0xa0
[ 121.964377] __x64_sys_write+0x15/0x20
[ 121.964378] do_syscall_64+0x55/0x1b0
[ 121.964380] entry_SYSCALL_64_after_hwframe+0x44/0xa9
[ 121.964380] RIP: 0033:0x7f1ae1b2030b
[ 121.964381] Code: 66 2e 0f 1f 84 00 00 00 00 00 0f 1f 44 00 00 f3 0f 1e fa 48 8d 05 f5 33 0e 00 8b 00 85 c0 75 17 b8 01 00 00 00 c5 fc 77 0f 05 <48> 3d 00 f0 ff ff 77 55 c3 0f 1f 40 00 41 54 49 89 d4 55 48 89 f5
[ 121.964382] RSP: 002b:00007ffd8468e298 EFLAGS: 00000246 ORIG_RAX: 0000000000000001
[ 121.964383] RAX: ffffffffffffffda RBX: 000000000000000d RCX: 00007f1ae1b2030b
[ 121.964383] RDX: 000000000000000d RSI: 00000000014cb140 RDI: 0000000000000001
[ 121.964384] RBP: 00000000014cb140 R08: 000000000000000a R09: 00007f1ae1a10740
[ 121.964385] R10: 00000000004c5517 R11: 0000000000000246 R12: 00007f1ae1bff760
[ 121.964385] R13: 000000000000000d R14: 00007f1ae1bfb340 R15: 00007f1ae1bfa740
[ 121.964386] --[ end trace 5678d77f99e40dfa ]--
[ 121.964388] [leaked fb] framebuffer[91]:
[ 121.964389] [leaked fb] allocated by = X
[ 121.964389] [leaked fb] refcount=1
[ 121.964391] [leaked fb] format=XR24 little-endian (0x34325258)
[ 121.964392] [leaked fb] modifier=0x100000000000001
[ 121.964393] [leaked fb] size=1360x768
[ 121.964393] [leaked fb] layers:
[ 121.964394] [leaked fb] size[0]=1360x768
[ 121.964395] [leaked fb] pitch[0]=5632
[ 121.964396] [leaked fb] offset[0]=0
[ 121.964397] [leaked fb] obj[0]:
[ 121.964398] [leaked fb] name=0
[ 121.964398] [leaked fb] refcount=4
[ 121.964399] [leaked fb] start=00000000
[ 121.964400] [leaked fb] size=5242880
[ 121.964401] [leaked fb] imported=no
[ 121.964410] WARNING: CPU: 1 PID: 1033 at include/drm/drm_modeset_lock.h:105 drm_mode_config_cleanup+0x2cd/0x2e0
[ 121.964410] Modules linked in: vfio_pci vfio_virqfd cmac rfcomm bnep fuse snd_soc_skl snd_soc_hdac_hda snd_hda_ext_core snd_soc_skl_ipc snd_soc_sst_ipc snd_soc_sst_dsp snd_soc_acpi_intel_match snd_soc_acpi snd_soc_core snd_compress iwlmvm snd_pcm_dmaengine mac80211 ac97_bus snd_hda_codec_hdmi iwlwifi snd_hda_codec_realtek rtsx_pci_ms snd_hda_codec_generic btusb btrtl btbcm btintel bluetooth snd_hda_intel ecdh_generic rtsx_pci_sdmmc snd_hda_codec memstick cfg80211 snd_hda_core rfkill intel_wmi_thunderbolt rtsx_pci wmi_bmof snd_hwdep wmi snd_pcm e1000e mei_me snd soundcore mei i2c_i801 pcc_cpufreq intel_pch_thermal
[ 121.964422] CPU: 1 PID: 1033 Comm: bash Tainted: G W 4.20.6-690.native #1
[ 121.964422] Hardware name: Intel Corporation NUC7i7BNH/NUC7i7BNB, BIOS BNKBL357.86A.0050.2017.0816.2002 08/16/2017
[ 121.964424] RIP: 0010:drm_mode_config_cleanup+0x2cd/0x2e0
[ 121.964424] Code: 89 df e8 66 f4 ff ff 48 8d 7d a8 e8 8d 8b ff ff 48 85 c0 75 e6 48 8d 7d a8 e8 1f 8c ff ff e9 e1 fd ff ff 0f 0b e9 e9 fe ff ff <0f> 0b eb 9d 66 66 2e 0f 1f 84 00 00 00 00 00 0f 1f 40 00 55 48 89
[ 121.964425] RSP: 0018:ffff9bd30465bc58 EFLAGS: 00010287
[ 121.964426] RAX: ffff9bd303c67c18 RBX: ffff897f196c8300 RCX: 0000000000000000
[ 121.964426] RDX: 0000000000000000 RSI: 0000000000000000 RDI: 0000000000000000
[ 121.964427] RBP: ffff9bd30465bcb8 R08: 0000000000000000 R09: 0000000000000000
[ 121.964428] R10: 0000000000000000 R11: 0000000000000000 R12: ffff897f196c8278
[ 121.964428] R13: ffff897f196c8300 R14: ffffffffb762cac5 R15: ffffffffb68bc560
[ 121.964429] FS: 00007f1ae1a10740(0000) GS:ffff897f1ea80000(0000) knlGS:0000000000000000
[ 121.964430] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[ 121.964431] CR2: 0000563a9d57f5d8 CR3: 0000000875896005 CR4: 00000000003606e0
[ 121.964431] Call Trace:
[ 121.964432] ? __drm_printfn_info+0x20/0x20
[ 121.964434] intel_modeset_cleanup+0xb2/0x120
[ 121.964435] i915_driver_unload+0x9f/0x150
[ 121.964437] i915_pci_remove+0x1e/0x40
[ 121.964438] pci_device_remove+0x39/0xc0
[ 121.964439] device_release_driver_internal+0x1b5/0x270
[ 121.964440] device_release_driver+0xd/0x20
[ 121.964442] unbind_store+0xb8/0x160
[ 121.964443] drv_attr_store+0x22/0x50
[ 121.964444] sysfs_kf_write+0x36/0x60
[ 121.964445] kernfs_fop_write+0x131/0x1b0
[ 121.964446] __vfs_write+0x26/0x170
[ 121.964448] ? _cond_resched+0x14/0x30
[ 121.964449] vfs_write+0xaa/0x1b0
[ 121.964450] ksys_write+0x41/0xa0
[ 121.964451] __x64_sys_write+0x15/0x20
[ 121.964452] do_syscall_64+0x55/0x1b0
[ 121.964454] entry_SYSCALL_64_after_hwframe+0x44/0xa9
[ 121.964454] RIP: 0033:0x7f1ae1b2030b
[ 121.964455] Code: 66 2e 0f 1f 84 00 00 00 00 00 0f 1f 44 00 00 f3 0f 1e fa 48 8d 05 f5 33 0e 00 8b 00 85 c0 75 17 b8 01 00 00 00 c5 fc 77 0f 05 <48> 3d 00 f0 ff ff 77 55 c3 0f 1f 40 00 41 54 49 89 d4 55 48 89 f5
[ 121.964456] RSP: 002b:00007ffd8468e298 EFLAGS: 00000246 ORIG_RAX: 0000000000000001
[ 121.964457] RAX: ffffffffffffffda RBX: 000000000000000d RCX: 00007f1ae1b2030b
[ 121.964457] RDX: 000000000000000d RSI: 00000000014cb140 RDI: 0000000000000001
[ 121.964458] RBP: 00000000014cb140 R08: 000000000000000a R09: 00007f1ae1a10740
[ 121.964459] R10: 00000000004c5517 R11: 0000000000000246 R12: 00007f1ae1bff760
[ 121.964459] R13: 000000000000000d R14: 00007f1ae1bfb340 R15: 00007f1ae1bfa740
[ 121.964460] --[ end trace 5678d77f99e40dfb ]--
[ 121.969743] BUG: unable to handle kernel paging request at 0000000000005000
[ 121.970236] PGD 800000088eafe067 P4D 800000088eafe067 PUD 88ea10067 PMD 0 
[ 121.970705] Oops: 0002 1 SMP PTI
[ 121.970968] CPU: 1 PID: 550 Comm: X Tainted: G W 4.20.6-690.native #1
[ 121.971493] Hardware name: Intel Corporation NUC7i7BNH/NUC7i7BNB, BIOS BNKBL357.86A.0050.2017.0816.2002 08/16/2017
[ 121.972195] RIP: 0010:mutex_unlock+0x12/0x30
[ 121.972489] Code: 70 10 e8 11 b8 43 ff eb ab 66 66 2e 0f 1f 84 00 00 00 00 00 0f 1f 40 00 55 31 c9 65 48 8b 14 25 c0 4d 01 00 48 89 e5 48 89 d0 <f0> 48 0f b1 0f 48 39 c2 74 05 e8 df fe ff ff 5d 31 c0 89 c2 89 c1
[ 121.973808] RSP: 0018:ffff9bd303c67920 EFLAGS: 00010246
[ 121.974167] RAX: ffff897f139b4040 RBX: 0000000000000000 RCX: 0000000000000000
[ 121.974651] RDX: ffff897f139b4040 RSI: 0000000000000000 RDI: 0000000000005000
[ 121.975155] RBP: ffff9bd303c67920 R08: 0000000000000000 R09: 0000000000000000
[ 121.975639] R10: 0000000000000000 R11: 0000000000000000 R12: 0000000000000000
[ 121.976140] R13: 0000000000000002 R14: ffff897f1a43e108 R15: ffff897f196c8000
[ 121.976661] FS: 00007fe2d42a7dc0(0000) GS:ffff897f1ea80000(0000) knlGS:0000000000000000
[ 121.977211] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[ 121.977606] CR2: 0000000000005000 CR3: 0000000891536001 CR4: 00000000003606e0
[ 121.978159] Call Trace:
[ 121.978339] intel_dp_aux_xfer+0x186/0x660
[ 121.978626] ? wait_woken+0x80/0x80
[ 121.978874] intel_dp_aux_transfer+0xad/0x250
[ 121.979179] ? __switch_to_asm+0x40/0x70
[ 121.979455] drm_dp_dpcd_access+0x62/0xf0
[ 121.979736] drm_dp_dpcd_read+0x2e/0xd0
[ 121.980008] lspcon_wake_native_aux_ch+0x1f/0x80
[ 121.980335] ? intel_digital_port_connected+0x18e/0x630
[ 121.980728] lspcon_resume+0x13/0xf0
[ 121.980995] intel_dp_detect+0x5b3/0x610
[ 121.981268] drm_helper_probe_detect+0x4b/0x90
[ 121.981575] drm_helper_probe_single_connector_modes+0xc6/0x6d0
[ 121.982019] drm_mode_getconnector+0x4c5/0x4f3
[ 121.982326] ? drm_connector_property_set_ioctl+0x30/0x30
[ 121.982696] drm_ioctl_kernel+0x93/0xd0
[ 121.982985] drm_ioctl+0x379/0x440
[ 121.983223] ? drm_connector_property_set_ioctl+0x30/0x30
[ 121.983594] ? _crng_backtrack_protect+0x51/0x70
[ 121.983930] do_vfs_ioctl+0x94/0x5e0
[ 121.984180] ? __sys_recvmsg+0x70/0x90
[ 121.984441] ksys_ioctl+0x6a/0x90
[ 121.984672] __x64_sys_ioctl+0x15/0x20
[ 121.984969] do_syscall_64+0x55/0x1b0
[ 121.985224] entry_SYSCALL_64_after_hwframe+0x44/0xa9
[ 121.985571] RIP: 0033:0x7fe2d4c587bb
[ 121.985820] Code: 0f 1e fa 48 8b 05 c5 76 0d 00 64 c7 00 26 00 00 00 48 c7 c0 ff ff ff ff c3 66 0f 1f 44 00 00 f3 0f 1e fa b8 10 00 00 00 0f 05 <48> 3d 01 f0 ff ff 73 01 c3 48 8b 0d 95 76 0d 00 f7 d8 64 89 01 48
[ 121.987079] RSP: 002b:00007fff744de6b8 EFLAGS: 00003246 ORIG_RAX: 0000000000000010
[ 121.987593] RAX: ffffffffffffffda RBX: 000000000258d840 RCX: 00007fe2d4c587bb
[ 121.988114] RDX: 00007fff744de750 RSI: 00000000c05064a7 RDI: 000000000000000d
[ 121.988599] RBP: 00007fff744de6e0 R08: 0000000002eac5b0 R09: 0000000002eac560
[ 121.989121] R10: 0000000000000001 R11: 0000000000003246 R12: 00007fff744de750
[ 121.989606] R13: 00000000c05064a7 R14: 000000000000000d R15: 00007fff744de750
[ 121.990090] Modules linked in: vfio_pci vfio_virqfd cmac rfcomm bnep fuse snd_soc_skl snd_soc_hdac_hda snd_hda_ext_core snd_soc_skl_ipc snd_soc_sst_ipc snd_soc_sst_dsp snd_soc_acpi_intel_match snd_soc_acpi snd_soc_core snd_compress iwlmvm snd_pcm_dmaengine mac80211 ac97_bus snd_hda_codec_hdmi iwlwifi snd_hda_codec_realtek rtsx_pci_ms snd_hda_codec_generic btusb btrtl btbcm btintel bluetooth snd_hda_intel ecdh_generic rtsx_pci_sdmmc snd_hda_codec memstick cfg80211 snd_hda_core rfkill intel_wmi_thunderbolt rtsx_pci wmi_bmof snd_hwdep wmi snd_pcm e1000e mei_me snd soundcore mei i2c_i801 pcc_cpufreq intel_pch_thermal
[ 121.993898] CR2: 0000000000005000
[ 121.994132] --[ end trace 5678d77f99e40dfc ]--
[ 122.044774] -----------[ cut here ]-----------
[ 122.045096] WARN_ON(!list_empty(&dev_priv->contexts.list))
[ 122.045106] WARNING: CPU: 1 PID: 1033 at drivers/gpu/drm/i915/i915_gem.c:5737 i915_gem_fini+0x14d/0x160
[ 122.046123] Modules linked in: vfio_pci vfio_virqfd cmac rfcomm bnep fuse snd_soc_skl snd_soc_hdac_hda snd_hda_ext_core snd_soc_skl_ipc snd_soc_sst_ipc snd_soc_sst_dsp snd_soc_acpi_intel_match snd_soc_acpi snd_soc_core snd_compress iwlmvm snd_pcm_dmaengine mac80211 ac97_bus snd_hda_codec_hdmi iwlwifi snd_hda_codec_realtek rtsx_pci_ms snd_hda_codec_generic btusb btrtl btbcm btintel bluetooth snd_hda_intel ecdh_generic rtsx_pci_sdmmc snd_hda_codec memstick cfg80211 snd_hda_core rfkill intel_wmi_thunderbolt rtsx_pci wmi_bmof snd_hwdep wmi snd_pcm e1000e mei_me snd soundcore mei i2c_i801 pcc_cpufreq intel_pch_thermal
[ 122.049836] CPU: 1 PID: 1033 Comm: bash Tainted: G D W 4.20.6-690.native #1
[ 122.050387] Hardware name: Intel Corporation NUC7i7BNH/NUC7i7BNB, BIOS BNKBL357.86A.0050.2017.0816.2002 08/16/2017
[ 122.051130] RIP: 0010:i915_gem_fini+0x14d/0x160
[ 122.051444] Code: 6d 00 00 49 39 c4 75 0f 5b 31 c0 41 5c 41 5d 41 5e 5d 89 c6 89 c7 c3 48 c7 c6 58 a0 63 b7 48 c7 c7 91 c4 60 b7 e8 b3 ad 80 ff <0f> 0b eb da 66 66 2e 0f 1f 84 00 00 00 00 00 0f 1f 40 00 55 48 89
[ 122.052732] RSP: 0018:ffff9bd30465bcc8 EFLAGS: 00010292
[ 122.053114] RAX: 0000000000000000 RBX: ffff897f196cdc90 RCX: 0000000000000000
[ 122.053600] RDX: 0000000000000000 RSI: 0000000000000000 RDI: 0000000000000000
[ 122.054127] RBP: ffff9bd30465bce8 R08: 0000000000000000 R09: 0000000000000000
[ 122.054613] R10: 0000000000000000 R11: 0000000000000000 R12: ffff897f196ced18
[ 122.055139] R13: ffff897f196cc858 R14: ffff897f196c8068 R15: 0000000000000060
[ 122.055627] FS: 00007f1ae1a10740(0000) GS:ffff897f1ea80000(0000) knlGS:0000000000000000
[ 122.056201] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[ 122.056596] CR2: 0000563a9d4fe870 CR3: 0000000875896006 CR4: 00000000003606e0
[ 122.057101] Call Trace:
[ 122.057277] i915_driver_unload+0xd9/0x150
[ 122.057563] i915_pci_remove+0x1e/0x40
[ 122.057863] pci_device_remove+0x39/0xc0
[ 122.058138] device_release_driver_internal+0x1b5/0x270
[ 122.058501] device_release_driver+0xd/0x20
[ 122.058830] unbind_store+0xb8/0x160
[ 122.059080] drv_attr_store+0x22/0x50
[ 122.059337] sysfs_kf_write+0x36/0x60
[ 122.059593] kernfs_fop_write+0x131/0x1b0
[ 122.059893] __vfs_write+0x26/0x170
[ 122.060140] ? _cond_resched+0x14/0x30
[ 122.060402] vfs_write+0xaa/0x1b0
[ 122.060636] ksys_write+0x41/0xa0
[ 122.060888] __x64_sys_write+0x15/0x20
[ 122.061151] do_syscall_64+0x55/0x1b0
[ 122.061407] entry_SYSCALL_64_after_hwframe+0x44/0xa9
[ 122.061756] RIP: 0033:0x7f1ae1b2030b
[ 122.062006] Code: 66 2e 0f 1f 84 00 00 00 00 00 0f 1f 44 00 00 f3 0f 1e fa 48 8d 05 f5 33 0e 00 8b 00 85 c0 75 17 b8 01 00 00 00 c5 fc 77 0f 05 <48> 3d 00 f0 ff ff 77 55 c3 0f 1f 40 00 41 54 49 89 d4 55 48 89 f5
[ 122.063273] RSP: 002b:00007ffd8468e298 EFLAGS: 00000246 ORIG_RAX: 0000000000000001
[ 122.063790] RAX: ffffffffffffffda RBX: 000000000000000d RCX: 00007f1ae1b2030b
[ 122.064277] RDX: 000000000000000d RSI: 00000000014cb140 RDI: 0000000000000001
[ 122.064786] RBP: 00000000014cb140 R08: 000000000000000a R09: 00007f1ae1a10740
[ 122.065290] R10: 00000000004c5517 R11: 0000000000000246 R12: 00007f1ae1bff760
[ 122.065777] R13: 000000000000000d R14: 00007f1ae1bfb340 R15: 00007f1ae1bfa740
[ 122.066267] --[ end trace 5678d77f99e40dfd ]--
[ 122.066860] -----------[ cut here ]-----------
[ 122.067185] Memory manager not clean during takedown.
[ 122.067550] WARNING: CPU: 1 PID: 1033 at drivers/gpu/drm/drm_mm.c:950 drm_mm_takedown+0x22/0x30
[ 122.068149] Modules linked in: vfio_pci vfio_virqfd cmac rfcomm bnep fuse snd_soc_skl snd_soc_hdac_hda snd_hda_ext_core snd_soc_skl_ipc snd_soc_sst_ipc snd_soc_sst_dsp snd_soc_acpi_intel_match snd_soc_acpi snd_soc_core snd_compress iwlmvm snd_pcm_dmaengine mac80211 ac97_bus snd_hda_codec_hdmi iwlwifi snd_hda_codec_realtek rtsx_pci_ms snd_hda_codec_generic btusb btrtl btbcm btintel bluetooth snd_hda_intel ecdh_generic rtsx_pci_sdmmc snd_hda_codec memstick cfg80211 snd_hda_core rfkill intel_wmi_thunderbolt rtsx_pci wmi_bmof snd_hwdep wmi snd_pcm e1000e mei_me snd soundcore mei i2c_i801 pcc_cpufreq intel_pch_thermal
[ 122.071803] CPU: 1 PID: 1033 Comm: bash Tainted: G D W 4.20.6-690.native #1
[ 122.072355] Hardware name: Intel Corporation NUC7i7BNH/NUC7i7BNB, BIOS BNKBL357.86A.0050.2017.0816.2002 08/16/2017
[ 122.073060] RIP: 0010:drm_mm_takedown+0x22/0x30
[ 122.073378] Code: c0 e9 32 fd ff ff 66 90 48 8b 47 38 48 83 c7 38 48 39 c7 75 05 31 c0 89 c7 c3 55 48 c7 c7 88 a8 62 b7 48 89 e5 e8 9e 72 88 ff <0f> 0b 5d 31 c0 89 c7 c3 66 0f 1f 44 00 00 55 48 89 e5 41 57 41 56
[ 122.074642] RSP: 0018:ffff9bd30465bc80 EFLAGS: 00010282
[ 122.075005] RAX: 0000000000000000 RBX: ffff897f196cd7a8 RCX: 0000000000000000
[ 122.075491] RDX: 0000000000000000 RSI: 0000000000000000 RDI: 0000000000000000
[ 122.075997] RBP: ffff9bd30465bc80 R08: 0000000000000000 R09: 0000000000000000
[ 122.076485] R10: 0000000000000000 R11: 0000000000000000 R12: ffff897f196c8000
[ 122.077010] R13: ffff897f196cd920 R14: ffff897f196c8068 R15: ffffffffb76395d5
[ 122.077497] FS: 00007f1ae1a10740(0000) GS:ffff897f1ea80000(0000) knlGS:0000000000000000
[ 122.078088] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[ 122.078485] CR2: 0000563a9d4fe870 CR3: 0000000875896006 CR4: 00000000003606e0
[ 122.078992] Call Trace:
[ 122.079168] i915_gem_cleanup_stolen+0x1a/0x30
[ 122.079477] i915_ggtt_cleanup_hw+0xfc/0x160
[ 122.079777] i915_driver_cleanup_hw+0x35/0x50
[ 122.080079] i915_driver_unload+0xf1/0x150
[ 122.080365] i915_pci_remove+0x1e/0x40
[ 122.080628] pci_device_remove+0x39/0xc0
[ 122.080903] device_release_driver_internal+0x1b5/0x270
[ 122.081263] device_release_driver+0xd/0x20
[ 122.081555] unbind_store+0xb8/0x160
[ 122.081806] drv_attr_store+0x22/0x50
[ 122.082063] sysfs_kf_write+0x36/0x60
[ 122.082321] kernfs_fop_write+0x131/0x1b0
[ 122.082602] __vfs_write+0x26/0x170
[ 122.082866] ? _cond_resched+0x14/0x30
[ 122.083127] vfs_write+0xaa/0x1b0
[ 122.083361] ksys_write+0x41/0xa0
[ 122.083594] __x64_sys_write+0x15/0x20
[ 122.083894] do_syscall_64+0x55/0x1b0
[ 122.084151] entry_SYSCALL_64_after_hwframe+0x44/0xa9
[ 122.084499] RIP: 0033:0x7f1ae1b2030b
[ 122.084770] Code: 66 2e 0f 1f 84 00 00 00 00 00 0f 1f 44 00 00 f3 0f 1e fa 48 8d 05 f5 33 0e 00 8b 00 85 c0 75 17 b8 01 00 00 00 c5 fc 77 0f 05 <48> 3d 00 f0 ff ff 77 55 c3 0f 1f 40 00 41 54 49 89 d4 55 48 89 f5
[ 122.086036] RSP: 002b:00007ffd8468e298 EFLAGS: 00000246 ORIG_RAX: 0000000000000001
[ 122.086553] RAX: ffffffffffffffda RBX: 000000000000000d RCX: 00007f1ae1b2030b
[ 122.087042] RDX: 000000000000000d RSI: 00000000014cb140 RDI: 0000000000000001
[ 122.087530] RBP: 00000000014cb140 R08: 000000000000000a R09: 00007f1ae1a10740
[ 122.088017] R10: 00000000004c5517 R11: 0000000000000246 R12: 00007f1ae1bff760
[ 122.088509] R13: 000000000000000d R14: 00007f1ae1bfb340 R15: 00007f1ae1bfa740
[ 122.089015] --[ end trace 5678d77f99e40dfe ]--
[ 122.089396] -----------[ cut here ]-----------
[ 122.089904] WARN_ON(atomic_read(&dev_priv->runtime_pm.wakeref_count))
[ 122.089916] WARNING: CPU: 1 PID: 1033 at drivers/gpu/drm/i915/i915_drv.c:1795 i915_driver_unload+0x13d/0x150
[ 122.091083] Modules linked in: vfio_pci vfio_virqfd cmac rfcomm bnep fuse snd_soc_skl snd_soc_hdac_hda snd_hda_ext_core snd_soc_skl_ipc snd_soc_sst_ipc snd_soc_sst_dsp snd_soc_acpi_intel_match snd_soc_acpi snd_soc_core snd_compress iwlmvm snd_pcm_dmaengine mac80211 ac97_bus snd_hda_codec_hdmi iwlwifi snd_hda_codec_realtek rtsx_pci_ms snd_hda_codec_generic btusb btrtl btbcm btintel bluetooth snd_hda_intel ecdh_generic rtsx_pci_sdmmc snd_hda_codec memstick cfg80211 snd_hda_core rfkill intel_wmi_thunderbolt rtsx_pci wmi_bmof snd_hwdep wmi snd_pcm e1000e mei_me snd soundcore mei i2c_i801 pcc_cpufreq intel_pch_thermal
[ 122.094856] CPU: 1 PID: 1033 Comm: bash Tainted: G D W 4.20.6-690.native #1
[ 122.095407] Hardware name: Intel Corporation NUC7i7BNH/NUC7i7BNB, BIOS BNKBL357.86A.0050.2017.0816.2002 08/16/2017
[ 122.096112] RIP: 0010:i915_driver_unload+0x13d/0x150
[ 122.096455] Code: c1 89 c6 89 c7 c3 48 c7 c7 e8 06 63 b7 e8 1b 32 fe ff e9 5d ff ff ff 48 c7 c6 d0 07 63 b7 48 c7 c7 91 c4 60 b7 e8 13 47 85 ff <0f> 0b eb c9 66 66 2e 0f 1f 84 00 00 00 00 00 0f 1f 40 00 55 48 89
[ 122.097723] RSP: 0018:ffff9bd30465bcf8 EFLAGS: 00010292
[ 122.098121] RAX: 0000000000000000 RBX: ffff897f196c8000 RCX: 0000000000000000
[ 122.098610] RDX: 0000000000000000 RSI: 0000000000000000 RDI: 0000000000000000
[ 122.099138] RBP: ffff9bd30465bd08 R08: 0000000000000000 R09: 0000000000000000
[ 122.099626] R10: 0000000000000000 R11: 0000000000000000 R12: ffff897f1abfc000
[ 122.100153] R13: ffffffffb7916820 R14: ffffffffb7916890 R15: 0000000000000060
[ 122.100644] FS: 00007f1ae1a10740(0000) GS:ffff897f1ea80000(0000) knlGS:0000000000000000
[ 122.101217] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[ 122.101613] CR2: 0000563a9d4fe870 CR3: 0000000875896006 CR4: 00000000003606e0
[ 122.102121] Call Trace:
[ 122.102297] i915_pci_remove+0x1e/0x40
[ 122.102560] pci_device_remove+0x39/0xc0
[ 122.102873] device_release_driver_internal+0x1b5/0x270
[ 122.103237] device_release_driver+0xd/0x20
[ 122.103529] unbind_store+0xb8/0x160
[ 122.103780] drv_attr_store+0x22/0x50
[ 122.104039] sysfs_kf_write+0x36/0x60
[ 122.104295] kernfs_fop_write+0x131/0x1b0
[ 122.104576] __vfs_write+0x26/0x170
[ 122.104822] ? _cond_resched+0x14/0x30
[ 122.105085] vfs_write+0xaa/0x1b0
[ 122.105319] ksys_write+0x41/0xa0
[ 122.105552] __x64_sys_write+0x15/0x20
[ 122.105853] do_syscall_64+0x55/0x1b0
[ 122.106110] entry_SYSCALL_64_after_hwframe+0x44/0xa9
[ 122.106461] RIP: 0033:0x7f1ae1b2030b
[ 122.106712] Code: 66 2e 0f 1f 84 00 00 00 00 00 0f 1f 44 00 00 f3 0f 1e fa 48 8d 05 f5 33 0e 00 8b 00 85 c0 75 17 b8 01 00 00 00 c5 fc 77 0f 05 <48> 3d 00 f0 ff ff 77 55 c3 0f 1f 40 00 41 54 49 89 d4 55 48 89 f5
[ 122.108053] RSP: 002b:00007ffd8468e298 EFLAGS: 00000246 ORIG_RAX: 0000000000000001
[ 122.108573] RAX: ffffffffffffffda RBX: 000000000000000d RCX: 00007f1ae1b2030b
[ 122.109083] RDX: 000000000000000d RSI: 00000000014cb140 RDI: 0000000000000001
[ 122.109572] RBP: 00000000014cb140 R08: 000000000000000a R09: 00007f1ae1a10740
[ 122.110078] R10: 00000000004c5517 R11: 0000000000000246 R12: 00007f1ae1bff760
[ 122.110567] R13: 000000000000000d R14: 00007f1ae1bfb340 R15: 00007f1ae1bfa740
[ 122.111094] --[ end trace 5678d77f99e40dff ]--
[ 122.287874] RIP: 0010:mutex_unlock+0x12/0x30
[ 122.288737] Code: 70 10 e8 11 b8 43 ff eb ab 66 66 2e 0f 1f 84 00 00 00 00 00 0f 1f 40 00 55 31 c9 65 48 8b 14 25 c0 4d 01 00 48 89 e5 48 89 d0 <f0> 48 0f b1 0f 48 39 c2 74 05 e8 df fe ff ff 5d 31 c0 89 c2 89 c1
[ 122.293300] RSP: 0018:ffff9bd303c67920 EFLAGS: 00010246
[ 122.294614] RAX: ffff897f139b4040 RBX: 0000000000000000 RCX: 0000000000000000
[ 122.296573] RDX: ffff897f139b4040 RSI: 0000000000000000 RDI: 0000000000005000
[ 122.298380] RBP: ffff9bd303c67920 R08: 0000000000000000 R09: 0000000000000000
[ 122.300190] R10: 0000000000000000 R11: 0000000000000000 R12: 0000000000000000
[ 122.302150] R13: 0000000000000002 R14: ffff897f1a43e108 R15: ffff897f196c8000
[ 122.303967] FS: 00007fe2d42a7dc0(0000) GS:ffff897f1eb80000(0000) knlGS:0000000000000000
[ 122.306023] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[ 122.307702] CR2: 0000563a9d4f9fe8 CR3: 0000000891536005 CR4: 00000000003606e0
[ 179.223074] BUG: unable to handle kernel NULL pointer dereference at 0000000000000000
[ 179.223611] PGD 0 P4D 0 
[ 179.223792] Oops: 0000 2 SMP PTI
[ 179.224036] CPU: 1 PID: 640 Comm: threaded-ml Tainted: G D W 4.20.6-690.native #1
[ 179.224621] Hardware name: Intel Corporation NUC7i7BNH/NUC7i7BNB, BIOS BNKBL357.86A.0050.2017.0816.2002 08/16/2017
[ 179.225321] RIP: 0010:ida_free+0xac/0x110
[ 179.225599] Code: 24 01 48 8d 7d b0 e8 53 fd 00 00 48 8b 7d b0 4c 89 ee e8 07 0b 02 00 48 83 c4 30 31 c0 5b 41 5c 41 5d 41 5e 5d 89 c6 89 c7 c3 <4c> 0f a3 30 73 39 4c 0f b3 30 48 8d 7d b0 31 f6 e8 6f fa 00 00 be
[ 179.226854] RSP: 0018:ffff9bd304187a90 EFLAGS: 00010046
[ 179.227212] RAX: 0000000000000000 RBX: 0000000000000003 RCX: 0000000000000000
[ 179.227697] RDX: 0000000000000000 RSI: 0000000000000000 RDI: 0000000000000000
[ 179.228181] RBP: ffff9bd304187ae0 R08: 0000000000000000 R09: 0000000000000000
@hyuan3
Copy link

hyuan3 commented Mar 4, 2019

It's a native i915 driver issue. The driver is failed to unbind.

@ahkok ahkok added the bug label Mar 4, 2019
@ahkok
Copy link
Contributor

ahkok commented Mar 4, 2019

Is this reported upstream?

@hyuan3
Copy link

hyuan3 commented Mar 5, 2019

No. @TerrenceXu tested upstream kernel and didn't reproduce this issue.

@miguelinux
Copy link
Member

@miguelinux
Copy link
Member

@TerrenceXu
Copy link
Author

@hyuan3
Copy link

hyuan3 commented Mar 5, 2019

@miguelinux No, it's found in GVT-d setup that before pass through GPU device to virtual machine, first unbind i915 driver from the device.

@TerrenceXu
Copy link
Author

Hi All, any progress?

@gvancuts
Copy link

gvancuts commented Mar 7, 2019

FWIW, I can also easily reproduce that with the latest version of Clear Linux (i.e.: 28190), tried both the kernel-native and kernel-lts bundles with the same results. My system is running the latest bios: BNKBL357.86A.0072.2018.1222.1129

@TerrenceXu , did you use the exact same kernel configuration when you tested with the upstream kernel?

@TerrenceXu
Copy link
Author

@gvancuts , I just use the same kernel configuration with build upstream 4.20.13 kernel, this issue was not happening.

@gvancuts
Copy link

gvancuts commented Mar 8, 2019

@gvancuts , I just use the same kernel configuration with build upstream 4.20.13 kernel, this issue was not happening.

Earlier today I built an upstream 4.20.14 (same version of the latest Clear Linux native kernel), using the exact same kernel config than the one we use in Clear Linux. But I am also seeing the same call trace with the upstream kernel. It's getting late over here but I'll see if I can try with the upstream 4.20.13 kernel.

@gvancuts
Copy link

gvancuts commented Mar 8, 2019

@gvancuts , I just use the same kernel configuration with build upstream 4.20.13 kernel, this issue was not happening.

Earlier today I built an upstream 4.20.14 (same version of the latest Clear Linux native kernel), using the exact same kernel config than the one we use in Clear Linux. But I am also seeing the same call trace with the upstream kernel. It's getting late over here but I'll see if I can try with the upstream 4.20.13 kernel.

Same with upstream 4.20.13 using the Clear Linux configuration. Can you double-check you are indeed using the exact same kernel configuration when comparing the upstream and Clear Linux native kernels?

@gvancuts
Copy link

gvancuts commented Mar 8, 2019

My last test of the day was to make the i915 a module (instead of built-in) but I still see the same call trace in dmesg (using the upstream 4.20.13 kernel)

@TerrenceXu
Copy link
Author

@gvancuts, looks this issue is related to Clear Linux OS or Clear Linux OS patches.
I build 4.20.13 or 4.20.14 upstream kernel + clear Linux configuration on Ubuntu, issue not existed.
I build 4.20.13 or 4.20.14 upstream kernel + clear Linux configuration on Clear Linux follow https://clearlinux.org/documentation/clear-linux/guides/maintenance/kernel-development, issue existed.

@gvancuts
Copy link

This points at at least some responsibility in user-space then, I don't really know where to go from here so I'll leave it to the experts to jump in and help you out. I'll just add a couple of notes:

  • I see the same crash on Fedora 29 running on a Skull Canyon (kernel 4.20.13-200-fc29)
  • The intel-gfx team apparently hit this late last year (see this bug report) but it was recently closed because it is no longer happening. There is no workaround or root-cause identified in the bug report.

@bryteise
Copy link
Member

@miguelinux Do you have the hardware available to try and reproduce this issue?

@miguelinux
Copy link
Member

@bryteise I think we dont have KBL-NUC at GDC

@bryteise
Copy link
Member

@nesiusra Do you know who might have a KBL handy to work on this?

@bryteise
Copy link
Member

@TerrenceXu I've been reminding that this could be do to a bug in the i915 driver due to how we load firmware. Could you disable CONFIG_EXTRA_FIRMWARE in the config and retry (that line loads the DMC firmware)?

@gvancuts
Copy link

gvancuts commented Mar 11, 2019 via email

@seanvk
Copy link

seanvk commented Mar 11, 2019

How does Clear load dmc/guc differently?

@seanvk
Copy link

seanvk commented Mar 11, 2019

run modinfo i915 | grep -i guc
modinfo i915 | grep -i dmc

@bryteise
Copy link
Member

@seanvk i915 is built-in on Clear I think for boot speed reasons.

@chivakker
Copy link

@bryteise dmc will load on native only, guc/huc will not be loaded

@bryteise
Copy link
Member

@chivakker Isn't this an issue for native?

@TerrenceXu
Copy link
Author

@bryteise , we still can reproduce this issue after disable CONFIG_EXTRA_FIRMWARE in the kernel config.:(

@bryteise
Copy link
Member

Well that's unpleasantly surprising, hrm. Thoughts @fenrus75 ?

@seanvk
Copy link

seanvk commented Mar 14, 2019

You might need CONFIG_VFIO_IOMMU_TYPE1 and CONFIG_VFIO_PCI_IGD

@ahkok
Copy link
Contributor

ahkok commented Mar 14, 2019

Those 2 are enabled in linux-native already.

@seanvk
Copy link

seanvk commented Mar 14, 2019

Okay. One last thought. Maybe you also need
echo -n auto > /sys/bus/pci/devices/0000:00:02.0/power/control
When unbinding. So a better order could be:

echo -n auto > /sys/bus/pci/devices/0000:00:02.0/power/control
echo -n "0000:00:02.0" > /sys/bus/pci/drivers/i915/unbind

@seanvk
Copy link

seanvk commented Mar 14, 2019

works for me on Intel(R) Core(TM) i7-8650U CPU @ 1.90GHz
Name: Clear Linux OS
Version: 27400
Linux clr-bend-svkelley 4.20.3-685.native #1 SMP Mon Jan 21 17:57:34 UTC 2019 x86_64 GNU/Linux

@seanvk
Copy link

seanvk commented Mar 14, 2019

Now I did it when I had a graphics target up and of course that got messy, but aside from pulling the rug out from under the target mode, it worked fine.

@seanvk
Copy link

seanvk commented Mar 14, 2019

Just to double check for errors. I went back and did the following:

First, reboot with iommu enabled:
reboot with intel_iommu=on

Next I disabled graphical target:
sudo systemctl mask gdm
sudo systemctl isolate multi-user.target

Then I loaded the modules:
sudo modprobe vfio
sudo modprobe vfio_pci

Then I unbound the i915:
sudo su
echo -n auto > /sys/bus/pci/devices/0000:00:02.0/power/control
echo -n "0000:00:02.0" > /sys/bus/pci/drivers/i915/unbind

dmesg was clean and my remote shell to the laptop is functional, no lockups or kernel oops.

[ 236.133164] calling vfio_virqfd_init+0x0/0x1000 [vfio_virqfd] @ 1247
[ 236.133273] initcall vfio_virqfd_init+0x0/0x1000 [vfio_virqfd] returned 0 after 103 usecs
[ 236.135225] calling vfio_pci_init+0x0/0x15b [vfio_pci] @ 1247
[ 236.135250] initcall vfio_pci_init+0x0/0x15b [vfio_pci] returned 0 after 22 usecs
[ 247.322641] Console: switching to colour dummy device 80x25

Sean

@seanvk
Copy link

seanvk commented Mar 14, 2019

No issue seen on current Clear release either:

root@clr-bend-svkelley /home/seanvk # uname -a
Linux clr-bend-svkelley 5.0.2-717.native #1 SMP Wed Mar 13 23:07:41 UTC 2019 x86_64 GNU/Linux
root@clr-bend-svkelley /home/seanvk # distro
Name: Clear Linux OS
Version: 28310
Codename:

This is a KBL-R/Coffeelake based system

@seanvk
Copy link

seanvk commented Mar 15, 2019

@ahkok @iphutch I suggest you close this issue as unable to reproduce unless @TerrenceXu can respond on the steps I've suggested and reproduces the error.

@iphutch iphutch closed this as completed Mar 15, 2019
@gvancuts
Copy link

I am still seeing the same issue after adding echo -n auto > /sys/bus/pci/devices/0000:00:02.0/power/control to the sequence of commands. My system is currently running Clear Linux 28190, I will try to update to the latest and see if it makes any difference. My CPU is a Intel(R) Core(TM) i7-7567U.

@gvancuts
Copy link

Same with the latest. The only thing that makes the kernel crash go away is to not start the desktop environment.

@fenrus75
Copy link
Contributor

fenrus75 commented Mar 16, 2019 via email

@gvancuts
Copy link

welllllll starting the desktop environment and then yanking away the gpu is a bit... harsh innit?

Yes, especially to the user staring at the monitor :-)

@seanvk mentionned in an earlier comment (#502 (comment)) that he did not see the crash even when he had a graphics target up and running.

@TerrenceXu
Copy link
Author

TerrenceXu commented Mar 17, 2019

@seanvk @gvancuts I still can reproduce it with 28320 (5.0.2-717.native).

[   30.054547] calling  crypto_cmac_module_init+0x0/0x1000 [cmac] @ 861
[   30.054550] initcall crypto_cmac_module_init+0x0/0x1000 [cmac] returned 0 after 0 usecs
[  210.024120] calling  vfio_virqfd_init+0x0/0x1000 [vfio_virqfd] @ 962
[  210.024259] initcall vfio_virqfd_init+0x0/0x1000 [vfio_virqfd] returned 0 after 129 usecs
[  210.032162] calling  vfio_pci_init+0x0/0x15b [vfio_pci] @ 962
[  210.032198] initcall vfio_pci_init+0x0/0x15b [vfio_pci] returned 0 after 32 usecs
[  213.067304] Console: switching to colour dummy device 80x25
[  213.067406] snd_hda_intel 0000:00:1f.3: Dropping the link to 0000:00:02.0
[  213.135743] WARNING: CPU: 0 PID: 952 at drivers/gpu/drm/drm_mode_config.c:455 drm_mode_config_cleanup+0x287/0x2e0
[  213.135744] Modules linked in: vfio_pci vfio_virqfd cmac rfcomm bnep fuse snd_soc_skl snd_soc_hdac_hda snd_hda_ext_core snd_soc_skl_ipc iwlmvm snd_soc_sst_ipc snd_soc_sst_dsp mac80211 snd_soc_acpi_intel_match snd_soc_acpi snd_soc_core snd_compress btusb btrtl snd_pcm_dmaengine iwlwifi btbcm ac97_bus btintel bluetooth snd_hda_codec_hdmi ecdh_generic rtsx_pci_sdmmc snd_hda_codec_realtek rtsx_pci_ms memstick snd_hda_codec_generic intel_wmi_thunderbolt wmi_bmof ledtrig_audio e1000e intel_xhci_usb_role_switch cfg80211 rfkill snd_hda_intel wmi rtsx_pci roles intel_pch_thermal snd_hda_codec snd_hda_core mei_me mei snd_hwdep snd_pcm snd_timer snd pcc_cpufreq i2c_i801 soundcore
[  213.135762] CPU: 0 PID: 952 Comm: bash Not tainted 5.0.2-717.native #1
[  213.135762] Hardware name: Intel Corporation NUC7i7BNH/NUC7i7BNB, BIOS BNKBL357.86A.0050.2017.0816.2002 08/16/2017
[  213.135764] RIP: 0010:drm_mode_config_cleanup+0x287/0x2e0
[  213.135765] Code: 02 00 00 e8 8b bb 45 00 49 8b 04 24 49 39 c4 75 5f 48 83 c4 38 31 c0 5b 41 5c 41 5d 41 5e 41 5f 5d 89 c2 89 c1 89 c6 89 c7 c3 <0f> 0b 48 8d 75 a8 4c 89 e7 48 c7 c3 2b ed 60 8c e8 84 83 ff ff eb
[  213.135765] RSP: 0018:ffffa9d50527fc58 EFLAGS: 00010297
[  213.135766] RAX: ffff8be45947a818 RBX: ffff8be459188368 RCX: 0000000000000000
[  213.135767] RDX: ffff8be459188330 RSI: 0000000000000000 RDI: 0000000000000000
[  213.135767] RBP: ffffa9d50527fcb8 R08: 0000000000000000 R09: 0000000000000000
[  213.135768] R10: 0000000000000000 R11: 0000000000000000 R12: ffff8be459188000
[  213.135768] R13: ffff8be459188370 R14: ffffffff8c91b290 R15: 0000000000000060
[  213.135769] FS:  00007f4e301e7740(0000) GS:ffff8be45ea00000(0000) knlGS:0000000000000000
[  213.135770] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[  213.135771] CR2: 00000000018e05d0 CR3: 0000000878f58004 CR4: 00000000003606f0
[  213.135771] Call Trace:
[  213.135775]  ? mutex_lock+0xd/0x30
[  213.135776]  intel_modeset_cleanup+0xb2/0x120
[  213.135779]  i915_driver_unload+0x9f/0x140
[  213.135780]  i915_pci_remove+0x1e/0x40
[  213.135781]  pci_device_remove+0x39/0xc0
[  213.135783]  device_release_driver_internal+0x189/0x250
[  213.135784]  device_release_driver+0xd/0x20
[  213.135786]  unbind_store+0xb8/0x160
[  213.135787]  drv_attr_store+0x22/0x50
[  213.135789]  sysfs_kf_write+0x36/0x60
[  213.135789]  kernfs_fop_write+0x131/0x1b0
[  213.135791]  __vfs_write+0x26/0x190
[  213.135792]  ? _cond_resched+0x14/0x30
[  213.135793]  vfs_write+0xaa/0x1b0
[  213.135794]  ksys_write+0x41/0xa0
[  213.135795]  __x64_sys_write+0x15/0x20
[  213.135797]  do_syscall_64+0x55/0x1b0
[  213.135798]  entry_SYSCALL_64_after_hwframe+0x44/0xa9
[  213.135799] RIP: 0033:0x7f4e302fa77b
[  213.135800] Code: 66 2e 0f 1f 84 00 00 00 00 00 0f 1f 44 00 00 f3 0f 1e fa 48 8d 05 c5 ea 0d 00 8b 00 85 c0 75 17 b8 01 00 00 00 c5 fc 77 0f 05 <48> 3d 00 f0 ff ff 77 55 c3 0f 1f 40 00 41 54 49 89 d4 55 48 89 f5
[  213.135800] RSP: 002b:00007ffc0e4b7b38 EFLAGS: 00000246 ORIG_RAX: 0000000000000001
[  213.135801] RAX: ffffffffffffffda RBX: 000000000000000d RCX: 00007f4e302fa77b
[  213.135802] RDX: 000000000000000d RSI: 000000000116cb90 RDI: 0000000000000001
[  213.135802] RBP: 000000000116cb90 R08: 000000000000000a R09: 00007f4e301e7740
[  213.135803] R10: 00000000004c5517 R11: 0000000000000246 R12: 00007f4e303d45c0
[  213.135803] R13: 000000000000000d R14: 00007f4e303d53c0 R15: 00007f4e303d47c0
[  213.135804] ---[ end trace f1f20182276dc090 ]---
[  213.135806] [drm:drm_mode_config_cleanup] *ERROR* connector DP-1 leaked!
[  213.136299] WARNING: CPU: 0 PID: 952 at drivers/gpu/drm/drm_mode_config.c:489 drm_mode_config_cleanup+0x2c6/0x2e0
[  213.136299] Modules linked in: vfio_pci vfio_virqfd cmac rfcomm bnep fuse snd_soc_skl snd_soc_hdac_hda snd_hda_ext_core snd_soc_skl_ipc iwlmvm snd_soc_sst_ipc snd_soc_sst_dsp mac80211 snd_soc_acpi_intel_match snd_soc_acpi snd_soc_core snd_compress btusb btrtl snd_pcm_dmaengine iwlwifi btbcm ac97_bus btintel bluetooth snd_hda_codec_hdmi ecdh_generic rtsx_pci_sdmmc snd_hda_codec_realtek rtsx_pci_ms memstick snd_hda_codec_generic intel_wmi_thunderbolt wmi_bmof ledtrig_audio e1000e intel_xhci_usb_role_switch cfg80211 rfkill snd_hda_intel wmi rtsx_pci roles intel_pch_thermal snd_hda_codec snd_hda_core mei_me mei snd_hwdep snd_pcm snd_timer snd pcc_cpufreq i2c_i801 soundcore
[  213.136310] CPU: 0 PID: 952 Comm: bash Tainted: G        W         5.0.2-717.native #1
[  213.136311] Hardware name: Intel Corporation NUC7i7BNH/NUC7i7BNB, BIOS BNKBL357.86A.0050.2017.0816.2002 08/16/2017
[  213.136312] RIP: 0010:drm_mode_config_cleanup+0x2c6/0x2e0
[  213.136313] Code: eb 0c 48 8b 70 48 48 89 df e8 16 f4 ff ff 48 8d 7d a8 e8 ad 89 ff ff 48 85 c0 75 e6 48 8d 7d a8 e8 3f 8a ff ff e9 e1 fd ff ff <0f> 0b e9 e9 fe ff ff 0f 0b eb 9d 66 66 2e 0f 1f 84 00 00 00 00 00
[  213.136313] RSP: 0018:ffffa9d50527fc58 EFLAGS: 00010206
[  213.136314] RAX: ffff8be453d75e08 RBX: ffff8be459188438 RCX: ffff8be459188308
[  213.136315] RDX: ffff8be459188460 RSI: 0000000000000000 RDI: ffff8be459188438
[  213.136315] RBP: ffffa9d50527fcb8 R08: 0000000000000000 R09: 0000000000000000
[  213.136316] R10: 0000000000000000 R11: 0000000000000000 R12: ffff8be459188000
[  213.136316] R13: ffff8be459188460 R14: ffffffff8c91b290 R15: 0000000000000060
[  213.136317] FS:  00007f4e301e7740(0000) GS:ffff8be45ea00000(0000) knlGS:0000000000000000
[  213.136318] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[  213.136318] CR2: 00000000018e05d0 CR3: 0000000878f58004 CR4: 00000000003606f0
[  213.136318] Call Trace:
[  213.136320]  ? mutex_lock+0xd/0x30
[  213.136321]  intel_modeset_cleanup+0xb2/0x120
[  213.136322]  i915_driver_unload+0x9f/0x140
[  213.136323]  i915_pci_remove+0x1e/0x40
[  213.136324]  pci_device_remove+0x39/0xc0
[  213.136325]  device_release_driver_internal+0x189/0x250
[  213.136326]  device_release_driver+0xd/0x20
[  213.136328]  unbind_store+0xb8/0x160
[  213.136329]  drv_attr_store+0x22/0x50
[  213.136330]  sysfs_kf_write+0x36/0x60
[  213.136330]  kernfs_fop_write+0x131/0x1b0
[  213.136331]  __vfs_write+0x26/0x190
[  213.136333]  ? _cond_resched+0x14/0x30
[  213.136333]  vfs_write+0xaa/0x1b0
[  213.136334]  ksys_write+0x41/0xa0
[  213.136335]  __x64_sys_write+0x15/0x20
[  213.136336]  do_syscall_64+0x55/0x1b0
[  213.136337]  entry_SYSCALL_64_after_hwframe+0x44/0xa9
[  213.136338] RIP: 0033:0x7f4e302fa77b
[  213.136338] Code: 66 2e 0f 1f 84 00 00 00 00 00 0f 1f 44 00 00 f3 0f 1e fa 48 8d 05 c5 ea 0d 00 8b 00 85 c0 75 17 b8 01 00 00 00 c5 fc 77 0f 05 <48> 3d 00 f0 ff ff 77 55 c3 0f 1f 40 00 41 54 49 89 d4 55 48 89 f5
[  213.136339] RSP: 002b:00007ffc0e4b7b38 EFLAGS: 00000246 ORIG_RAX: 0000000000000001
[  213.136340] RAX: ffffffffffffffda RBX: 000000000000000d RCX: 00007f4e302fa77b
[  213.136340] RDX: 000000000000000d RSI: 000000000116cb90 RDI: 0000000000000001
[  213.136341] RBP: 000000000116cb90 R08: 000000000000000a R09: 00007f4e301e7740
[  213.136341] R10: 00000000004c5517 R11: 0000000000000246 R12: 00007f4e303d45c0
[  213.136342] R13: 000000000000000d R14: 00007f4e303d53c0 R15: 00007f4e303d47c0
[  213.136342] ---[ end trace f1f20182276dc091 ]---
[  213.136344] [leaked fb] framebuffer[113]:
[  213.136345] [leaked fb]      allocated by = X
[  213.136345] [leaked fb]      refcount=1
[  213.136347] [leaked fb]      format=XR24 little-endian (0x34325258)
[  213.136347] [leaked fb]      modifier=0x100000000000001
[  213.136348] [leaked fb]      size=1360x768
[  213.136348] [leaked fb]      layers:
[  213.136349] [leaked fb]              size[0]=1360x768
[  213.136350] [leaked fb]              pitch[0]=5632
[  213.136350] [leaked fb]              offset[0]=0
[  213.136351] [leaked fb]              obj[0]:
[  213.136352] [leaked fb]                      name=0
[  213.136352] [leaked fb]                      refcount=4
[  213.136353] [leaked fb]                      start=00000000
[  213.136353] [leaked fb]                      size=5242880
[  213.136354] [leaked fb]                      imported=no
[  213.136361] WARNING: CPU: 0 PID: 952 at include/drm/drm_modeset_lock.h:105 drm_mode_config_cleanup+0x2cd/0x2e0
[  213.136361] Modules linked in: vfio_pci vfio_virqfd cmac rfcomm bnep fuse snd_soc_skl snd_soc_hdac_hda snd_hda_ext_core snd_soc_skl_ipc iwlmvm snd_soc_sst_ipc snd_soc_sst_dsp mac80211 snd_soc_acpi_intel_match snd_soc_acpi snd_soc_core snd_compress btusb btrtl snd_pcm_dmaengine iwlwifi btbcm ac97_bus btintel bluetooth snd_hda_codec_hdmi ecdh_generic rtsx_pci_sdmmc snd_hda_codec_realtek rtsx_pci_ms memstick snd_hda_codec_generic intel_wmi_thunderbolt wmi_bmof ledtrig_audio e1000e intel_xhci_usb_role_switch cfg80211 rfkill snd_hda_intel wmi rtsx_pci roles intel_pch_thermal snd_hda_codec snd_hda_core mei_me mei snd_hwdep snd_pcm snd_timer snd pcc_cpufreq i2c_i801 soundcore
[  213.136371] CPU: 0 PID: 952 Comm: bash Tainted: G        W         5.0.2-717.native #1
[  213.136371] Hardware name: Intel Corporation NUC7i7BNH/NUC7i7BNB, BIOS BNKBL357.86A.0050.2017.0816.2002 08/16/2017
[  213.136372] RIP: 0010:drm_mode_config_cleanup+0x2cd/0x2e0
[  213.136373] Code: 89 df e8 16 f4 ff ff 48 8d 7d a8 e8 ad 89 ff ff 48 85 c0 75 e6 48 8d 7d a8 e8 3f 8a ff ff e9 e1 fd ff ff 0f 0b e9 e9 fe ff ff <0f> 0b eb 9d 66 66 2e 0f 1f 84 00 00 00 00 00 0f 1f 40 00 55 48 89
[  213.136373] RSP: 0018:ffffa9d50527fc58 EFLAGS: 00010287
[  213.136374] RAX: ffffa9d503f67c18 RBX: ffff8be459188300 RCX: 0000000000000000
[  213.136375] RDX: 0000000000000000 RSI: 0000000000000000 RDI: 0000000000000000
[  213.136375] RBP: ffffa9d50527fcb8 R08: 0000000000000000 R09: 0000000000000000
[  213.136376] R10: 0000000000000000 R11: 0000000000000000 R12: ffff8be459188278
[  213.136376] R13: ffff8be459188300 R14: ffffffff8c60ed41 R15: ffffffff8b8dfa90
[  213.136377] FS:  00007f4e301e7740(0000) GS:ffff8be45ea00000(0000) knlGS:0000000000000000
[  213.136378] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[  213.136378] CR2: 00000000018e05d0 CR3: 0000000878f58004 CR4: 00000000003606f0
[  213.136378] Call Trace:
[  213.136379]  ? __drm_printfn_info+0x20/0x20
[  213.136381]  intel_modeset_cleanup+0xb2/0x120
[  213.136382]  i915_driver_unload+0x9f/0x140
[  213.136383]  i915_pci_remove+0x1e/0x40
[  213.136383]  pci_device_remove+0x39/0xc0
[  213.136384]  device_release_driver_internal+0x189/0x250
[  213.136385]  device_release_driver+0xd/0x20
[  213.136387]  unbind_store+0xb8/0x160
[  213.136388]  drv_attr_store+0x22/0x50
[  213.136389]  sysfs_kf_write+0x36/0x60
[  213.136389]  kernfs_fop_write+0x131/0x1b0
[  213.136390]  __vfs_write+0x26/0x190
[  213.136392]  ? _cond_resched+0x14/0x30
[  213.136392]  vfs_write+0xaa/0x1b0
[  213.136393]  ksys_write+0x41/0xa0
[  213.136394]  __x64_sys_write+0x15/0x20
[  213.136395]  do_syscall_64+0x55/0x1b0
[  213.136396]  entry_SYSCALL_64_after_hwframe+0x44/0xa9
[  213.136396] RIP: 0033:0x7f4e302fa77b
[  213.136397] Code: 66 2e 0f 1f 84 00 00 00 00 00 0f 1f 44 00 00 f3 0f 1e fa 48 8d 05 c5 ea 0d 00 8b 00 85 c0 75 17 b8 01 00 00 00 c5 fc 77 0f 05 <48> 3d 00 f0 ff ff 77 55 c3 0f 1f 40 00 41 54 49 89 d4 55 48 89 f5
[  213.136398] RSP: 002b:00007ffc0e4b7b38 EFLAGS: 00000246 ORIG_RAX: 0000000000000001
[  213.136398] RAX: ffffffffffffffda RBX: 000000000000000d RCX: 00007f4e302fa77b
[  213.136399] RDX: 000000000000000d RSI: 000000000116cb90 RDI: 0000000000000001
[  213.136399] RBP: 000000000116cb90 R08: 000000000000000a R09: 00007f4e301e7740
[  213.136400] R10: 00000000004c5517 R11: 0000000000000246 R12: 00007f4e303d45c0
[  213.136400] R13: 000000000000000d R14: 00007f4e303d53c0 R15: 00007f4e303d47c0
[  213.136401] ---[ end trace f1f20182276dc092 ]---
[  213.139720] BUG: unable to handle kernel paging request at 0000000000005158
[  213.140221] #PF error: [WRITE]
[  213.140437] PGD 800000088e47d067 P4D 800000088e47d067 PUD 88e47e067 PMD 0
[  213.140908] Oops: 0002 [#1] SMP PTI
[  213.141153] CPU: 1 PID: 494 Comm: X Tainted: G        W         5.0.2-717.native #1
[  213.141676] Hardware name: Intel Corporation NUC7i7BNH/NUC7i7BNB, BIOS BNKBL357.86A.0050.2017.0816.2002 08/16/2017
[  213.142397] RIP: 0010:mutex_unlock+0x12/0x30
[  213.142694] Code: 70 10 e8 01 47 42 ff eb ab 66 66 2e 0f 1f 84 00 00 00 00 00 0f 1f 40 00 55 31 c9 65 48 8b 14 25 c0 4d 01 00 48 89 e5 48 89 d0 <f0> 48 0f b1 0f 48 39 c2 74 05 e8 df fe ff ff 5d 31 c0 89 c2 89 c1
[  213.143996] RSP: 0018:ffffa9d503f67908 EFLAGS: 00010246
[  213.144355] RAX: ffff8be44e5da040 RBX: ffff8be4596dd108 RCX: 0000000000000000
[  213.144843] RDX: ffff8be44e5da040 RSI: 0000000000000000 RDI: 0000000000005158
[  213.145331] RBP: ffffa9d503f67908 R08: 0000000000000000 R09: 0000000000000000
[  213.145817] R10: 0000000000000000 R11: 0000000000000000 R12: 0000000000000000
[  213.146304] R13: 0000000000000002 R14: 0000000000000004 R15: ffff8be459188000
[  213.146791] FS:  00007f51501bbdc0(0000) GS:ffff8be45ea80000(0000) knlGS:0000000000000000
[  213.147343] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[  213.147757] CR2: 0000000000005158 CR3: 000000088e5de004 CR4: 00000000003606e0
[  213.148245] Call Trace:
[  213.148422]  pps_unlock+0x1e/0x40
[  213.148655]  intel_dp_aux_xfer+0x15a/0x620
[  213.148944]  ? wait_woken+0x80/0x80
[  213.149194]  intel_dp_aux_transfer+0xad/0x250
[  213.149502]  ? __switch_to+0x460/0x4c0
[  213.149767]  drm_dp_dpcd_access+0x62/0xf0
[  213.150048]  drm_dp_dpcd_read+0x2e/0xd0
[  213.150317]  lspcon_wake_native_aux_ch+0x1f/0x80
[  213.150636]  lspcon_resume+0x13/0xf0
[  213.150886]  intel_dp_detect+0x4ad/0x590
[  213.151160]  drm_helper_probe_detect+0x4b/0x90
[  213.151469]  drm_helper_probe_single_connector_modes+0xc6/0x6d0
[  213.151877]  drm_mode_getconnector+0x4bd/0x4e9
[  213.152186]  ? drm_connector_property_set_ioctl+0x30/0x30
[  213.152559]  drm_ioctl_kernel+0x93/0xd0
[  213.152837]  drm_ioctl+0x391/0x470
[  213.153079]  ? drm_connector_property_set_ioctl+0x30/0x30
[  213.153459]  do_vfs_ioctl+0x94/0x5e0
[  213.153716]  ? handle_mm_fault+0xea/0x1c0
[  213.153996]  ksys_ioctl+0x6a/0x90
[  213.154230]  __x64_sys_ioctl+0x15/0x20
[  213.154494]  do_syscall_64+0x55/0x1b0
[  213.154751]  entry_SYSCALL_64_after_hwframe+0x44/0xa9
[  213.155100] RIP: 0033:0x7f5150850c2b
[  213.155351] Code: 0f 1e fa 48 8b 05 65 22 0d 00 64 c7 00 26 00 00 00 48 c7 c0 ff ff ff ff c3 66 0f 1f 44 00 00 f3 0f 1e fa b8 10 00 00 00 0f 05 <48> 3d 01 f0 ff ff 73 01 c3 48 8b 0d 35 22 0d 00 f7 d8 64 89 01 48
[  213.156616] RSP: 002b:00007ffdb7be4a08 EFLAGS: 00003246 ORIG_RAX: 0000000000000010
[  213.157133] RAX: ffffffffffffffda RBX: 0000000001343280 RCX: 00007f5150850c2b
[  213.157622] RDX: 00007ffdb7be4aa0 RSI: 00000000c05064a7 RDI: 000000000000000d
[  213.158109] RBP: 00007ffdb7be4a30 R08: 00000000011d2430 R09: 00000000011d23e0
[  213.158597] R10: 0000000000000001 R11: 0000000000003246 R12: 00007ffdb7be4aa0
[  213.159084] R13: 00000000c05064a7 R14: 000000000000000d R15: 00007ffdb7be4aa0
[  213.159571] Modules linked in: vfio_pci vfio_virqfd cmac rfcomm bnep fuse snd_soc_skl snd_soc_hdac_hda snd_hda_ext_core snd_soc_skl_ipc iwlmvm snd_soc_sst_ipc snd_soc_sst_dsp mac80211 snd_soc_acpi_intel_match snd_soc_acpi snd_soc_core snd_compress btusb btrtl snd_pcm_dmaengine iwlwifi btbcm ac97_bus btintel bluetooth snd_hda_codec_hdmi ecdh_generic rtsx_pci_sdmmc snd_hda_codec_realtek rtsx_pci_ms memstick snd_hda_codec_generic intel_wmi_thunderbolt wmi_bmof ledtrig_audio e1000e intel_xhci_usb_role_switch cfg80211 rfkill snd_hda_intel wmi rtsx_pci roles intel_pch_thermal snd_hda_codec snd_hda_core mei_me mei snd_hwdep snd_pcm snd_timer snd pcc_cpufreq i2c_i801 soundcore
[  213.163553] CR2: 0000000000005158
[  213.163786] ---[ end trace f1f20182276dc093 ]---
[  213.263907] RIP: 0010:mutex_unlock+0x12/0x30
[  213.264205] Code: 70 10 e8 01 47 42 ff eb ab 66 66 2e 0f 1f 84 00 00 00 00 00 0f 1f 40 00 55 31 c9 65 48 8b 14 25 c0 4d 01 00 48 89 e5 48 89 d0 <f0> 48 0f b1 0f 48 39 c2 74 05 e8 df fe ff ff 5d 31 c0 89 c2 89 c1
[  213.265472] RSP: 0018:ffffa9d503f67908 EFLAGS: 00010246
[  213.265833] RAX: ffff8be44e5da040 RBX: ffff8be4596dd108 RCX: 0000000000000000
[  213.266321] RDX: ffff8be44e5da040 RSI: 0000000000000000 RDI: 0000000000005158
[  213.266809] RBP: ffffa9d503f67908 R08: 0000000000000000 R09: 0000000000000000
[  213.267297] R10: 0000000000000000 R11: 0000000000000000 R12: 0000000000000000
[  213.267784] R13: 0000000000000002 R14: 0000000000000004 R15: ffff8be459188000
[  213.268271] FS:  00007f51501bbdc0(0000) GS:ffff8be45ea80000(0000) knlGS:0000000000000000
[  213.268822] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[  213.269216] CR2: 0000000000005158 CR3: 000000088e5de004 CR4: 00000000003606e0
[  213.316874] ------------[ cut here ]------------
[  213.318002] WARN_ON(!list_empty(&dev_priv->contexts.list))
[  213.318171] WARNING: CPU: 3 PID: 952 at drivers/gpu/drm/i915/i915_gem.c:5742 i915_gem_fini+0x14d/0x160
[  213.322000] Modules linked in: vfio_pci vfio_virqfd cmac rfcomm bnep fuse snd_soc_skl snd_soc_hdac_hda snd_hda_ext_core snd_soc_skl_ipc iwlmvm snd_soc_sst_ipc snd_soc_sst_dsp mac80211 snd_soc_acpi_intel_match snd_soc_acpi snd_soc_core snd_compress btusb btrtl snd_pcm_dmaengine iwlwifi btbcm ac97_bus btintel bluetooth snd_hda_codec_hdmi ecdh_generic rtsx_pci_sdmmc snd_hda_codec_realtek rtsx_pci_ms memstick snd_hda_codec_generic intel_wmi_thunderbolt wmi_bmof ledtrig_audio e1000e intel_xhci_usb_role_switch cfg80211 rfkill snd_hda_intel wmi rtsx_pci roles intel_pch_thermal snd_hda_codec snd_hda_core mei_me mei snd_hwdep snd_pcm snd_timer snd pcc_cpufreq i2c_i801 soundcore
[  213.336637] CPU: 3 PID: 952 Comm: bash Tainted: G      D W         5.0.2-717.native #1
[  213.338842] Hardware name: Intel Corporation NUC7i7BNH/NUC7i7BNB, BIOS BNKBL357.86A.0050.2017.0816.2002 08/16/2017
[  213.341608] RIP: 0010:i915_gem_fini+0x14d/0x160
[  213.342837] Code: 6d 00 00 49 39 c4 75 0f 5b 31 c0 41 5c 41 5d 41 5e 5d 89 c6 89 c7 c3 48 c7 c6 b0 c4 61 8c 48 c7 c7 29 e2 5e 8c e8 53 71 80 ff <0f> 0b eb da 66 66 2e 0f 1f 84 00 00 00 00 00 0f 1f 40 00 55 48 89
[  213.346956] RSP: 0018:ffffa9d50527fcc8 EFLAGS: 00010292
[  213.348419] RAX: 0000000000000000 RBX: ffff8be45918ddf0 RCX: 0000000000000000
[  213.350237] RDX: 0000000000000000 RSI: 0000000000000000 RDI: 0000000000000000
[  213.352215] RBP: ffffa9d50527fce8 R08: 0000000000000000 R09: 0000000000000000
[  213.354190] R10: 0000000000000000 R11: 0000000000000000 R12: ffff8be45918edc0
[  213.356158] R13: ffff8be45918c9a0 R14: ffff8be459188068 R15: 0000000000000060
[  213.358131] FS:  00007f4e301e7740(0000) GS:ffff8be45eb80000(0000) knlGS:0000000000000000
[  213.360199] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[  213.361880] CR2: 00007f8262aed908 CR3: 0000000878f58004 CR4: 00000000003606e0
[  213.363532] Call Trace:
[  213.363714]  i915_driver_unload+0xd9/0x140
[  213.364741]  i915_pci_remove+0x1e/0x40
[  213.365751]  pci_device_remove+0x39/0xc0
[  213.366777]  device_release_driver_internal+0x189/0x250
[  213.368086]  device_release_driver+0xd/0x20
[  213.369298]  unbind_store+0xb8/0x160
[  213.370287]  drv_attr_store+0x22/0x50
[  213.371290]  sysfs_kf_write+0x36/0x60
[  213.372291]  kernfs_fop_write+0x131/0x1b0
[  213.373331]  __vfs_write+0x26/0x190
[  213.374319]  ? _cond_resched+0x14/0x30
[  213.375329]  vfs_write+0xaa/0x1b0
[  213.376299]  ksys_write+0x41/0xa0
[  213.377266]  __x64_sys_write+0x15/0x20
[  213.378279]  do_syscall_64+0x55/0x1b0
[  213.379284]  entry_SYSCALL_64_after_hwframe+0x44/0xa9
[  213.380579] RIP: 0033:0x7f4e302fa77b
[  213.382825] Code: 66 2e 0f 1f 84 00 00 00 00 00 0f 1f 44 00 00 f3 0f 1e fa 48 8d 05 c5 ea 0d 00 8b 00 85 c0 75 17 b8 01 00 00 00 c5 fc 77 0f 05 <48> 3d 00 f0 ff ff 77 55 c3 0f 1f 40 00 41 54 49 89 d4 55 48 89 f5
[  213.387088] RSP: 002b:00007ffc0e4b7b38 EFLAGS: 00000246 ORIG_RAX: 0000000000000001
[  213.389004] RAX: ffffffffffffffda RBX: 000000000000000d RCX: 00007f4e302fa77b
[  213.390967] RDX: 000000000000000d RSI: 000000000116cb90 RDI: 0000000000000001
[  213.392778] RBP: 000000000116cb90 R08: 000000000000000a R09: 00007f4e301e7740
[  213.394749] R10: 00000000004c5517 R11: 0000000000000246 R12: 00007f4e303d45c0
[  213.396726] R13: 000000000000000d R14: 00007f4e303d53c0 R15: 00007f4e303d47c0
[  213.398705] ---[ end trace f1f20182276dc094 ]---
[  213.400200] ------------[ cut here ]------------
[  213.400528] Memory manager not clean during takedown.
[  213.400903] WARNING: CPU: 3 PID: 952 at drivers/gpu/drm/drm_mm.c:950 drm_mm_takedown+0x22/0x30
[  213.402244] Modules linked in: vfio_pci vfio_virqfd cmac rfcomm bnep fuse snd_soc_skl snd_soc_hdac_hda snd_hda_ext_core snd_soc_skl_ipc iwlmvm snd_soc_sst_ipc snd_soc_sst_dsp mac80211 snd_soc_acpi_intel_match snd_soc_acpi snd_soc_core snd_compress btusb btrtl snd_pcm_dmaengine iwlwifi btbcm ac97_bus btintel bluetooth snd_hda_codec_hdmi ecdh_generic rtsx_pci_sdmmc snd_hda_codec_realtek rtsx_pci_ms memstick snd_hda_codec_generic intel_wmi_thunderbolt wmi_bmof ledtrig_audio e1000e intel_xhci_usb_role_switch cfg80211 rfkill snd_hda_intel wmi rtsx_pci roles intel_pch_thermal snd_hda_codec snd_hda_core mei_me mei snd_hwdep snd_pcm snd_timer snd pcc_cpufreq i2c_i801 soundcore
[  213.415398] CPU: 3 PID: 952 Comm: bash Tainted: G      D W         5.0.2-717.native #1
[  213.417448] Hardware name: Intel Corporation NUC7i7BNH/NUC7i7BNB, BIOS BNKBL357.86A.0050.2017.0816.2002 08/16/2017
[  213.420209] RIP: 0010:drm_mm_takedown+0x22/0x30
[  213.421457] Code: c0 e9 32 fd ff ff 66 90 48 8b 47 38 48 83 c7 38 48 39 c7 75 05 31 c0 89 c7 c3 55 48 c7 c7 28 cb 60 8c 48 89 e5 e8 3e 41 88 ff <0f> 0b 5d 31 c0 89 c7 c3 66 0f 1f 44 00 00 55 48 89 e5 41 57 41 56
[  213.426613] RSP: 0018:ffffa9d50527fc80 EFLAGS: 00010282
[  213.427930] RAX: 0000000000000000 RBX: ffff8be45918d900 RCX: 0000000000000000
[  213.429762] RDX: 0000000000000000 RSI: 0000000000000000 RDI: 0000000000000000
[  213.430980] RBP: ffffa9d50527fc80 R08: 0000000000000000 R09: 0000000000000000
[  213.432788] R10: 0000000000000000 R11: 0000000000000000 R12: ffff8be459188000
[  213.434752] R13: ffff8be45918da80 R14: ffff8be459188068 R15: ffffffff8c61beb7
[  213.436564] FS:  00007f4e301e7740(0000) GS:ffff8be45eb80000(0000) knlGS:0000000000000000
[  213.438779] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[  213.440300] CR2: 00007f8262aed908 CR3: 0000000878f58004 CR4: 00000000003606e0
[  213.442274] Call Trace:
[  213.442845]  i915_gem_cleanup_stolen+0x1a/0x30
[  213.444086]  i915_ggtt_cleanup_hw+0xfc/0x160
[  213.445311]  i915_driver_cleanup_hw+0x35/0x50
[  213.446541]  i915_driver_unload+0xe9/0x140
[  213.447742]  i915_pci_remove+0x1e/0x40
[  213.448753]  pci_device_remove+0x39/0xc0
[  213.449790]  device_release_driver_internal+0x189/0x250
[  213.451122]  device_release_driver+0xd/0x20
[  213.451781]  unbind_store+0xb8/0x160
[  213.452425]  drv_attr_store+0x22/0x50
[  213.453426]  sysfs_kf_write+0x36/0x60
[  213.454428]  kernfs_fop_write+0x131/0x1b0
[  213.455623]  __vfs_write+0x26/0x190
[  213.456449]  ? _cond_resched+0x14/0x30
[  213.457615]  vfs_write+0xaa/0x1b0
[  213.458430]  ksys_write+0x41/0xa0
[  213.459394]  __x64_sys_write+0x15/0x20
[  213.460409]  do_syscall_64+0x55/0x1b0
[  213.461412]  entry_SYSCALL_64_after_hwframe+0x44/0xa9
[  213.462703] RIP: 0033:0x7f4e302fa77b
[  213.463700] Code: 66 2e 0f 1f 84 00 00 00 00 00 0f 1f 44 00 00 f3 0f 1e fa 48 8d 05 c5 ea 0d 00 8b 00 85 c0 75 17 b8 01 00 00 00 c5 fc 77 0f 05 <48> 3d 00 f0 ff ff 77 55 c3 0f 1f 40 00 41 54 49 89 d4 55 48 89 f5
[  213.468694] RSP: 002b:00007ffc0e4b7b38 EFLAGS: 00000246 ORIG_RAX: 0000000000000001
[  213.470865] RAX: ffffffffffffffda RBX: 000000000000000d RCX: 00007f4e302fa77b
[  213.472683] RDX: 000000000000000d RSI: 000000000116cb90 RDI: 0000000000000001
[  213.473765] RBP: 000000000116cb90 R08: 000000000000000a R09: 00007f4e301e7740
[  213.475370] R10: 00000000004c5517 R11: 0000000000000246 R12: 00007f4e303d45c0
[  213.475865] R13: 000000000000000d R14: 00007f4e303d53c0 R15: 00007f4e303d47c0
[  213.476972] ---[ end trace f1f20182276dc095 ]---
[  213.478420] ------------[ cut here ]------------
[  213.478753] WARN_ON(atomic_read(&dev_priv->runtime_pm.wakeref_count))
[  213.478763] WARNING: CPU: 3 PID: 952 at drivers/gpu/drm/i915/i915_drv.c:1806 i915_driver_unload+0x135/0x140
[  213.481187] Modules linked in: vfio_pci vfio_virqfd cmac rfcomm bnep fuse snd_soc_skl snd_soc_hdac_hda snd_hda_ext_core snd_soc_skl_ipc iwlmvm snd_soc_sst_ipc snd_soc_sst_dsp mac80211 snd_soc_acpi_intel_match snd_soc_acpi snd_soc_core snd_compress btusb btrtl snd_pcm_dmaengine iwlwifi btbcm ac97_bus btintel bluetooth snd_hda_codec_hdmi ecdh_generic rtsx_pci_sdmmc snd_hda_codec_realtek rtsx_pci_ms memstick snd_hda_codec_generic intel_wmi_thunderbolt wmi_bmof ledtrig_audio e1000e intel_xhci_usb_role_switch cfg80211 rfkill snd_hda_intel wmi rtsx_pci roles intel_pch_thermal snd_hda_codec snd_hda_core mei_me mei snd_hwdep snd_pcm snd_timer snd pcc_cpufreq i2c_i801 soundcore
[  213.495443] CPU: 3 PID: 952 Comm: bash Tainted: G      D W         5.0.2-717.native #1
[  213.497512] Hardware name: Intel Corporation NUC7i7BNH/NUC7i7BNB, BIOS BNKBL357.86A.0050.2017.0816.2002 08/16/2017
[  213.499331] RIP: 0010:i915_driver_unload+0x135/0x140
[  213.500611] Code: c1 89 c6 89 c7 c3 48 c7 c7 c8 2a 61 8c e8 a3 2c fe ff e9 65 ff ff ff 48 c7 c6 b0 2b 61 8c 48 c7 c7 29 e2 5e 8c e8 8b 0f 85 ff <0f> 0b eb c9 0f 1f 80 00 00 00 00 55 48 89 e5 41 56 41 55 41 89 f5
[  213.505258] RSP: 0018:ffffa9d50527fcf8 EFLAGS: 00010292
[  213.506564] RAX: 0000000000000000 RBX: ffff8be459188000 RCX: 0000000000000000
[  213.508363] RDX: 0000000000000000 RSI: 0000000000000000 RDI: 0000000000000000
[  213.510164] RBP: ffffa9d50527fd08 R08: 0000000000000000 R09: 0000000000000000
[  213.512126] R10: 0000000000000000 R11: 0000000000000000 R12: ffff8be45abf7000
[  213.513932] R13: ffffffff8c91b220 R14: ffffffff8c91b290 R15: 0000000000000060
[  213.515742] FS:  00007f4e301e7740(0000) GS:ffff8be45eb80000(0000) knlGS:0000000000000000
[  213.517803] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[  213.519322] CR2: 00007f8262aed908 CR3: 0000000878f58004 CR4: 00000000003606e0
[  213.520924] Call Trace:
[  213.521114]  i915_pci_remove+0x1e/0x40
[  213.522122]  pci_device_remove+0x39/0xc0
[  213.523150]  device_release_driver_internal+0x189/0x250
[  213.524614]  device_release_driver+0xd/0x20
[  213.525666]  unbind_store+0xb8/0x160
[  213.526661]  drv_attr_store+0x22/0x50
[  213.527661]  sysfs_kf_write+0x36/0x60
[  213.528663]  kernfs_fop_write+0x131/0x1b0
[  213.529692]  __vfs_write+0x26/0x190
[  213.530680]  ? _cond_resched+0x14/0x30
[  213.531688]  vfs_write+0xaa/0x1b0
[  213.532659]  ksys_write+0x41/0xa0
[  213.533622]  __x64_sys_write+0x15/0x20
[  213.534637]  do_syscall_64+0x55/0x1b0
[  213.535640]  entry_SYSCALL_64_after_hwframe+0x44/0xa9
[  213.536938] RIP: 0033:0x7f4e302fa77b
[  213.537934] Code: 66 2e 0f 1f 84 00 00 00 00 00 0f 1f 44 00 00 f3 0f 1e fa 48 8d 05 c5 ea 0d 00 8b 00 85 c0 75 17 b8 01 00 00 00 c5 fc 77 0f 05 <48> 3d 00 f0 ff ff 77 55 c3 0f 1f 40 00 41 54 49 89 d4 55 48 89 f5
[  213.542922] RSP: 002b:00007ffc0e4b7b38 EFLAGS: 00000246 ORIG_RAX: 0000000000000001
[  213.544200] RAX: ffffffffffffffda RBX: 000000000000000d RCX: 00007f4e302fa77b
[  213.546163] RDX: 000000000000000d RSI: 000000000116cb90 RDI: 0000000000000001
[  213.547975] RBP: 000000000116cb90 R08: 000000000000000a R09: 00007f4e301e7740
[  213.549944] R10: 00000000004c5517 R11: 0000000000000246 R12: 00007f4e303d45c0
[  213.551916] R13: 000000000000000d R14: 00007f4e303d53c0 R15: 00007f4e303d47c0
[  213.553727] ---[ end trace f1f20182276dc096 ]---
[  271.218871] BUG: unable to handle kernel NULL pointer dereference at 0000000000000000
[  271.219408] #PF error: [normal kernel read fault]
[  271.219732] PGD 0 P4D 0
[  271.219913] Oops: 0000 [#2] SMP PTI
[  271.220156] CPU: 2 PID: 568 Comm: dconf worker Tainted: G      D W         5.0.2-717.native #1
[  271.220739] Hardware name: Intel Corporation NUC7i7BNH/NUC7i7BNB, BIOS BNKBL357.86A.0050.2017.0816.2002 08/16/2017
[  271.221438] RIP: 0010:ida_free+0xac/0x110
[  271.221715] Code: 24 01 48 8d 7d b0 e8 03 03 01 00 48 8b 7d b0 4c 89 ee e8 c7 12 02 00 48 83 c4 30 31 c0 5b 41 5c 41 5d 41 5e 5d 89 c6 89 c7 c3 <4c> 0f a3 30 73 39 4c 0f b3 30 48 8d 7d b0 31 f6 e8 1f 00 01 00 be
[  271.222971] RSP: 0018:ffffa9d50417fa90 EFLAGS: 00010046
[  271.223329] RAX: 0000000000000000 RBX: 0000000000000003 RCX: 0000000000000000
[  271.223815] RDX: 0000000000000000 RSI: 0000000000000000 RDI: 0000000000000000
[  271.224300] RBP: ffffa9d50417fae0 R08: 0000000000000000 R09: 0000000000000000
[  271.224784] R10: 0000000000000000 R11: 0000000000000000 R12: 0000000000000000
[  271.225269] R13: 0000000000000206 R14: 0000000000000003 R15: ffff8be44a457930
[  271.225754] FS:  00007fa0355d1700(0000) GS:ffff8be45eb00000(0000) knlGS:0000000000000000
[  271.226302] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[  271.226695] CR2: 0000000000000000 CR3: 000000082f80e001 CR4: 00000000003606e0
[  271.227179] Call Trace:
[  271.227355]  release_hw_id+0x45/0x90
[  271.227603]  ? destroy_kernel_context+0x30/0x30
[  271.227916]  context_close+0x1e/0x160
[  271.228171]  ? call_rcu+0x10/0x20
[  271.228436]  ? destroy_kernel_context+0x30/0x30
[  271.228750]  context_idr_cleanup+0xc/0x20
[  271.229029]  idr_for_each+0x4d/0xb0
[  271.229274]  i915_gem_context_close+0x21/0x40
[  271.229577]  i915_driver_postclose+0x2c/0x60
[  271.229875]  drm_file_free.part.0+0x22d/0x320
[  271.230177]  drm_release+0x9c/0x100
[  271.230422]  __fput+0xaf/0x210
[  271.230637]  ____fput+0x9/0x10
[  271.230854]  task_work_run+0x87/0xb0
[  271.231104]  do_exit+0x304/0xba0
[  271.231331]  do_group_exit+0x36/0xa0
[  271.231581]  get_signal+0x12f/0x6c0
[  271.231827]  do_signal+0x32/0x630
[  271.232060]  ? security_file_permission+0x9c/0x100
[  271.232392]  ? vfs_read+0x136/0x160
[  271.232636]  exit_to_usermode_loop+0x9b/0xc0
[  271.232933]  do_syscall_64+0x180/0x1b0
[  271.233196]  entry_SYSCALL_64_after_hwframe+0x44/0xa9
[  271.233545] RIP: 0033:0x7fa03bb9cf44
[  271.233798] Code: Bad RIP value.
[  271.234025] RSP: 002b:00007fa0355d0920 EFLAGS: 00000293 ORIG_RAX: 0000000000000007
[  271.234541] RAX: fffffffffffffdfc RBX: 00000000007f7a30 RCX: 00007fa03bb9cf44
[  271.235028] RDX: 00000000ffffffff RSI: 0000000000000001 RDI: 00000000007f7a30
[  271.235515] RBP: 0000000000000001 R08: 0000000000000000 R09: 0000000000000001
[  271.236003] R10: 00000000008092c0 R11: 0000000000000293 R12: 00000000ffffffff
[  271.236491] R13: 00000000007f7a30 R14: 00007fa03b510f40 R15: 0000000000000001
[  271.236979] Modules linked in: vfio_pci vfio_virqfd cmac rfcomm bnep fuse snd_soc_skl snd_soc_hdac_hda snd_hda_ext_core snd_soc_skl_ipc iwlmvm snd_soc_sst_ipc snd_soc_sst_dsp mac80211 snd_soc_acpi_intel_match snd_soc_acpi snd_soc_core snd_compress btusb btrtl snd_pcm_dmaengine iwlwifi btbcm ac97_bus btintel bluetooth snd_hda_codec_hdmi ecdh_generic rtsx_pci_sdmmc snd_hda_codec_realtek rtsx_pci_ms memstick snd_hda_codec_generic intel_wmi_thunderbolt wmi_bmof ledtrig_audio e1000e intel_xhci_usb_role_switch cfg80211 rfkill snd_hda_intel wmi rtsx_pci roles intel_pch_thermal snd_hda_codec snd_hda_core mei_me mei snd_hwdep snd_pcm snd_timer snd pcc_cpufreq i2c_i801 soundcore
[  271.240962] CR2: 0000000000000000
[  271.241195] ---[ end trace f1f20182276dc097 ]---
[  271.332918] RIP: 0010:mutex_unlock+0x12/0x30
[  271.333273] Code: 70 10 e8 01 47 42 ff eb ab 66 66 2e 0f 1f 84 00 00 00 00 00 0f 1f 40 00 55 31 c9 65 48 8b 14 25 c0 4d 01 00 48 89 e5 48 89 d0 <f0> 48 0f b1 0f 48 39 c2 74 05 e8 df fe ff ff 5d 31 c0 89 c2 89 c1
[  271.334617] RSP: 0018:ffffa9d503f67908 EFLAGS: 00010246
[  271.334977] RAX: ffff8be44e5da040 RBX: ffff8be4596dd108 RCX: 0000000000000000
[  271.335463] RDX: ffff8be44e5da040 RSI: 0000000000000000 RDI: 0000000000005158
[  271.335949] RBP: ffffa9d503f67908 R08: 0000000000000000 R09: 0000000000000000
[  271.336437] R10: 0000000000000000 R11: 0000000000000000 R12: 0000000000000000
[  271.336962] R13: 0000000000000002 R14: 0000000000000004 R15: ffff8be459188000
[  271.337450] FS:  00007fa0355d1700(0000) GS:ffff8be45eb00000(0000) knlGS:0000000000000000
[  271.338038] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[  271.338433] CR2: 00007fa03bb9cf1a CR3: 0000000878cf0006 CR4: 00000000003606e0
[  271.338958] Fixing recursive fault but reboot is needed!

@TerrenceXu
Copy link
Author

Just to double check for errors. I went back and did the following:

First, reboot with iommu enabled:
reboot with intel_iommu=on

Next I disabled graphical target:
sudo systemctl mask gdm
sudo systemctl isolate multi-user.target

Then I loaded the modules:
sudo modprobe vfio
sudo modprobe vfio_pci

Then I unbound the i915:
sudo su
echo -n auto > /sys/bus/pci/devices/0000:00:02.0/power/control
echo -n "0000:00:02.0" > /sys/bus/pci/drivers/i915/unbind

dmesg was clean and my remote shell to the laptop is functional, no lockups or kernel oops.

[ 236.133164] calling vfio_virqfd_init+0x0/0x1000 [vfio_virqfd] @ 1247
[ 236.133273] initcall vfio_virqfd_init+0x0/0x1000 [vfio_virqfd] returned 0 after 103 usecs
[ 236.135225] calling vfio_pci_init+0x0/0x15b [vfio_pci] @ 1247
[ 236.135250] initcall vfio_pci_init+0x0/0x15b [vfio_pci] returned 0 after 22 usecs
[ 247.322641] Console: switching to colour dummy device 80x25

Sean
I also can reproduce this issue follow your steps. :(

@TerrenceXu
Copy link
Author

@seanvk @gvancuts
After we upgraded the ClearLinux version from 28320 to 28360, we cannot reproduce this issue when host is in text mode, now we only can reproduce it when host GDM is running.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

11 participants