Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

DRM Page Fault #96

Closed
dwrobison opened this issue Oct 4, 2016 · 13 comments
Closed

DRM Page Fault #96

dwrobison opened this issue Oct 4, 2016 · 13 comments

Comments

@dwrobison
Copy link

Running TrueOS 12.x:

FreeBSD lefty 12.0-CURRENT FreeBSD 12.0-CURRENT #9 1b52c82(drm-next-4.7): Mon Sep 12 21:06:48 UTC 2016 root@gauntlet:/usr/obj/usr/src/sys/GENERIC amd64

Running VirtualBox and Kodi, got this page fault.

Can't upload my poor quality images. Here's two links:

http://flag.blackened.net/daver/freebsd/dump_1.JPG
http://flag.blackened.net/daver/freebsd/dump_2.JPG

@mattmacy
Copy link
Member

mattmacy commented Oct 4, 2016

The printfs are warnings. The crash is in a callout. Without a core or at least a backtrace in ddb there's no reason to attribute the panic to drm.

@dwrobison
Copy link
Author

Didn't get a core as it froze the system completely. Happened twice but the second time I didn't snap photos. I'll try again next time it happens to see if I can get a core dump.

@mattmacy
Copy link
Member

mattmacy commented Oct 4, 2016

Read the wiki for how to improve your odds of getting a core. Trueos does a number of things that reduce debuggability.

@kmoore134
Copy link

matt, what options would help in TrueOS to make debugging easier? Besides enabling coredumps? ;)

@nomadlogic
Copy link

not to speak for matt here since he's def the domain expert but we've
found that upgrading to dwarf4 is helpful:

https://github.com/FreeBSDDesktop/freebsd-base-graphics/wiki/Testing-And-Debugging-Tips#triaging-panics

also these sysctl knobs have def helped me:

debug.debugger_on_panic="0"
dev.drm.skip_ddb="1"

there may be other knobs we can tune now but this did help me squash
some skylake issues a while back.

cheers,
-pete

On Tue, Oct 4, 2016 at 1:19 PM, Kris Moore notifications@github.com wrote:

matt, what options would help in TrueOS to make debugging easier? Besides
enabling coredumps? ;)


You are receiving this because you are subscribed to this thread.
Reply to this email directly, view it on GitHub, or mute the thread.

pete wright
www.nycbug.org
@nomadlogicLA

@mattmacy
Copy link
Member

mattmacy commented Oct 4, 2016

I actually haven't found DWARF4 to be that useful for the kernel :(. I'm actually thinking of KDB/DDB as features that would be useful to have in 12.

@nomadlogic
Copy link

Ah! I'll pull that misinfo from the wiki then :)

-p

On Tue, Oct 4, 2016 at 2:08 PM, Matthew Macy notifications@github.com
wrote:

I actually haven't found DWARF4 to be that useful for the kernel :(. I'm
actually thinking of KDB/DDB as features that would be useful to have in 12.


You are receiving this because you commented.
Reply to this email directly, view it on GitHub
#96 (comment),
or mute the thread
https://github.com/notifications/unsubscribe-auth/AH-D06gy8k_HkuYHM6xZ7Yev85Ir81Qaks5qwsBbgaJpZM4KOE1G
.

pete wright
www.nycbug.org
@nomadlogicLA

@dwrobison
Copy link
Author

Well folks, at any rate here is my dmesg as it may help posterity at some point.

Copyright (c) 1992-2016 The FreeBSD Project.
Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994
The Regents of the University of California. All rights reserved.
FreeBSD is a registered trademark of The FreeBSD Foundation.
FreeBSD 12.0-CURRENT #9 1b52c82(drm-next-4.7): Mon Sep 12 21:06:48 UTC 2016
root@gauntlet:/usr/obj/usr/src/sys/GENERIC amd64
clang version 3.8.1 (tags/RELEASE_381/final)
VT(vga): resolution 640x480
CPU: Intel(R) Core(TM) i5 CPU M 520 @ 2.40GHz (2394.06-MHz K8-class CPU)
Origin="GenuineIntel" Id=0x20652 Family=0x6 Model=0x25 Stepping=2
Features=0xbfebfbff<FPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,CLFLUSH,DTS,ACPI,MMX,FXSR,SSE,SSE2,SS,HTT,TM,PBE>
Features2=0x298e3ff<SSE3,PCLMULQDQ,DTES64,MON,DS_CPL,VMX,SMX,EST,TM2,SSSE3,CX16,xTPR,PDCM,SSE4.1,SSE4.2,POPCNT,AESNI>
AMD Features=0x28100800<SYSCALL,NX,RDTSCP,LM>
AMD Features2=0x1
VT-x: PAT,HLT,MTF,PAUSE,EPT,UG,VPID
TSC: P-state invariant, performance statistics
real memory = 8589934592 (8192 MB)
avail memory = 8187342848 (7808 MB)
Event timer "LAPIC" quality 600
ACPI APIC Table:
FreeBSD/SMP: Multiprocessor System Detected: 4 CPUs
FreeBSD/SMP: 1 package(s) x 2 core(s) x 2 hardware threads
random: unblocking device.
ioapic0 <Version 2.0> irqs 0-23 on motherboard
random: entropy device external interface
unknown: registered evdev provider: System keyboard multiplexer <0>
kbd1 at kbdmux0
netmap: loaded module
module_register_init: MOD_LOAD (vesa, 0xffffffff80fdcb50, 0) error 19
vtvga0: on motherboard
cryptosoft0: on motherboard
aesni0: <AES-CBC,AES-XTS,AES-GCM,AES-ICM> on motherboard
acpi0: on motherboard
acpi0: Power Button (fixed)
cpu0: on acpi0
cpu1: on acpi0
cpu2: on acpi0
cpu3: on acpi0
hpet0: iomem 0xfed00000-0xfed003ff on acpi0
Timecounter "HPET" frequency 14318180 Hz quality 950
Event timer "HPET" frequency 14318180 Hz quality 550
Event timer "HPET1" frequency 14318180 Hz quality 440
Event timer "HPET2" frequency 14318180 Hz quality 440
Event timer "HPET3" frequency 14318180 Hz quality 440
Event timer "HPET4" frequency 14318180 Hz quality 440
atrtc0: port 0x70-0x77 irq 8 on acpi0
atrtc0: Warning: Couldn't map I/O.
Event timer "RTC" frequency 32768 Hz quality 0
attimer0: port 0x40-0x43,0x50-0x53 irq 0 on acpi0
Timecounter "i8254" frequency 1193182 Hz quality 0
Event timer "i8254" frequency 1193182 Hz quality 100
Timecounter "ACPI-fast" frequency 3579545 Hz quality 900
acpi_timer0: <24-bit timer at 3.579545MHz> port 0x408-0x40b on acpi0
acpi_ec0: <Embedded Controller: GPE 0x10> port 0x930,0x934 on acpi0
pcib0: port 0xcf8-0xcff on acpi0
pcib0: _OSC returned error 0x4
pci0: on pcib0
vgapci0: port 0x7110-0x7117 mem 0xf0000000-0xf03fffff,0xe0000000-0xefffffff irq 16 at device 2.0 on pci0
agp0: <Intel Ironlake (M) SVGA controller> on vgapci0
agp0: aperture size is 256M, detected 32764k stolen memory
vgapci0: Boot video device
em0: <Intel(R) PRO/1000 Network Connection 7.6.1-k> port 0x7020-0x703f mem 0xf6900000-0xf691ffff,0xf6970000-0xf6970fff irq 20 at device 25.0 on pci0
em0: Using an MSI interrupt
em0: Ethernet address: 00:26:b9:d7:c9:04
em0: netmap queues/slots: TX 1/1024, RX 1/1024
ehci0: <Intel PCH USB 2.0 controller USB-B> mem 0xf6960000-0xf69603ff irq 16 at device 26.0 on pci0
usbus0: EHCI version 1.0
usbus0 on ehci0
hdac0: <Intel 5 Series/3400 Series HDA Controller> mem 0xf6950000-0xf6953fff irq 22 at device 27.0 on pci0
pcib1: irq 16 at device 28.0 on pci0
pcib1: [GIANT-LOCKED]
pcib2: irq 17 at device 28.1 on pci0
pcib2: [GIANT-LOCKED]
pci1: on pcib2
iwn0: <Intel Centrino Advanced-N 6200> mem 0xf4100000-0xf4101fff irq 17 at device 0.0 on pci1
pcib3: irq 18 at device 28.2 on pci0
pcib3: failed to allocate initial memory window: 0xf0400000-0xf2cfffff
pcib3: [GIANT-LOCKED]
pci2: on pcib3
cbb0: mem 0xf2c40000-0xf2c40fff irq 18 at device 0.0 on pci2
cardbus0: on cbb0
pccard0: <16-bit PCCard bus> on cbb0
sdhci_pci0: mem 0xf2c30000-0xf2c300ff irq 19 at device 0.1 on pci2
sdhci_pci0: 1 slot(s) allocated
pci2: <serial bus, FireWire> at device 0.4 (no driver attached)
pcib4: irq 19 at device 28.3 on pci0
pcib4: [GIANT-LOCKED]
ehci1: <Intel PCH USB 2.0 controller USB-A> mem 0xf6940000-0xf69403ff irq 17 at device 29.0 on pci0
usbus1: EHCI version 1.0
usbus1 on ehci1
pcib5: at device 30.0 on pci0
pci3: on pcib5
isab0: at device 31.0 on pci0
isa0: on isab0
atapci0: <Intel 5 Series/3400 Series PCH SATA300 controller> port 0x70f0-0x70f7,0x70e0-0x70e3,0x70d0-0x70d7,0x70c0-0x70c3,0x70b0-0x70bf,0x70a0-0x70af irq 18 at device 31.2 on pci0
ata2: at channel 0 on atapci0
ata3: at channel 1 on atapci0
atapci1: <Intel 5 Series/3400 Series PCH SATA300 controller> port 0x7090-0x7097,0x7080-0x7083,0x7070-0x7077,0x7060-0x7063,0x7050-0x705f,0x7040-0x704f irq 18 at device 31.5 on pci0
ata4: at channel 0 on atapci1
ata5: at channel 1 on atapci1
pcib6: on acpi0
pci4: on pcib6
acpi_lid0: on acpi0
acpi_button0: on acpi0
acpi_button1: on acpi0
acpi_acad0: on acpi0
battery0: on acpi0
battery1: on acpi0
atkbdc0: <Keyboard controller (i8042)> port 0x60,0x64 irq 1 on acpi0
atkbd0: irq 1 on atkbdc0
atkbd0: registered evdev provider: AT Keyboard <0>
kbd0 at atkbd0
atkbd0: [GIANT-LOCKED]
ppc1: cannot reserve I/O port range
psm0: <PS/2 Mouse> irq 12 on atkbdc0
psm0: [GIANT-LOCKED]
psm0: registered evdev provider: Generic PS/2 mouse <0>
psm0: model Generic PS/2 mouse, device ID 0
orm0: at iomem 0xcf000-0xcffff on isa0
ppc0: cannot reserve I/O port range
est0: on cpu0
est1: on cpu1
est2: on cpu2
est3: on cpu3
usbus0: 480Mbps High Speed USB v2.0
usbus1: 480Mbps High Speed USB v2.0
ZFS filesystem version: 5
ZFS storage pool version: features support (5000)
Timecounters tick every 1.000 msec
hdacc0: <IDT 92HD81B1C HDA CODEC> at cad 0 on hdac0
hdaa0: <IDT 92HD81B1C Audio Function Group> at nid 1 on hdacc0
pcm0: <IDT 92HD81B1C (Analog 2.0+HP/2.0)> at nid 13,11 and 10,15 on hdaa0
pcm1: <IDT 92HD81B1C (Analog)> at nid 14 and 17 on hdaa0
hdacc1: at cad 3 on hdac0
hdaa1: at nid 1 on hdacc1
pcm2: <Intel Ibex Peak (HDMI/DP 8ch)> at nid 4 on hdaa1
pcm3: <Intel Ibex Peak (HDMI/DP 8ch)> at nid 5 on hdaa1
ugen1.1: at usbus1
uhub0: <Intel EHCI root HUB, class 9/0, rev 2.00/1.00, addr 1> on usbus1
ugen0.1: at usbus0
uhub1: <Intel EHCI root HUB, class 9/0, rev 2.00/1.00, addr 1> on usbus0
uhub0: 3 ports with 3 removable, self powered
uhub1: 3 ports with 3 removable, self powered
ada0 at ata2 bus 0 scbus0 target 0 lun 0
ada0: ACS-2 ATA SATA 3.x device
ada0: Serial Number 152724400380
ada0: 300.000MB/s transfers (SATA 2.x, UDMA5, PIO 512bytes)
ada0: 457862MB (937703088 512 byte sectors)
SMP: AP CPU #1 Launched!
SMP: AP CPU #3 Launched!
SMP: AP CPU #2 Launched!
cd0 at ata3 bus 0 scbus1 target 0 lun 0
cd0: <HL-DT-ST DVD+-RW GU10N A102> Removable CD-ROM SCSI device
cd0: Serial Number KW5A5HM1606
cd0: 150.000MB/s transfers (SATA 1.x, UDMA5, ATAPI 12bytes, PIO 8192bytes)
cd0: Attempt to query device size failed: NOT READY, Medium not present - tray closed
Timecounter "TSC-low" frequency 1197030501 Hz quality 1000
Trying to mount root from zfs:tank/ROOT/12.0-CURRENT-up-20160923_131108 []...
ugen0.2: <vendor 0x8087> at usbus0
uhub2: <vendor 0x8087 product 0x0020, class 9/0, rev 2.00/0.00, addr 2> on usbus0
ugen1.2: <vendor 0x8087> at usbus1
uhub3: <vendor 0x8087 product 0x0020, class 9/0, rev 2.00/0.00, addr 2> on usbus1
uhub2: 6 ports with 6 removable, self powered
uhub3: 8 ports with 8 removable, self powered
GEOM_ELI: Device ada0p2.eli created.
GEOM_ELI: Encryption: AES-XTS 128
GEOM_ELI: Crypto: hardware
ugen1.3: at usbus1
ukbd0: <Logitech USB Receiver, class 0/0, rev 1.10/22.40, addr 3> on usbus1
kbd2 at ukbd0
ukbd0: registered evdev provider: Logitech USB Receiver, class 0/0, rev 1.10/22.40, addr 3 <0>
Root mount waiting for: usbus1
ugen1.4: at usbus1
Root mount waiting for: usbus1
ugen1.5: at usbus1
GEOM_ELI: Device label/swap0.eli created.
GEOM_ELI: Encryption: AES-XTS 128
GEOM_ELI: Crypto: hardware
Cuse v0.1.34 @ /dev/cuse
acpi_video0: on vgapci0
fuse-freebsd: version 0.4.4, FUSE ABI 7.8
ums0: <Logitech USB Receiver, class 0/0, rev 1.10/22.40, addr 3> on usbus1
ums0: 10 buttons and [XYZT] coordinates ID=2
ums0: registered evdev provider: Logitech USB Receiver, class 0/0, rev 1.10/22.40, addr 3 <0>
[drm] Initialized drm 1.1.0 20060810
drmn0: on vgapci0
vgapci0: child drmn0 requested pci_enable_io
vgapci0: child drmn0 requested pci_enable_io
[drm] Memory usable by graphics device = 2048M
Failed to add WC MTRR for [0xe0000000-0xefffffff]: -45; performance may suffer
[drm] Supports vblank timestamp caching Rev 2 (21.10.2013).
[drm] Driver supports precise vblank timestamp query.
[drm] Connector VGA-1: get mode from tunables:
[drm] - kern.vt.fb.modes.VGA-1
[drm] - kern.vt.fb.default_mode
[drm] Connector eDP-1: get mode from tunables:
[drm] - kern.vt.fb.modes.eDP-1
[drm] - kern.vt.fb.default_mode
[drm] Connector HDMI-A-1: get mode from tunables:
[drm] - kern.vt.fb.modes.HDMI-A-1
[drm] - kern.vt.fb.default_mode
[drm] Connector DP-1: get mode from tunables:
[drm] - kern.vt.fb.modes.DP-1
[drm] - kern.vt.fb.default_mode
[drm] Connector HDMI-A-2: get mode from tunables:
[drm] - kern.vt.fb.modes.HDMI-A-2
[drm] - kern.vt.fb.default_mode
[drm] Connector HDMI-A-3: get mode from tunables:
[drm] - kern.vt.fb.modes.HDMI-A-3
[drm] - kern.vt.fb.default_mode
[drm] Connector DP-2: get mode from tunables:
[drm] - kern.vt.fb.modes.DP-2
[drm] - kern.vt.fb.default_mode
[drm] Connector DP-3: get mode from tunables:
[drm] - kern.vt.fb.modes.DP-3
[drm] - kern.vt.fb.default_mode
[drm] RC6 disabled, disabling runtime PM support
[drm] Initialized i915 1.6.0 20160425 for drmn on minor 0
VT: Replacing driver "vga" with new "fb".
start FB_INFO:
type=11 height=900 width=1600 depth=32
cmsize=16 size=9216000
pbase=0xe0047000 vbase=0xfffff800e0047000
name=drmn0 flags=0x0 stride=7680 bpp=32
cmap[0]=0 cmap[1]=7f0000 cmap[2]=7f00 cmap[3]=c4a000
end FB_INFO
drmn0: fb0: inteldrmfb frame buffer device
wlan0: Ethernet address: 00:23:14:8d:87:e0
em0: link state changed to UP
iwn0: iwn_read_firmware: ucode rev=0x09dd0401
ubt0: <Dell Computer Corp DW375 Bluetooth Module, class 224/1, rev 2.00/5.17, addr 4> on usbus1
WARNING: attempt to domain_add(bluetooth) after domainfinalize()
WARNING: attempt to domain_add(netgraph) after domainfinalize()
ipfw2 (+ipv6) initialized, divert loadable, nat loadable, default to accept, logging disabled

@mattmacy
Copy link
Member

mattmacy commented Oct 4, 2016

Ahh - this is pre-SB. In that case it probably is DRM. I'll pull out my ancient Dell to see if I can reproduce.

@dwrobison
Copy link
Author

Yes my trusty Dell E6510 laptop.

Thanks for the time and attention.

@kmoore134
Copy link

Ok, I've enabled DDB now (KDB was already there)

@dwrobison
Copy link
Author

Had 3 or 4 more of these crashes. Usually after I leave kodi up for hours.

Got this error last night but no crash:

Oct 25 21:50:16 lefty kernel: [drm:0xffffffff82b47687s] ERROR Atomic update failure on pipe A (start=672137 end=672138) time 0 us, min 894, max 899, scanline start 893, end 899
Oct 26 00:36:21 lefty kernel: [drm:0xffffffff82b47687s] ERROR Atomic update failure on pipe A (start=1270023 end=1270024) time 0 us, min 894, max 899, scanline start 893, end 900

@hselasky
Copy link

hselasky commented Feb 8, 2017

Hi,

Can you re-test with the latest drm-next?

--HPS

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

5 participants