GT650M: Failed to initialize NVIDIA GPU #172

Closed
joeythesaint opened this Issue Jun 2, 2012 · 87 comments
@joeythesaint
baseboard-manufacturer: CLEVO CO.
baseboard-product-name: W110ER                          
baseboard-version     : N/A                             
system-manufacturer   : CLEVO CO.                       
system-product-name   : W110ER                          
system-version        : N/A                             
bios-vendor           : American Megatrends Inc.
bios-version          : 4.6.5
bios-release-date     : 04/26/2012

The "Failed to initialize NVIDIA GPU" error appears in syslog from bumblebeed / Xorg.8 whenever I try to use optirun on anything.  It doesn't appear to be a problem with acpi options on my command line:

% cat /proc/cmdline 
BOOT_IMAGE=/boot/vmlinuz-3.2.0-24-generic root=UUID=9b0ea8db-ffb8-451e-a38b-485142cd15dc ro quiet splash vt.handoff=7

I am not seeing an error of the form "Error inserting nvidia_current..." but this command:

lspci -d 10de: -vvnn

Definitely produces the expected error:

01:00.0 VGA compatible controller [0300]: NVIDIA Corporation Device [10de:0fd1] (rev ff) (prog-if ff)
    !!! Unknown header type 7f
    Kernel driver in use: nvidia

I'm not really sure what to do next: I don't know which of the scripts described in the troubleshooting section I should be looking for to enable my card. I'll mail the logs gathered by bumblebee-bugreport to the mailing list now.
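
For reference, the `rev ff` and `Unknown header type 7f` values in the lspci output above are what shows up when every read from the device's PCI config space returns 0xff, which usually means the card is powered down or not responding. A minimal sketch of that check, assuming the card sits at 0000:01:00.0 and that the standard sysfs `config` file is readable; both helper names are made up for illustration:

```python
from pathlib import Path


def config_space_is_dead(data: bytes) -> bool:
    """Return True when every byte read from PCI config space is 0xff."""
    return len(data) > 0 and all(b == 0xFF for b in data)


def read_config_header(bdf: str = "0000:01:00.0", length: int = 64) -> bytes:
    """Read the first bytes of config space from sysfs.

    The default bus address is an assumption taken from the lspci output.
    """
    return Path(f"/sys/bus/pci/devices/{bdf}/config").read_bytes()[:length]
```

Calling `config_space_is_dead(read_config_header())` right after an optirun attempt would give a quick yes/no answer without having to read the lspci dump by eye.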

@Lekensteyn
Bumblebee-Project member

Messages like:

CPU0: Package power limit notification (total events = 1)

and

NVRM: GPU at 0000:01:00.0 has fallen off the bus.

look worrying. Can you try setting PMMethod=none in /etc/bumblebee/bumblebee.conf and rebooting?
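
To confirm which value is actually set after the edit, something like the following could list PMMethod per driver section. This is a rough sketch, assuming bumblebee.conf is INI-style with `[driver-nvidia]` and `[driver-nouveau]` sections; the sample text is illustrative and not copied from anyone's system:

```python
import configparser

# Illustrative sample only; the real file lives at /etc/bumblebee/bumblebee.conf.
SAMPLE_CONF = """\
[bumblebeed]
Driver=nvidia

[driver-nvidia]
KernelDriver=nvidia-current
PMMethod=none

[driver-nouveau]
KernelDriver=nouveau
PMMethod=auto
"""


def pm_methods(conf_text: str) -> dict:
    """Map each section that defines PMMethod to its value."""
    parser = configparser.ConfigParser(interpolation=None)
    parser.optionxform = str  # keep the original key case (PMMethod)
    parser.read_string(conf_text)
    return {
        name: parser[name]["PMMethod"]
        for name in parser.sections()
        if parser.has_option(name, "PMMethod")
    }
```

Passing the real file's contents instead of `SAMPLE_CONF` shows at a glance whether the change landed in the section for the driver that is in use.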

@ArchangeGabriel
Bumblebee-Project member

I've got three users on the French forums with much the same problem. An extract from the syslog follows:

First attempt of optirun:

kernel: [  151.013036] bbswitch: enabling discrete graphics
bumblebeed[1061]: The PCI config space could not be written fully at offset 0x1; 0 bytes have been written; error: Resource temporarily unavailable
kernel: [  151.421348] pci 0000:01:00.0: power state changed by ACPI to D0
kernel: [  151.421353] pci 0000:01:00.0: power state changed by ACPI to D0
kernel: [  151.421370] pci 0000:01:00.0: restoring config space at offset 0xc (was 0x0, writing 0xfff80000)
kernel: [  151.421374] pci 0000:01:00.0: restoring config space at offset 0x9 (was 0x1, writing 0x3001)
kernel: [  151.421378] pci 0000:01:00.0: restoring config space at offset 0x7 (was 0xc, writing 0xb000000c)
kernel: [  151.421382] pci 0000:01:00.0: restoring config space at offset 0x5 (was 0xc, writing 0xa000000c)
kernel: [  151.421385] pci 0000:01:00.0: restoring config space at offset 0x4 (was 0x0, writing 0xd0000000)
kernel: [  151.421388] pci 0000:01:00.0: restoring config space at offset 0x3 (was 0x0, writing 0x10)
kernel: [  151.421392] pci 0000:01:00.0: restoring config space at offset 0x1 (was 0x100000, writing 0x100006)
kernel: [  151.421404] pci 0000:01:00.0: power state changed by ACPI to D0
kernel: [  151.421406] pci 0000:01:00.0: power state changed by ACPI to D0
kernel: [  151.421410] pci 0000:01:00.0: enabling device (0006 -> 0007)
kernel: [  151.421415] pci 0000:01:00.0: PCI INT A -> GSI 16 (level, low) -> IRQ 16
kernel: [  151.421419] pci 0000:01:00.0: setting latency timer to 64
kernel: [  151.459137] nvidia: module license 'NVIDIA' taints kernel.
kernel: [  151.459140] Disabling lock debugging due to kernel taint
kernel: [  151.496381] nvidia 0000:01:00.0: power state changed by ACPI to D0
kernel: [  151.496385] nvidia 0000:01:00.0: power state changed by ACPI to D0
kernel: [  151.496389] nvidia 0000:01:00.0: enabling device (0006 -> 0007)
kernel: [  151.496394] nvidia 0000:01:00.0: PCI INT A -> GSI 16 (level, low) -> IRQ 16
kernel: [  151.496400] nvidia 0000:01:00.0: setting latency timer to 64
kernel: [  151.496405] vgaarb: device changed decodes: PCI:0000:01:00.0,olddecodes=io+mem,decodes=none:owns=none
kernel: [  151.496474] NVRM: loading NVIDIA UNIX x86_64 Kernel Module  295.53  Fri May 11 23:28:10 PDT 2012
acpid: client connected from 2318[0:1001]
acpid: 1 client rule loaded
kernel: [  159.041876] NVRM: GPU at 0000:01:00.0 has fallen off the bus.
kernel: [  159.041898] NVRM: GPU at 0000:01:00.0 has fallen off the bus.
bumblebeed[1061]: [XORG] (EE) NVIDIA(0): Failed to initialize the NVIDIA GPU at PCI:1:0:0.  Please
bumblebeed[1061]: [XORG] (EE) NVIDIA(0):     check your system's kernel log for additional error
bumblebeed[1061]: [XORG] (EE) NVIDIA(0):     messages and refer to Chapter 8: Common Problems in the
bumblebeed[1061]: [XORG] (EE) NVIDIA(0):     README for additional information.
bumblebeed[1061]: [XORG] (EE) NVIDIA(0): Failed to initialize the NVIDIA graphics device!
bumblebeed[1061]: [XORG] (EE) Screen(s) found, but none have a usable configuration.
bumblebeed[1061]: X did not start properly
kernel: [  159.299492] NVRM: RmInitAdapter failed! (0x26:0xffffffff:1170)
kernel: [  159.299499] NVRM: rm_init_adapter(0) failed

All further attempts during the same session:

kernel: [ 2542.461516] bbswitch: enabling discrete graphics
kernel: [ 2542.461534] nvidia 0000:01:00.0: power state changed by ACPI to D0
bumblebeed[1061]: Could not enable discrete graphics card
kernel: [ 2542.474149] nvidia 0000:01:00.0: Refused to change power state, currently in D3
kernel: [ 2542.474158] nvidia 0000:01:00.0: power state changed by ACPI to D0

I will tell them to try PMMethod=none.
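
Since several people are comparing logs, a small script can flag which of the failure messages from the extract above occur in a given syslog. A minimal sketch; the label names are invented here, and the search strings are copied from the log lines quoted in this thread:

```python
# Labels are hypothetical; the needles come from the syslog extract above.
SIGNATURES = {
    "config_write_failed": "PCI config space could not be written",
    "fell_off_bus": "has fallen off the bus",
    "init_failed": "RmInitAdapter failed",
    "stuck_in_d3": "Refused to change power state",
}


def classify(lines):
    """Return the set of signature labels found in the given log lines."""
    found = set()
    for line in lines:
        for label, needle in SIGNATURES.items():
            if needle in line:
                found.add(label)
    return found
```

Running `classify(open("/var/log/syslog", errors="replace"))` on each affected machine would make it easier to see whether everyone hits the same sequence or only part of it.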

@Lekensteyn
Bumblebee-Project member

Aw, that PCI config space write is nasty. I've sometimes seen that happen with the blob driver. The code needs some refinement, as not everything in that config space needs to be written.

@joeythesaint

Okay, so I switched to PMMethod=none and optirun is still failing with the same apparent errors. In case it helps, this is what's currently installed for nvidia and bumblebee stuff:

% dpkg --list | egrep '(nvidia|bumbl)' | grep ^ii
ii  bumblebee                              3.0-2~preciseppa1                       nVidia Optimus support
ii  bumblebee-nvidia                       3.0-2~preciseppa1                       nVidia Optimus support using the proprietary NVIDIA driver
ii  nvidia-common                          1:0.2.44                                Find obsolete NVIDIA drivers
ii  nvidia-current                         295.53-0ubuntu1~precise~xup1            NVIDIA binary Xorg driver, kernel module and VDPAU library
ii  nvidia-current-dev                     295.53-0ubuntu1~precise~xup1            NVIDIA binary Xorg driver development files
ii  nvidia-settings                        295.53-0ubuntu1~precise~xup1            Tool of configuring the NVIDIA graphics driver

I just got this machine last week so I know I've never tried any of the older bumblebee or ironhide releases, so hopefully that's not an issue. In case it helps I've gathered another bug report, I'll send it to the list.

@Lekensteyn
Bumblebee-Project member

Did you reboot after changing the config?

@joeythesaint

I did some more reading and I'm thinking now it may actually be the binary Nvidia driver that's at fault here. Looking here:

http://www.nvidia.com/object/linux-display-amd64-295.53-driver.html

I don't see support for my specific GPU listed, it's a GT 650M. Sorry if this all turned out to be a false alarm.

@Lekensteyn
Bumblebee-Project member

Now that you mention it, ftp://download.nvidia.com/XFree86/Linux-x86_64/302.11/README/supportedchips.html does not list the PCI ID of your card either.

You may have very basic support with the nouveau driver.
http://nouveau.freedesktop.org/wiki/CodeNames#NVE0
http://nouveau.freedesktop.org/wiki/FeatureMatrix
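
The PCI ID comparison mentioned above can be done mechanically. A minimal sketch, assuming you have pasted `lspci -nn` output into a string and collected the supported device IDs yourself; the set passed in the usage example is a placeholder, not NVIDIA's actual list:

```python
import re

# Matches the [vendor:device] pair printed by `lspci -nn`, e.g. [10de:0fd1].
PCI_ID = re.compile(r"\[([0-9a-f]{4}):([0-9a-f]{4})\]", re.IGNORECASE)


def pci_ids(lspci_output: str):
    """Return (vendor, device) pairs found in lspci -nn output."""
    return [(v.lower(), d.lower()) for v, d in PCI_ID.findall(lspci_output)]


def unsupported(lspci_output: str, supported_devices: set):
    """Return NVIDIA (vendor 10de) device IDs missing from supported_devices."""
    return [
        device
        for vendor, device in pci_ids(lspci_output)
        if vendor == "10de" and device not in supported_devices
    ]
```

For example, `unsupported(line, {"1234"})` with the lspci line from the first post returns `["0fd1"]`, the ID to look for in the driver's supported-chips list.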

@ArchangeGabriel
Bumblebee-Project member

And only with the latest nouveau from git.

And Lekensteyn, FYI, all the French users facing this problem also have a GT650M...

Closing since it's not a bug, but a lack of support from nVidia.

@studentz

Same problem with the same hardware, but with a newer driver.
Machine:
Clevo W110ER
Intel HD Graphics 3000
Nvidia GeForce GT 650M

I installed the nvidia driver 295.59, which supports this new nvidia video card.
Here are the links for the driver and the PPA:

http://www.nvidia.com/object/linux-display-amd64-295.59-driver.html
https://launchpad.net/~upubuntu-com/+archive/nvidia

Here are the log reports:
Xorg.8.log

Kernel command line: BOOT_IMAGE=/boot/vmlinuz-3.2.0-25-generic root=UUID=d867879c-20d0-4c60-86e4-814ccebc8857 ro quiet splash vt.handoff=7

(II) NVIDIA dlloader X Driver  295.59  Wed Jun  6 21:21:24 PDT 2012
(II) NVIDIA Unified Driver for all Supported NVIDIA GPUs
(--) using VT number 7
(II) Loading sub module "fb"
(II) LoadModule: "fb"
(II) Loading /usr/lib/xorg/modules/libfb.so
(II) Module fb: vendor="X.Org Foundation"
    compiled for 1.11.3, module version = 1.0.0
    ABI class: X.Org ANSI C Emulation, version 0.4
(II) Loading sub module "wfb"
(II) LoadModule: "wfb"
(II) Loading /usr/lib/xorg/modules/libwfb.so
(II) Module wfb: vendor="X.Org Foundation"
    compiled for 1.11.3, module version = 1.0.0
    ABI class: X.Org ANSI C Emulation, version 0.4
(II) Loading sub module "ramdac"
(II) LoadModule: "ramdac"
(II) Module "ramdac" already built-in
(II) Loading /usr/lib/nvidia-current/xorg/nvidia_drv.so
(II) Loading /usr/lib/xorg/modules/libwfb.so
(II) Loading /usr/lib/xorg/modules/libfb.so
(II) NVIDIA(0): Creating default Display subsection in Screen section
        "Default Screen Section" for depth/fbbpp 24/32
(==) NVIDIA(0): Depth 24, (==) framebuffer bpp 32
(==) NVIDIA(0): RGB weight 888
(==) NVIDIA(0): Default visual is TrueColor
(==) NVIDIA(0): Using gamma correction (1.0, 1.0, 1.0)
(**) NVIDIA(0): Option "NoLogo" "true"
(**) NVIDIA(0): Option "UseEDID" "false"
(**) NVIDIA(0): Option "ConnectedMonitor" "DFP"
(**) NVIDIA(0): Enabling 2D acceleration
(**) NVIDIA(0): ConnectedMonitor string: "DFP"
(**) NVIDIA(0): Ignoring EDIDs
(EE) NVIDIA(0): Failed to initialize the NVIDIA GPU at PCI:1:0:0.  Please
(EE) NVIDIA(0):     check your system's kernel log for additional error
(EE) NVIDIA(0):     messages and refer to Chapter 8: Common Problems in the
(EE) NVIDIA(0):     README for additional information.
(EE) NVIDIA(0): Failed to initialize the NVIDIA graphics device!
(II) UnloadModule: "nvidia"
(II) Unloading nvidia
(II) UnloadModule: "wfb"
(II) Unloading wfb
(II) UnloadModule: "fb"
(II) Unloading fb
(EE) Screen(s) found, but none have a usable configuration.
Fatal server error:no screens found

sys.log

before using optirun

bumblebeed[1050]: /usr/sbin/bumblebeed 3.0 started
 kernel: [   17.304961] bbswitch: version 0.4.2
 kernel: [   17.304970] bbswitch: Found integrated VGA device 0000:00:02.0: \_SB_.PCI0.GFX0
 kernel: [   17.304980] bbswitch: Found discrete VGA device 0000:01:00.0: \_SB_.PCI0.PEG0.PEGP
 kernel: [   17.305100] bbswitch: detected an Optimus _DSM function
 kernel: [   17.305108] bbswitch: Succesfully loaded. Discrete card 0000:01:00.0 is on
 kernel: [   17.309286] bbswitch: disabling discrete graphics
 kernel: [   17.309588] bbswitch: Result of Optimus _DSM call: 11000059

using optirun

 kernel: [   74.314846] bbswitch: enabling discrete graphics
 kernel: [   74.872140] CPU3: Package power limit notification (total events = 1)
 kernel: [   74.872145] CPU2: Package power limit notification (total events = 1)
 kernel: [   74.872149] CPU1: Package power limit notification (total events = 1)
 kernel: [   74.872152] CPU0: Package power limit notification (total events = 1)
 kernel: [   74.872182] CPU3: Package power limit normal
 kernel: [   74.872185] CPU2: Package power limit normal
 kernel: [   74.872187] CPU1: Package power limit normal
 kernel: [   74.872189] CPU0: Package power limit normal
 goa[2286]: goa-daemon version 3.4.0 starting [main.c:112, main()]
 bluetoothd[923]: Endpoint unregistered: sender=:1.31 path=/MediaEndpoint/HFPAG
 bluetoothd[923]: Endpoint unregistered: sender=:1.31 path=/MediaEndpoint/A2DPSource
 bluetoothd[923]: Endpoint unregistered: sender=:1.31 path=/MediaEndpoint/A2DPSink
 dbus[901]: [system] Activating service name='com.ubuntu.DeviceDriver' (using servicehelper)
 dbus[901]: [system] Successfully activated service 'com.ubuntu.DeviceDriver'
 dbus[901]: [system] Activating service name='com.ubuntu.DeviceDriver' (using servicehelper)
 dbus[901]: [system] Successfully activated service 'com.ubuntu.DeviceDriver'
 dbus[901]: [system] Activating service name='org.freedesktop.PackageKit' (using servicehelper)
 dbus[901]: [system] Activating service name='com.ubuntu.SystemService' (using servicehelper)
 AptDaemon: INFO: Initializing daemon
 dbus[901]: [system] Successfully activated service 'com.ubuntu.SystemService'
 AptDaemon.PackageKit: INFO: Initializing PackageKit compat layer
 dbus[901]: [system] Successfully activated service 'org.freedesktop.PackageKit'
 AptDaemon.PackageKit: INFO: Initializing PackageKit transaction
 AptDaemon.Worker: INFO: Simulating trans: /org/debian/apt/transaction/fe0bfd2c513544b398b97e1d104d825c
 AptDaemon.Worker: INFO: Processing transaction /org/debian/apt/transaction/fe0bfd2c513544b398b97e1d104d825c
 AptDaemon.PackageKit: INFO: Get updates()
 AptDaemon.Worker: INFO: Finished transaction /org/debian/apt/transaction/fe0bfd2c513544b398b97e1d104d825c
 kernel: [  337.599305] ACPI Error: Method parse/execution failed [\_SB_.PCI0.PEG0.PEGP.SGON] (Node ffff88012826bac8), AE_AML_INFINITE_LOOP (20110623/psparse-536)
 kernel: [  337.599320] ACPI Error: Method parse/execution failed [\_SB_.PCI0.PEG0.PEGP._ON_] (Node ffff88012826e118), AE_AML_INFINITE_LOOP (20110623/psparse-536)
 kernel: [  337.599327] ACPI Error: Method parse/execution failed [\_SB_.PCI0.PEG0.PEGP._PS0] (Node ffff88012826be38), AE_AML_INFINITE_LOOP (20110623/psparse-536)
 kernel: [  337.599337] ACPI: Device [PEGP] failed to transition to D0
 kernel: [  337.615292] pci_raw_set_power_state: 37 callbacks suppressed
 kernel: [  337.615297] pci 0000:01:00.0: Refused to change power state, currently in D3

Kern Log

(Ubuntu/Linaro 4.6.3-1ubuntu5) ) #40-Ubuntu SMP Wed May 23 20:30:51 UTC 2012 (Ubuntu 3.2.0-25.40-generic 3.2.18)
kernel: [    0.000000] Command line: BOOT_IMAGE=/boot/vmlinuz-3.2.0-25-generic root=UUID=d867879c-20d0-4c60-86e4-814ccebc8857 ro quiet splash vt.handoff=7
kernel: [   74.314846] bbswitch: enabling discrete graphics
kernel: [   74.872140] CPU3: Package power limit notification (total events = 1)
kernel: [   74.872145] CPU2: Package power limit notification (total events = 1)
kernel: [   74.872149] CPU1: Package power limit notification (total events = 1)
kernel: [   74.872152] CPU0: Package power limit notification (total events = 1)
kernel: [   74.872182] CPU3: Package power limit normal
kernel: [   74.872185] CPU2: Package power limit normal
kernel: [   74.872187] CPU1: Package power limit normal
kernel: [   74.872189] CPU0: Package power limit normal
kernel: [  337.599305] ACPI Error: Method parse/execution failed [\_SB_.PCI0.PEG0.PEGP.SGON] (Node ffff88012826bac8), AE_AML_INFINITE_LOOP (20110623/psparse-536)
kernel: [  337.599320] ACPI Error: Method parse/execution failed [\_SB_.PCI0.PEG0.PEGP._ON_] (Node ffff88012826e118), AE_AML_INFINITE_LOOP (20110623/psparse-536)
kernel: [  337.599327] ACPI Error: Method parse/execution failed [\_SB_.PCI0.PEG0.PEGP._PS0] (Node ffff88012826be38), AE_AML_INFINITE_LOOP (20110623/psparse-536)
kernel: [  337.599337] ACPI: Device [PEGP] failed to transition to D0
kernel: [  337.615292] pci_raw_set_power_state: 37 callbacks suppressed
kernel: [  337.615297] pci 0000:01:00.0: Refused to change power state, currently in D3
kernel: [  374.924644] CPU1: Package power limit notification (total events = 11139)
kernel: [  374.924648] CPU3: Package power limit notification (total events = 11139)
kernel: [  374.924670] CPU0: Package power limit notification (total events = 11138)
kernel: [  374.924674] CPU2: Package power limit notification (total events = 11139)
kernel: [  374.925974] CPU1: Package power limit normal
kernel: [  374.925978] CPU2: Package power limit normal
kernel: [  374.925981] CPU0: Package power limit normal
kernel: [  374.925984] CPU3: Package power limit normal
kernel: [  600.888155] ACPI Error: Method parse/execution failed [\_SB_.PCI0.PEG0.PEGP.SGON] (Node ffff88012826bac8), AE_AML_INFINITE_LOOP (20110623/psparse-536)
kernel: [  600.888169] ACPI Error: Method parse/execution failed [\_SB_.PCI0.PEG0.PEGP._ON_] (Node ffff88012826e118), AE_AML_INFINITE_LOOP (20110623/psparse-536)
kernel: [  600.888175] ACPI Error: Method parse/execution failed [\_SB_.PCI0.PEG0.PEGP._PS0] (Node ffff88012826be38), AE_AML_INFINITE_LOOP (20110623/psparse-536)
kernel: [  600.888185] ACPI: Device [PEGP] failed to transition to D0
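
The ACPI errors above repeat on every attempt, so it can help to reduce them to the distinct method and error-code pairs. A minimal sketch, with a regular expression written against the kernel messages quoted in this thread; other kernel versions may format these lines differently:

```python
import re

# Captures the ACPI method path and the AE_* status from lines such as
# "Method parse/execution failed [\_SB_.PCI0.PEG0.PEGP.SGON] (...), AE_AML_INFINITE_LOOP".
ACPI_FAIL = re.compile(
    r"Method parse/execution failed \[?(?P<method>\\?[\w.]+)\]?.*?(?P<code>AE_[A-Z_]+)"
)


def failed_methods(lines):
    """Return distinct (method, error code) pairs in order of first appearance."""
    seen = []
    for line in lines:
        match = ACPI_FAIL.search(line)
        if match:
            pair = (match.group("method"), match.group("code"))
            if pair not in seen:
                seen.append(pair)
    return seen
```

On the kern.log shown here this yields three pairs (SGON, _ON_ and _PS0, all with AE_AML_INFINITE_LOOP), which suggests the failure starts in SGON and propagates outward, although only the DSDT source would confirm that.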

Any suggestions?

Thanks

@yimm

Have you tried the latest drivers?
Using the latest drivers I get:
NVRM: RmInitAdapter failed! (0x26:0xffffffff:1170)
NVRM: rm_init_adapter(0) failed

Clevo W150ERQ - 650M

@joeythesaint
@Lekensteyn
Bumblebee-Project member

The 302.xx and 295.xx drivers seem to be different. 295.59 is newer than 302.11 and supports the GT 650M, whereas 302.11 does not work.

@yimm
@Lekensteyn
Bumblebee-Project member

Does nouveau work? What does lspci -nn -d10de: output?

@yimm
@Lekensteyn
Bumblebee-Project member

"Does not work" is not meaningful. Please attach /var/log/Xorg.8.log and the relevant tail of dmesg after attempting to use nouveau.

@yimm

As I recall, there was nothing in dmesg, and Xorg.8.log simply doesn't exist.
I have now switched to Arch Linux. Using nouveau I get:

XORG [drm] failed to open device

In dmesg:

Jun 21 00:29:30 localhost kernel: [ 799.239993] wmi: Mapper loaded
Jun 21 00:29:30 localhost kernel: [ 799.256477] MXM: GUID detected in BIOS
Jun 21 00:29:30 localhost kernel: [ 799.256645] VGA switcheroo: detected Optimus DSM method _SB_.PCI0.PEG0.PEGP handle
Jun 21 00:29:30 localhost kernel: [ 799.256687] nouveau 0000:01:00.0: power state changed by ACPI to D0
Jun 21 00:29:30 localhost kernel: [ 799.256692] nouveau 0000:01:00.0: power state changed by ACPI to D0
Jun 21 00:29:30 localhost kernel: [ 799.257093] [drm] nouveau 0000:01:00.0: Detected an NVe0 generation card (0x0e7110a2)
Jun 21 00:29:30 localhost kernel: [ 799.277186] vga_switcheroo: enabled
Jun 21 00:29:30 localhost kernel: [ 799.277192] [drm] nouveau 0000:01:00.0: Checking PRAMIN for VBIOS
Jun 21 00:29:30 localhost kernel: [ 799.325258] [drm] nouveau 0000:01:00.0: ... BIOS signature not found
Jun 21 00:29:30 localhost kernel: [ 799.325261] [drm] nouveau 0000:01:00.0: Checking PROM for VBIOS
Jun 21 00:29:30 localhost kernel: [ 799.325361] [drm] nouveau 0000:01:00.0: ... BIOS signature not found
Jun 21 00:29:30 localhost kernel: [ 799.325363] [drm] nouveau 0000:01:00.0: Checking ACPI for VBIOS
Jun 21 00:29:30 localhost kernel: [ 799.497528] [drm] nouveau 0000:01:00.0: ... BIOS checksum invalid
Jun 21 00:29:30 localhost kernel: [ 799.497538] [drm] nouveau 0000:01:00.0: Checking PCIROM for VBIOS
Jun 21 00:29:30 localhost kernel: [ 799.498479] [drm] nouveau 0000:01:00.0: ... BIOS checksum invalid
Jun 21 00:29:30 localhost kernel: [ 799.498483] [drm] nouveau 0000:01:00.0: Using VBIOS from ACPI
Jun 21 00:29:30 localhost kernel: [ 799.498490] [drm] nouveau 0000:01:00.0: BIT BIOS found
Jun 21 00:29:30 localhost kernel: [ 799.498497] [drm] nouveau 0000:01:00.0: Bios version 80.07.1d.00
Jun 21 00:29:30 localhost kernel: [ 799.498505] [drm] nouveau 0000:01:00.0: TMDS table version 2.0
Jun 21 00:29:30 localhost kernel: [ 799.500210] [drm] nouveau 0000:01:00.0: MXM: no VBIOS data, nothing to do
Jun 21 00:29:30 localhost kernel: [ 799.500220] [drm] nouveau 0000:01:00.0: DCB version 4.0

And now Xorg.8.log:
http://pastebin.com/TRjA4jX8
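
The VBIOS probing sequence in the dmesg output above can be summarised per source. A minimal sketch, assuming the nouveau messages keep the "Checking X for VBIOS" / "... result" / "Using VBIOS from X" wording shown in this log; the function name is made up for illustration:

```python
import re

CHECK = re.compile(r"Checking (\w+) for VBIOS")
RESULT = re.compile(r"\.\.\. (BIOS .+)$")
USING = re.compile(r"Using VBIOS from (\w+)")


def vbios_probe(lines):
    """Return ({source: result}, chosen_source) from nouveau dmesg lines."""
    results = {}
    chosen = None
    current = None  # source named by the most recent "Checking ..." line
    for line in lines:
        line = line.rstrip()
        match = CHECK.search(line)
        if match:
            current = match.group(1)
            continue
        match = RESULT.search(line)
        if match and current:
            results[current] = match.group(1)
            current = None
            continue
        match = USING.search(line)
        if match:
            chosen = match.group(1)
    return results, chosen
```

For the log above this reports a missing signature for PRAMIN and PROM, an invalid checksum for ACPI and PCIROM, and ACPI as the source that was used anyway.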

@Lekensteyn
Bumblebee-Project member

If that is all of the output from dmesg, then the VBIOS is invalid or was read incorrectly. Strange.

@joeythesaint
@yimm

Hm...
I remember now that I was using the "acpi_osi=" option.

Without "acpi_osi=" on the GRUB kernel line, I noticed that the nvidia card doesn't detect any screen (using the proprietary drivers):

optirun glxgears
[ERROR]Cannot access secondary GPU - error: XORG NVIDIA(0): Failed to assign any connected display devices to X screen 0

hm

nvidia-xconfig --query-gpu-info
Number of GPUs: 1

GPU #0:
Name : GeForce GT 650M
PCI BusID : PCI: 1: 0: 0

Number of Display Devices: 0

Strange problem.

dmesg log when I start optirun:

Jun 21 20:48:29 localhost kernel: [ 2784.209162] NVRM: loading NVIDIA UNIX x86_64 Kernel Module 302.17 Tue Jun 12 16:03:22 PDT 2012
Jun 21 20:48:40 localhost acpid: client connected from 3015[0:999]
Jun 21 20:48:40 localhost acpid: 1 client rule loaded
Jun 21 20:48:43 localhost kernel: [ 2798.053979] ACPI Error: Field [TMPB] at 282624 exceeds Buffer [ROM1] size 262144 (bits) (20120320/dsopcode-236)
Jun 21 20:48:43 localhost kernel: [ 2798.053989] ACPI Error: Method parse/execution failed _SB_.PCI0.PEG0.PEGP._ROM, AE_AML_BUFFER_LIMIT (20120320/psparse-536)
Jun 21 20:48:44 localhost bumblebeed[1946]: XORG NVIDIA(0): Failed to enable display hotplug notification
Jun 21 20:48:47 localhost acpid: client 3015[0:999] has disconnected

With the "acpi_osi=" option, dmesg shows:

[ 30.398673] NVRM: RmInitAdapter failed! (0x26:0xffffffff:1170)
[ 30.398682] NVRM: rm_init_adapter(0) failed

nvidia-xconfig --query-gpu-info
NVIDIA: could not open the device file /dev/nvidiactl (No such device or address).
ERROR: Unable to query GPU information
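
Because the behaviour differs with and without "acpi_osi=", it is worth confirming what the running kernel was actually booted with. A minimal sketch that splits a kernel command line into parameters; it does not handle quoted values containing spaces, and the function name is made up for illustration:

```python
def kernel_params(cmdline: str) -> dict:
    """Split a kernel command line into {name: value}.

    Bare flags such as "quiet" map to True; "acpi_osi=" maps to an empty string.
    """
    params = {}
    for token in cmdline.split():
        key, sep, value = token.partition("=")
        params[key] = value if sep else True
    return params
```

Reading `/proc/cmdline` and checking `"acpi_osi" in kernel_params(text)` distinguishes "option absent" from "option present but empty", which are the two cases compared above.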

@joeythesaint

Right, so I fixed the CRT-0 thing in my /etc/bumblebee/xorg.conf.nvidia:

Section "ServerLayout"
    Identifier "Layout0"
    Option "AutoAddDevices" "false"
EndSection

Section "Device"
    Identifier "Device1"
    Driver "nvidia"
    VendorName "NVIDIA Corporation"
    Option "NoLogo" "true"
    Option "UseEDID" "false"
    Option "ConnectedMonitor" "CRT-0"
EndSection

And I'm still getting no love from the nvidia drivers; we're back to "fallen off the bus", though:

[  131.218429] nvidia: module license 'NVIDIA' taints kernel.
[  131.218432] Disabling lock debugging due to kernel taint
[  131.223910] nvidia 0000:01:00.0: power state changed by ACPI to D0
[  131.223914] nvidia 0000:01:00.0: power state changed by ACPI to D0
[  131.223916] nvidia 0000:01:00.0: enabling device (0006 -> 0007)
[  131.223926] vgaarb: device changed decodes: PCI:0000:01:00.0,olddecodes=io+mem,decodes=none:owns=none
[  131.224003] NVRM: loading NVIDIA UNIX x86_64 Kernel Module  302.17  Tue Jun 12 16:03:22 PDT 2012
[  136.726514] NVRM: GPU at 0000:01:00.0 has fallen off the bus.
[  136.726542] NVRM: GPU at 0000:01:00.0 has fallen off the bus.
[  136.996248] NVRM: RmInitAdapter failed! (0x26:0xffffffff:1170)
[  136.996255] NVRM: rm_init_adapter(0) failed

This is the Xorg.8.log:

[   131.186] 
X.Org X Server 1.12.2
Release Date: 2012-05-29
[   131.186] X Protocol Version 11, Revision 0
[   131.186] Build Operating System: Linux 2.6.24-29-xen x86_64 Ubuntu
[   131.186] Current Operating System: Linux skynet 3.5.0-1-generic #1-Ubuntu SMP Tue Jun 19 20:56:48 UTC 2012 x86_64
[   131.186] Kernel command line: BOOT_IMAGE=/boot/vmlinuz-3.5.0-1-generic root=UUID=9b0ea8db-ffb8-451e-a38b-485142cd15dc ro quiet splash vt.handoff=7
[   131.186] Build Date: 05 June 2012  08:35:55AM
[   131.186] xorg-server 2:1.12.2+git20120605+server-1.12-branch.aaf48906-0ubuntu0ricotz~precise (For technical support please see http://www.ubuntu.com/support) 
[   131.186] Current version of pixman: 0.26.0
[   131.186]    Before reporting problems, check http://wiki.x.org
    to make sure that you have the latest version.
[   131.186] Markers: (--) probed, (**) from config file, (==) default setting,
    (++) from command line, (!!) notice, (II) informational,
    (WW) warning, (EE) error, (NI) not implemented, (??) unknown.
[   131.186] (==) Log file: "/var/log/Xorg.8.log", Time: Thu Jun 21 16:17:13 2012
[   131.186] (++) Using config file: "/etc/bumblebee/xorg.conf.nvidia"
[   131.186] (==) Using system config directory "/usr/share/X11/xorg.conf.d"
[   131.198] (==) ServerLayout "Layout0"
[   131.198] (==) No screen section available. Using defaults.
[   131.198] (**) |-->Screen "Default Screen Section" (0)
[   131.198] (**) |   |-->Monitor "<default monitor>"
[   131.199] (==) No device specified for screen "Default Screen Section".
    Using the first device section listed.
[   131.199] (**) |   |-->Device "Device1"
[   131.199] (==) No monitor specified for screen "Default Screen Section".
    Using a default monitor configuration.
[   131.199] (**) Option "AutoAddDevices" "false"
[   131.199] (**) Not automatically adding devices
[   131.199] (==) Automatically enabling devices
[   131.199] (WW) The directory "/usr/share/fonts/X11/cyrillic" does not exist.
[   131.199]    Entry deleted from font path.
[   131.199] (WW) The directory "/usr/share/fonts/X11/100dpi/" does not exist.
[   131.199]    Entry deleted from font path.
[   131.199] (WW) The directory "/usr/share/fonts/X11/75dpi/" does not exist.
[   131.199]    Entry deleted from font path.
[   131.199] (WW) The directory "/usr/share/fonts/X11/100dpi" does not exist.
[   131.199]    Entry deleted from font path.
[   131.199] (WW) The directory "/usr/share/fonts/X11/75dpi" does not exist.
[   131.199]    Entry deleted from font path.
[   131.199] (WW) The directory "/var/lib/defoma/x-ttcidfont-conf.d/dirs/TrueType" does not exist.
[   131.199]    Entry deleted from font path.
[   131.199] (==) FontPath set to:
    /usr/share/fonts/X11/misc,
    /usr/share/fonts/X11/Type1,
    built-ins
[   131.199] (++) ModulePath set to "/usr/lib/nvidia-current/xorg,/usr/lib/xorg/modules"
[   131.199] (==) |-->Input Device "<default pointer>"
[   131.199] (==) |-->Input Device "<default keyboard>"
[   131.199] (==) The core pointer device wasn't specified explicitly in the layout.
    Using the default mouse configuration.
[   131.199] (==) The core keyboard device wasn't specified explicitly in the layout.
    Using the default keyboard configuration.
[   131.199] (II) Loader magic: 0x7fdea6c4db00
[   131.199] (II) Module ABI versions:
[   131.199]    X.Org ANSI C Emulation: 0.4
[   131.199]    X.Org Video Driver: 12.0
[   131.199]    X.Org XInput driver : 16.0
[   131.199]    X.Org Server Extension : 6.0
[   131.199] (--) PCI:*(0:1:0:0) 10de:0fd1:1558:1150 rev 161, Mem @ 0xf6000000/16777216, 0xe0000000/268435456, 0xf0000000/33554432, I/O @ 0x0000e000/128, BIOS @ 0x????????/524288
[   131.199] (II) Open ACPI successful (/var/run/acpid.socket)
[   131.199] (II) LoadModule: "extmod"
[   131.199] (II) Loading /usr/lib/xorg/modules/extensions/libextmod.so
[   131.200] (II) Module extmod: vendor="X.Org Foundation"
[   131.200]    compiled for 1.12.2, module version = 1.0.0
[   131.200]    Module class: X.Org Server Extension
[   131.200]    ABI class: X.Org Server Extension, version 6.0
[   131.200] (II) Loading extension MIT-SCREEN-SAVER
[   131.200] (II) Loading extension XFree86-VidModeExtension
[   131.200] (II) Loading extension XFree86-DGA
[   131.200] (II) Loading extension DPMS
[   131.200] (II) Loading extension XVideo
[   131.200] (II) Loading extension XVideo-MotionCompensation
[   131.200] (II) Loading extension X-Resource
[   131.200] (II) LoadModule: "dbe"
[   131.200] (II) Loading /usr/lib/xorg/modules/extensions/libdbe.so
[   131.200] (II) Module dbe: vendor="X.Org Foundation"
[   131.200]    compiled for 1.12.2, module version = 1.0.0
[   131.200]    Module class: X.Org Server Extension
[   131.200]    ABI class: X.Org Server Extension, version 6.0
[   131.200] (II) Loading extension DOUBLE-BUFFER
[   131.200] (II) LoadModule: "glx"
[   131.200] (II) Loading /usr/lib/nvidia-current/xorg/libglx.so
[   131.726] (II) Module glx: vendor="NVIDIA Corporation"
[   131.726]    compiled for 4.0.2, module version = 1.0.0
[   131.726]    Module class: X.Org Server Extension
[   131.726] (II) NVIDIA GLX Module  302.17  Tue Jun 12 16:22:45 PDT 2012
[   131.726] (II) Loading extension GLX
[   131.726] (II) LoadModule: "record"
[   131.726] (II) Loading /usr/lib/xorg/modules/extensions/librecord.so
[   131.726] (II) Module record: vendor="X.Org Foundation"
[   131.726]    compiled for 1.12.2, module version = 1.13.0
[   131.726]    Module class: X.Org Server Extension
[   131.726]    ABI class: X.Org Server Extension, version 6.0
[   131.726] (II) Loading extension RECORD
[   131.726] (II) LoadModule: "dri"
[   131.726] (II) Loading /usr/lib/xorg/modules/extensions/libdri.so
[   131.726] (II) Module dri: vendor="X.Org Foundation"
[   131.726]    compiled for 1.12.2, module version = 1.0.0
[   131.726]    ABI class: X.Org Server Extension, version 6.0
[   131.726] (II) Loading extension XFree86-DRI
[   131.726] (II) LoadModule: "dri2"
[   131.726] (II) Loading /usr/lib/xorg/modules/extensions/libdri2.so
[   131.727] (II) Module dri2: vendor="X.Org Foundation"
[   131.727]    compiled for 1.12.2, module version = 1.2.0
[   131.727]    ABI class: X.Org Server Extension, version 6.0
[   131.727] (II) Loading extension DRI2
[   131.727] (II) LoadModule: "nvidia"
[   131.727] (II) Loading /usr/lib/nvidia-current/xorg/nvidia_drv.so
[   131.786] (II) Module nvidia: vendor="NVIDIA Corporation"
[   131.786]    compiled for 4.0.2, module version = 1.0.0
[   131.786]    Module class: X.Org Video Driver
[   131.805] (II) LoadModule: "mouse"
[   131.805] (II) Loading /usr/lib/xorg/modules/input/mouse_drv.so
[   131.817] (II) Module mouse: vendor="X.Org Foundation"
[   131.817]    compiled for 1.12.0, module version = 1.7.2
[   131.817]    Module class: X.Org XInput Driver
[   131.817]    ABI class: X.Org XInput driver, version 16.0
[   131.817] (II) LoadModule: "kbd"
[   131.817] (WW) Warning, couldn't open module kbd
[   131.817] (II) UnloadModule: "kbd"
[   131.817] (II) Unloading kbd
[   131.817] (EE) Failed to load module "kbd" (module does not exist, 0)
[   131.817] (II) NVIDIA dlloader X Driver  302.17  Tue Jun 12 16:04:53 PDT 2012
[   131.817] (II) NVIDIA Unified Driver for all Supported NVIDIA GPUs
[   131.818] (--) using VT number 7

[   131.819] (II) Loading sub module "fb"
[   131.819] (II) LoadModule: "fb"
[   131.819] (II) Loading /usr/lib/xorg/modules/libfb.so
[   131.819] (II) Module fb: vendor="X.Org Foundation"
[   131.819]    compiled for 1.12.2, module version = 1.0.0
[   131.819]    ABI class: X.Org ANSI C Emulation, version 0.4
[   131.819] (II) Loading sub module "wfb"
[   131.819] (II) LoadModule: "wfb"
[   131.819] (II) Loading /usr/lib/xorg/modules/libwfb.so
[   131.831] (II) Module wfb: vendor="X.Org Foundation"
[   131.831]    compiled for 1.12.2, module version = 1.0.0
[   131.831]    ABI class: X.Org ANSI C Emulation, version 0.4
[   131.831] (II) Loading sub module "ramdac"
[   131.831] (II) LoadModule: "ramdac"
[   131.831] (II) Module "ramdac" already built-in
[   132.002] (II) NVIDIA(0): Creating default Display subsection in Screen section
    "Default Screen Section" for depth/fbbpp 24/32
[   132.002] (==) NVIDIA(0): Depth 24, (==) framebuffer bpp 32
[   132.002] (==) NVIDIA(0): RGB weight 888
[   132.002] (==) NVIDIA(0): Default visual is TrueColor
[   132.002] (==) NVIDIA(0): Using gamma correction (1.0, 1.0, 1.0)
[   132.002] (**) NVIDIA(0): Option "NoLogo" "true"
[   132.002] (**) NVIDIA(0): Option "UseEDID" "false"
[   132.002] (**) NVIDIA(0): Option "ConnectedMonitor" "CRT-0"
[   132.002] (**) NVIDIA(0): Enabling 2D acceleration
[   132.002] (**) NVIDIA(0): ConnectedMonitor string: "CRT-0"
[   132.003] (**) NVIDIA(0): Ignoring EDIDs
[   136.904] (EE) NVIDIA(0): Failed to initialize the NVIDIA GPU at PCI:1:0:0.  Please
[   136.904] (EE) NVIDIA(0):     check your system's kernel log for additional error
[   136.904] (EE) NVIDIA(0):     messages and refer to Chapter 8: Common Problems in the
[   136.904] (EE) NVIDIA(0):     README for additional information.
[   136.904] (EE) NVIDIA(0): Failed to initialize the NVIDIA graphics device!
[   136.904] (EE) NVIDIA(0): Failing initialization of X screen 0
[   136.904] (II) UnloadModule: "nvidia"
[   136.904] (II) UnloadSubModule: "wfb"
[   136.904] (II) UnloadSubModule: "fb"
[   136.904] (EE) Screen(s) found, but none have a usable configuration.
[   136.904] 
Fatal server error:
[   136.904] no screens found
[   136.904] 
Please consult the The X.Org Foundation support 
     at http://wiki.x.org
 for help. 
[   136.904] Please also check the log file at "/var/log/Xorg.8.log" for additional information.
[   136.904] 
[   136.904]  ddxSigGiveUp: Closing log
[   136.904] Server terminated with error (1). Closing log file.

It's otherwise exactly the same configuration as what was half working with nouveau last night.

skynet ~ uname -a                           
Linux skynet 3.5.0-1-generic #1-Ubuntu SMP Tue Jun 19 20:56:48 UTC 2012 x86_64 x86_64 x86_64 GNU/Linux
skynet ~ dpkg --list | grep ^ii | grep vidia
ii  bumblebee-nvidia                       3.0-2~preciseppa1                                                       nVidia Optimus support using the proprietary NVIDIA driver
ii  nvidia-common                          1:0.2.44                                                                Find obsolete NVIDIA drivers
ii  nvidia-current                         302.17-0ubuntu1~xedgers~precise1                                        NVIDIA binary Xorg driver, kernel module and VDPAU library
ii  nvidia-current-dev                     302.17-0ubuntu1~xedgers~precise1                                        NVIDIA binary Xorg driver development files
ii  nvidia-settings                        302.17-0ubuntu1~precise~xup3                                            Tool of configuring the NVIDIA graphics driver
@Lekensteyn
Bumblebee-Project member

Do you have "The PCI config space could not be written fully at ..." messages in syslog?

@joeythesaint
@Lekensteyn
Bumblebee-Project member

That _ROM error may be related to https://bugzilla.kernel.org/show_bug.cgi?id=43063. I currently do not know what is going on here with those new chips.

@ArchangeGabriel
Bumblebee-Project member

Is nouveau properly blacklisted? The NVRM errors are typical of what you get when nouveau has been loaded at some point before nvidia. Otherwise, it should be that _ROM error.
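
One way to answer that question is to look at the kernel's module table directly. The following is a minimal sketch; the function name `module_loaded` is made up for illustration, and the optional second argument exists only so the check can be run against a saved copy of /proc/modules.

```shell
# module_loaded MODULE [MODULES_FILE]
# Succeeds if MODULE appears in the loaded-modules table.
# MODULES_FILE defaults to /proc/modules, which lists one module per line
# with the module name in the first column.
module_loaded() {
    awk -v m="$1" '$1 == m { found = 1 } END { exit !found }' "${2:-/proc/modules}"
}
```

For example, `module_loaded nouveau && echo "nouveau is loaded"` prints a warning if nouveau is present. Note that this only shows the current state; it cannot tell whether nouveau was loaded and unloaded earlier in the boot.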

@ArchangeGabriel
Bumblebee-Project member

Oh, by the way, Lekensteyn, I finally got nouveau working without that error on 3.5-rc1.

@joeythesaint
@Lekensteyn Lekensteyn reopened this Jul 10, 2012
@Lekensteyn
Bumblebee-Project member

Reopening just to track this issue.

@studentz

With the nvidia drivers the problem still persists, even though Ubuntu 12.04 had an update of the kernel and xorg.
nvidia-current 302.17
bumblebee-nvidia 3.02
kernel 3.2.0-27

@joey55

I have the same issue here with a Dell Inspiron 17R.

Ubuntu 12.04
nvidia-current 302.17
bumblebee 3.0-2
kernel 3.2.0-26-generic

@glaere

Not so familiar with these things but discrete graphics don't work.

Ubuntu 12.04
linux 3.2.0-26 generic (64)
clevo w150er
nvidia 302.17
i7 3610qm (VGA 4000)

optirun glxgears reads:

[ERROR]Cannot access secondary GPU - error: XORG NVIDIA(0): Failed to initialize the NVIDIA GPU at PCI:1:0:0. Please

[ERROR]Aborting because fallback start is disabled.

/var/log/kern.log reads:

Jul 13 20:45:48 thespacebetween kernel: [13894.035893] bbswitch: enabling discrete graphics
Jul 13 20:45:48 thespacebetween kernel: [13894.350419] CPU4: Package power limit notification (total events = 14111)
Jul 13 20:45:48 thespacebetween kernel: [13894.350421] CPU0: Package power limit notification (total events = 14107)
Jul 13 20:45:48 thespacebetween kernel: [13894.350454] CPU5: Package power limit notification (total events = 14111)
Jul 13 20:45:48 thespacebetween kernel: [13894.350458] CPU1: Package power limit notification (total events = 14105)
Jul 13 20:45:48 thespacebetween kernel: [13894.350461] CPU2: Package power limit notification (total events = 14108)
Jul 13 20:45:48 thespacebetween kernel: [13894.350464] CPU6: Package power limit notification (total events = 14110)
Jul 13 20:45:48 thespacebetween kernel: [13894.350468] CPU3: Package power limit notification (total events = 14111)
Jul 13 20:45:48 thespacebetween kernel: [13894.350471] CPU7: Package power limit notification (total events = 14111)
Jul 13 20:45:48 thespacebetween kernel: [13894.350498] CPU4: Package power limit normal
Jul 13 20:45:48 thespacebetween kernel: [13894.350499] CPU0: Package power limit normal
Jul 13 20:45:48 thespacebetween kernel: [13894.350537] CPU5: Package power limit normal
Jul 13 20:45:48 thespacebetween kernel: [13894.350539] CPU1: Package power limit normal
Jul 13 20:45:48 thespacebetween kernel: [13894.350542] CPU6: Package power limit normal
Jul 13 20:45:48 thespacebetween kernel: [13894.350545] CPU2: Package power limit normal
Jul 13 20:45:48 thespacebetween kernel: [13894.350548] CPU3: Package power limit normal
Jul 13 20:45:48 thespacebetween kernel: [13894.350550] CPU7: Package power limit normal
Jul 13 20:45:50 thespacebetween kernel: [13895.756115] pci 0000:01:00.0: power state changed by ACPI to D0
Jul 13 20:45:50 thespacebetween kernel: [13895.756124] pci 0000:01:00.0: power state changed by ACPI to D0
Jul 13 20:45:50 thespacebetween kernel: [13895.756148] pci 0000:01:00.0: restoring config space at offset 0x9 (was 0x15580b01, writing 0xe001)
Jul 13 20:45:50 thespacebetween kernel: [13895.756153] pci 0000:01:00.0: restoring config space at offset 0x7 (was 0x5800000c, writing 0xf000000c)
Jul 13 20:45:50 thespacebetween kernel: [13895.756158] pci 0000:01:00.0: restoring config space at offset 0x5 (was 0xc, writing 0xe000000c)
Jul 13 20:45:50 thespacebetween kernel: [13895.756162] pci 0000:01:00.0: restoring config space at offset 0x4 (was 0x1000000, writing 0xf6000000)
Jul 13 20:45:50 thespacebetween kernel: [13895.756168] pci 0000:01:00.0: restoring config space at offset 0x1 (was 0x100002, writing 0x100006)
Jul 13 20:45:50 thespacebetween kernel: [13895.756183] pci 0000:01:00.0: power state changed by ACPI to D0
Jul 13 20:45:50 thespacebetween kernel: [13895.756186] pci 0000:01:00.0: power state changed by ACPI to D0
Jul 13 20:45:50 thespacebetween kernel: [13895.756191] pci 0000:01:00.0: enabling device (0006 -> 0007)
Jul 13 20:45:50 thespacebetween kernel: [13895.756198] pci 0000:01:00.0: PCI INT A -> GSI 16 (level, low) -> IRQ 16
Jul 13 20:45:50 thespacebetween kernel: [13895.756203] pci 0000:01:00.0: setting latency timer to 64
Jul 13 20:45:50 thespacebetween kernel: [13895.908340] nvidia: module license 'NVIDIA' taints kernel.
Jul 13 20:45:50 thespacebetween kernel: [13895.908343] Disabling lock debugging due to kernel taint
Jul 13 20:45:50 thespacebetween kernel: [13895.962387] nvidia 0000:01:00.0: power state changed by ACPI to D0
Jul 13 20:45:50 thespacebetween kernel: [13895.962392] nvidia 0000:01:00.0: power state changed by ACPI to D0
Jul 13 20:45:50 thespacebetween kernel: [13895.962398] nvidia 0000:01:00.0: enabling device (0006 -> 0007)
Jul 13 20:45:50 thespacebetween kernel: [13895.962405] nvidia 0000:01:00.0: PCI INT A -> GSI 16 (level, low) -> IRQ 16
Jul 13 20:45:50 thespacebetween kernel: [13895.962413] nvidia 0000:01:00.0: setting latency timer to 64
Jul 13 20:45:50 thespacebetween kernel: [13895.962419] vgaarb: device changed decodes: PCI:0000:01:00.0,olddecodes=io+mem,decodes=none:owns=none
Jul 13 20:45:50 thespacebetween kernel: [13895.962531] NVRM: loading NVIDIA UNIX x86_64 Kernel Module 302.17 Tue Jun 12 16:03:22 PDT 2012
Jul 13 20:45:59 thespacebetween kernel: [13904.932647] NVRM: RmInitAdapter failed! (0x26:0xffffffff:1170)
Jul 13 20:45:59 thespacebetween kernel: [13904.932655] NVRM: rm_init_adapter(0) failed

@apottere

Same errors with Ubuntu 12.04 and Nvidia GT 650M.

/var/log/kern.log:

Jul 15 16:49:18 Empire kernel: [ 55.158348] bbswitch: enabling discrete graphics
Jul 15 16:49:19 Empire kernel: [ 55.246274] [Firmware Bug]: ACPI: BIOS _OSI(Linux) query ignored
Jul 15 16:49:19 Empire kernel: [ 55.262214] pci 0000:01:00.0: power state changed by ACPI to D0
Jul 15 16:49:19 Empire kernel: [ 55.262220] pci 0000:01:00.0: power state changed by ACPI to D0
Jul 15 16:49:19 Empire kernel: [ 55.262239] pci 0000:01:00.0: restoring config space at offset 0xc (was 0x0, writing 0xfff80000)
Jul 15 16:49:19 Empire kernel: [ 55.262245] pci 0000:01:00.0: restoring config space at offset 0x9 (was 0x1, writing 0x4001)
Jul 15 16:49:19 Empire kernel: [ 55.262250] pci 0000:01:00.0: restoring config space at offset 0x7 (was 0xc, writing 0xb000000c)
Jul 15 16:49:19 Empire kernel: [ 55.262254] pci 0000:01:00.0: restoring config space at offset 0x5 (was 0xc, writing 0xa000000c)
Jul 15 16:49:19 Empire kernel: [ 55.262259] pci 0000:01:00.0: restoring config space at offset 0x4 (was 0x0, writing 0xd2000000)
Jul 15 16:49:19 Empire kernel: [ 55.262263] pci 0000:01:00.0: restoring config space at offset 0x3 (was 0x0, writing 0x10)
Jul 15 16:49:19 Empire kernel: [ 55.262267] pci 0000:01:00.0: restoring config space at offset 0x1 (was 0x100000, writing 0x100006)
Jul 15 16:49:19 Empire kernel: [ 55.262281] pci 0000:01:00.0: power state changed by ACPI to D0
Jul 15 16:49:19 Empire kernel: [ 55.262284] pci 0000:01:00.0: power state changed by ACPI to D0
Jul 15 16:49:19 Empire kernel: [ 55.262288] pci 0000:01:00.0: enabling device (0006 -> 0007)
Jul 15 16:49:19 Empire kernel: [ 55.262294] pci 0000:01:00.0: PCI INT A -> GSI 16 (level, low) -> IRQ 16
Jul 15 16:49:19 Empire kernel: [ 55.262299] pci 0000:01:00.0: setting latency timer to 64
Jul 15 16:49:19 Empire kernel: [ 55.515153] nvidia: module license 'NVIDIA' taints kernel.
Jul 15 16:49:19 Empire kernel: [ 55.515155] Disabling lock debugging due to kernel taint
Jul 15 16:49:19 Empire kernel: [ 55.551904] nvidia 0000:01:00.0: power state changed by ACPI to D0
Jul 15 16:49:19 Empire kernel: [ 55.551908] nvidia 0000:01:00.0: power state changed by ACPI to D0
Jul 15 16:49:19 Empire kernel: [ 55.551912] nvidia 0000:01:00.0: enabling device (0006 -> 0007)
Jul 15 16:49:19 Empire kernel: [ 55.551918] nvidia 0000:01:00.0: PCI INT A -> GSI 16 (level, low) -> IRQ 16
Jul 15 16:49:19 Empire kernel: [ 55.551924] nvidia 0000:01:00.0: setting latency timer to 64
Jul 15 16:49:19 Empire kernel: [ 55.551929] vgaarb: device changed decodes: PCI:0000:01:00.0,olddecodes=io+mem,decodes=none:owns=none
Jul 15 16:49:19 Empire kernel: [ 55.551998] NVRM: loading NVIDIA UNIX x86_64 Kernel Module 295.40 Thu Apr 5 21:37:00 PDT 2012

@ArchangeGabriel
Bumblebee-Project member

According to the nVidia documentation, this is a kernel error at the ACPI level, and the only thing to do is wait for it to be fixed upstream in the Linux kernel.

You are in that case if it still fails with 304.22, with "UseDisplayDevice" "none" in place of "ConnectedMonitor" "DFP" (or "CRT"), and with PM set to none.
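
To confirm which of those settings are actually in effect, a small check along these lines may help. It is only a sketch: the function names are made up for illustration, and the paths in the usage note are the ones that appear elsewhere in this thread (/etc/bumblebee/xorg.conf.nvidia and /etc/bumblebee/bumblebee.conf).

```shell
# has_xorg_option FILE NAME VALUE
# Succeeds if FILE contains an uncommented line of the form:
#   Option "NAME" "VALUE"
has_xorg_option() {
    grep -Eq "^[[:space:]]*Option[[:space:]]+\"$2\"[[:space:]]+\"$3\"" "$1"
}

# pm_method FILE
# Prints the value of every uncommented PMMethod= line in FILE,
# one per line (the file may set it once per driver section).
pm_method() {
    sed -n 's/^[[:space:]]*PMMethod[[:space:]]*=[[:space:]]*//p' "$1"
}
```

Usage might look like `has_xorg_option /etc/bumblebee/xorg.conf.nvidia UseDisplayDevice none` followed by `pm_method /etc/bumblebee/bumblebee.conf`.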

@joey55

I tried again with kernel 3.5 and 304.22 driver (from xorg-edgers ppa).
I used "UseDisplayDevice" "none" instead of "ConnectedMonitor" "DFP", as suggested above, but the result remains the same :-{.


699.237 NVIDIA(0): Option "UseDisplayDevice" set to "none"; enabling NoScanout
699.237 NVIDIA(0): mode
704.082 NVIDIA(0): Failed to initialize the NVIDIA GPU at PCI:1:0:0. Please
704.082 NVIDIA(0): check your system's kernel log for additional error
704.082 NVIDIA(0): messages and refer to Chapter 8: Common Problems in the
704.082 NVIDIA(0): README for additional information.
704.082 NVIDIA(0): Failed to initialize the NVIDIA graphics device!
704.082 NVIDIA(0): Failing initialization of X screen 0
704.082 UnloadModule: "nvidia"


It looks like I am out of options here....

The nouveau driver works, but it is way slower than the IGP on glxgears. And it crashes on glxspheres:

X Error of failed request: BadMatch (invalid parameter attributes)
Major opcode of failed request: 72 (X_PutImage)


@Jeeppler

Hi, I have the same problem: a new 3rd-generation i7 processor and motherboard with a new GT 650M. I run Linux Mint 13 Maya with the PPA from Ubuntu. I tried some of the solutions suggested in this thread, but none of them worked.

kern.log

Jul 19 22:46:29 jogopc kernel: [ 83.484889] nvidia: module license 'NVIDIA' taints kernel.
Jul 19 22:46:29 jogopc kernel: [ 83.484892] Disabling lock debugging due to kernel taint
Jul 19 22:46:29 jogopc kernel: [ 83.529253] nvidia 0000:01:00.0: power state changed by ACPI to D0
Jul 19 22:46:29 jogopc kernel: [ 83.529257] nvidia 0000:01:00.0: power state changed by ACPI to D0
Jul 19 22:46:29 jogopc kernel: [ 83.529262] nvidia 0000:01:00.0: enabling device (0006 -> 0007)
Jul 19 22:46:29 jogopc kernel: [ 83.529267] nvidia 0000:01:00.0: PCI INT A -> GSI 16 (level, low) -> IRQ 16
Jul 19 22:46:29 jogopc kernel: [ 83.529274] nvidia 0000:01:00.0: setting latency timer to 64
Jul 19 22:46:29 jogopc kernel: [ 83.529278] vgaarb: device changed decodes: PCI:0000:01:00.0,olddecodes=io+mem,decodes=none:owns=none
Jul 19 22:46:29 jogopc kernel: [ 83.529351] NVRM: loading NVIDIA UNIX x86_64 Kernel Module 295.59 Wed Jun 6 21:19:40 PDT 2012
Jul 19 22:46:39 jogopc kernel: [ 93.626197] NVRM: RmInitAdapter failed! (0x26:0xffffffff:1170)
Jul 19 22:46:39 jogopc kernel: [ 93.626205] NVRM: rm_init_adapter(0) failed
Jul 19 22:48:02 jogopc kernel: [ 176.516692] ehci_hcd 0000:00:1d.0: PCI INT A disabled
Jul 19 22:48:02 jogopc kernel: [ 176.516744] ehci_hcd 0000:00:1d.0: PME# enabled

dmesg

[ 83.484889] nvidia: module license 'NVIDIA' taints kernel.
[ 83.484892] Disabling lock debugging due to kernel taint
[ 83.529253] nvidia 0000:01:00.0: power state changed by ACPI to D0
[ 83.529257] nvidia 0000:01:00.0: power state changed by ACPI to D0
[ 83.529262] nvidia 0000:01:00.0: enabling device (0006 -> 0007)
[ 83.529267] nvidia 0000:01:00.0: PCI INT A -> GSI 16 (level, low) -> IRQ 16
[ 83.529274] nvidia 0000:01:00.0: setting latency timer to 64
[ 83.529278] vgaarb: device changed decodes: PCI:0000:01:00.0,olddecodes=io+mem,decodes=none:owns=none
[ 83.529351] NVRM: loading NVIDIA UNIX x86_64 Kernel Module 295.59 Wed Jun 6 21:19:40 PDT 2012
[ 93.626197] NVRM: RmInitAdapter failed! (0x26:0xffffffff:1170)
[ 93.626205] NVRM: rm_init_adapter(0) failed
[ 176.516692] ehci_hcd 0000:00:1d.0: PCI INT A disabled
[ 176.516744] ehci_hcd 0000:00:1d.0: PME# enabled

Xorg.8.log

[ 83.383] X Protocol Version 11, Revision 0
[ 83.383] Build Operating System: Linux 2.6.42-26-generic x86_64 Ubuntu
[ 83.383] Current Operating System: Linux jogopc 3.2.0-26-generic #41-Ubuntu SMP Thu Jun 14 17:49:24 UTC 2012 x86_64
[ 83.383] Kernel command line: BOOT_IMAGE=/boot/vmlinuz-3.2.0-26-generic root=UUID=87015997-e65b-4530-a633-412a6b06ba86 ro quiet splash vt.handoff=7
[ 83.383] Build Date: 16 July 2012 08:06:31PM
[ 83.383] xorg-server 2:1.11.4-0ubuntu10.6 (For technical support please see http://www.ubuntu.com/support)
[ 83.383] Current version of pixman: 0.24.4
[ 83.383] Before reporting problems, check http://wiki.x.org
to make sure that you have the latest version.
[ 83.383] Markers: (--) probed, (*) from config file, (==) default setting,
(++) from command line, (!!) notice, (II) informational,
(WW) warning, (EE) error, (NI) not implemented, (??) unknown.
83.383 Log file: "/var/log/Xorg.8.log", Time: Thu Jul 19 22:46:29 2012
83.383 Using config file: "/etc/bumblebee/xorg.conf.nvidia"
83.383 Using system config directory "/usr/share/X11/xorg.conf.d"
83.384 ServerLayout "Layout0"
83.384 No screen section available. Using defaults.
83.384 |-->Screen "Default Screen Section" (0)
83.384 | |-->Monitor ""
83.384 No device specified for screen "Default Screen Section".
Using the first device section listed.
83.384 | |-->Device "Device1"
83.384 No monitor specified for screen "Default Screen Section".
Using a default monitor configuration.
83.384 Option "AutoAddDevices" "false"
83.384 Not automatically adding devices
83.384 Automatically enabling devices
83.384 The directory "/usr/share/fonts/X11/cyrillic" does not exist.
[ 83.384] Entry deleted from font path.
83.384 The directory "/usr/share/fonts/X11/100dpi/" does not exist.
[ 83.384] Entry deleted from font path.
83.384 The directory "/usr/share/fonts/X11/75dpi/" does not exist.
[ 83.384] Entry deleted from font path.
83.384 The directory "/usr/share/fonts/X11/100dpi" does not exist.
[ 83.384] Entry deleted from font path.
83.384 The directory "/usr/share/fonts/X11/75dpi" does not exist.
[ 83.384] Entry deleted from font path.
83.384 The directory "/var/lib/defoma/x-ttcidfont-conf.d/dirs/TrueType" does not exist.
[ 83.384] Entry deleted from font path.
83.384 FontPath set to:
/usr/share/fonts/X11/misc,
/usr/share/fonts/X11/Type1,
built-ins
83.384 ModulePath set to "/usr/lib/nvidia-current/xorg,/usr/lib/xorg/modules"
83.384 |-->Input Device ""
83.384 |-->Input Device ""
83.384 The core pointer device wasn't specified explicitly in the layout.
Using the default mouse configuration.
83.384 The core keyboard device wasn't specified explicitly in the layout.
Using the default keyboard configuration.
83.384 Loader magic: 0x7f813ab57b00
83.384 Module ABI versions:
[ 83.384] X.Org ANSI C Emulation: 0.4
[ 83.384] X.Org Video Driver: 11.0
[ 83.384] X.Org XInput driver : 16.0
[ 83.384] X.Org Server Extension : 6.0
83.385 PCI:
(0:1:0:0) 10de:0fd1:1558:1550 rev 161, Mem @ 0xf6000000/16777216, 0xe0000000/268435456, 0xf0000000/33554432, I/O @ 0x0000e000/128, BIOS @ 0x????????/524288
83.385 Open ACPI successful (/var/run/acpid.socket)
83.385 LoadModule: "extmod"
83.385 Loading /usr/lib/xorg/modules/extensions/libextmod.so
83.386 Module extmod: vendor="X.Org Foundation"
[ 83.386] compiled for 1.11.3, module version = 1.0.0
[ 83.386] Module class: X.Org Server Extension
[ 83.386] ABI class: X.Org Server Extension, version 6.0
83.386 Loading extension MIT-SCREEN-SAVER
83.386 Loading extension XFree86-VidModeExtension
83.386 Loading extension XFree86-DGA
83.386 Loading extension DPMS
83.386 Loading extension XVideo
83.386 Loading extension XVideo-MotionCompensation
83.386 Loading extension X-Resource
83.386 LoadModule: "dbe"
83.386 Loading /usr/lib/xorg/modules/extensions/libdbe.so
83.386 Module dbe: vendor="X.Org Foundation"
[ 83.386] compiled for 1.11.3, module version = 1.0.0
[ 83.386] Module class: X.Org Server Extension
[ 83.386] ABI class: X.Org Server Extension, version 6.0
83.386 Loading extension DOUBLE-BUFFER
83.386 LoadModule: "glx"
83.386 Loading /usr/lib/nvidia-current/xorg/libglx.so
83.771 Module glx: vendor="NVIDIA Corporation"
[ 83.771] compiled for 4.0.2, module version = 1.0.0
[ 83.771] Module class: X.Org Server Extension
83.771 NVIDIA GLX Module 295.59 Wed Jun 6 21:40:44 PDT 2012
83.771 Loading extension GLX
83.771 LoadModule: "record"
83.771 Loading /usr/lib/xorg/modules/extensions/librecord.so
83.771 Module record: vendor="X.Org Foundation"
[ 83.771] compiled for 1.11.3, module version = 1.13.0
[ 83.771] Module class: X.Org Server Extension
[ 83.771] ABI class: X.Org Server Extension, version 6.0
83.771 Loading extension RECORD
83.771 LoadModule: "dri"
83.772 Loading /usr/lib/xorg/modules/extensions/libdri.so
83.772 Module dri: vendor="X.Org Foundation"
[ 83.772] compiled for 1.11.3, module version = 1.0.0
[ 83.772] ABI class: X.Org Server Extension, version 6.0
83.772 Loading extension XFree86-DRI
83.772 LoadModule: "dri2"
83.772 Loading /usr/lib/xorg/modules/extensions/libdri2.so
83.772 Module dri2: vendor="X.Org Foundation"
[ 83.772] compiled for 1.11.3, module version = 1.2.0
[ 83.772] ABI class: X.Org Server Extension, version 6.0
83.772 Loading extension DRI2
83.772 LoadModule: "nvidia"
83.772 Loading /usr/lib/nvidia-current/xorg/nvidia_drv.so
83.823 Module nvidia: vendor="NVIDIA Corporation"
[ 83.823] compiled for 4.0.2, module version = 1.0.0
[ 83.823] Module class: X.Org Video Driver
83.849 LoadModule: "mouse"
83.849 Loading /usr/lib/xorg/modules/input/mouse_drv.so
83.851 Module mouse: vendor="X.Org Foundation"
[ 83.851] compiled for 1.11.3, module version = 1.7.1
[ 83.851] Module class: X.Org XInput Driver
[ 83.851] ABI class: X.Org XInput driver, version 16.0
83.851 LoadModule: "kbd"
83.852 Warning, couldn't open module kbd
83.852 UnloadModule: "kbd"
83.852 Unloading kbd
83.852 Failed to load module "kbd" (module does not exist, 0)
83.852 NVIDIA dlloader X Driver 295.59 Wed Jun 6 21:21:24 PDT 2012
83.852 NVIDIA Unified Driver for all Supported NVIDIA GPUs
83.852 using VT number 7

83.853 Loading sub module "fb"
83.853 LoadModule: "fb"
83.853 Loading /usr/lib/xorg/modules/libfb.so
83.853 Module fb: vendor="X.Org Foundation"
[ 83.853] compiled for 1.11.3, module version = 1.0.0
[ 83.853] ABI class: X.Org ANSI C Emulation, version 0.4
83.853 Loading sub module "wfb"
83.853 LoadModule: "wfb"
83.853 Loading /usr/lib/xorg/modules/libwfb.so
83.858 Module wfb: vendor="X.Org Foundation"
[ 83.858] compiled for 1.11.3, module version = 1.0.0
[ 83.858] ABI class: X.Org ANSI C Emulation, version 0.4
83.858 Loading sub module "ramdac"
83.858 LoadModule: "ramdac"
83.858 Module "ramdac" already built-in
83.859 Loading /usr/lib/nvidia-current/xorg/nvidia_drv.so
83.859 Loading /usr/lib/xorg/modules/libwfb.so
83.859 Loading /usr/lib/xorg/modules/libfb.so
83.869 NVIDIA(0): Creating default Display subsection in Screen section
"Default Screen Section" for depth/fbbpp 24/32
83.869 NVIDIA(0): Depth 24, (==) framebuffer bpp 32
83.869 NVIDIA(0): RGB weight 888
83.869 NVIDIA(0): Default visual is TrueColor
83.869 NVIDIA(0): Using gamma correction (1.0, 1.0, 1.0)
83.869 NVIDIA(0): Option "NoLogo" "true"
83.869 NVIDIA(0): Option "UseEDID" "false"
83.869 NVIDIA(0): Option "ConnectedMonitor" "DFP"
83.869 NVIDIA(0): Enabling 2D acceleration
83.869 NVIDIA(0): ConnectedMonitor string: "DFP"
83.869 NVIDIA(0): Ignoring EDIDs
93.486 NVIDIA(0): Failed to initialize the NVIDIA GPU at PCI:1:0:0. Please
93.486 NVIDIA(0): check your system's kernel log for additional error
93.486 NVIDIA(0): messages and refer to Chapter 8: Common Problems in the
93.486 NVIDIA(0): README for additional information.
93.486 NVIDIA(0): Failed to initialize the NVIDIA graphics device!
93.486 UnloadModule: "nvidia"
93.486 Unloading nvidia
93.486 UnloadModule: "wfb"
93.486 Unloading wfb
93.486 UnloadModule: "fb"
93.486 Unloading fb
93.486 Screen(s) found, but none have a usable configuration.
[ 93.486]
Fatal server error:
[ 93.486] no screens found
[ 93.486]
Please consult the The X.Org Foundation support
at http://wiki.x.org
for help.
[ 93.486] Please also check the log file at "/var/log/Xorg.8.log" for additional information.
[ 93.486]
[ 93.486] ddxSigGiveUp: Closing log
[ 93.486] Server terminated with error (1). Closing log file.

I have installed bumblebee 3.0.2, Xorg 11.30, kernel 3.2.0-26-generic, and nvidia-current 295.59 on Linux Mint 13 Maya.
The nouveau driver does not work with this configuration because it does not recognize the GPU.

If I type 'optirun' in a terminal, it takes 3 seconds or more and then reports the errors.

@ArchangeGabriel
Bumblebee-Project member

As I said above,
[ 93.626197] NVRM: RmInitAdapter failed! (0x26:0xffffffff:1170)
[ 93.626205] NVRM: rm_init_adapter(0) failed

is a kernel error according to nVidia, and the only thing to do is wait for ACPI to be fixed in the kernel, which could take a long time. That is, unless the fix for @amonakov's issue #201 solves this too, as he wrote in #197.
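
For anyone comparing their own logs against the ones posted here, a filter such as the following pulls out just the NVRM failure lines quoted in this thread. It is a sketch under assumptions: the function name is invented, and the default path /var/log/kern.log matches the Ubuntu reports above but may differ on other distributions.

```shell
# nvrm_failures [LOG]
# Prints the NVRM lines that report a failed adapter init or a GPU that
# has fallen off the bus. LOG defaults to /var/log/kern.log.
nvrm_failures() {
    grep -E 'NVRM: (RmInitAdapter failed|rm_init_adapter\([0-9]+\) failed|GPU at .* has fallen off the bus)' "${1:-/var/log/kern.log}"
}
```

The numbers in parentheses after "RmInitAdapter failed!" differ between reports in this thread, so it is worth quoting the full line when reporting.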

@joey55

@ArchangeGabriel: I do not see the "NVRM: failed to copy vbios to system memory." message in syslog.
However, assuming this is a kernel issue: is there a bug reported in the kernel for this?
Is there anything more I can do to help solving this problem?

@ArchangeGabriel
Bumblebee-Project member

Yes, but the "failed to copy vbios" message is no longer shown by newer drivers, as you can see in #204. However, it seems that #201 is going to solve that too.

@joey55

The workaround mentioned in #204 works for me. Great!
It can be used repeatedly. Does nvidia-xconfig do something that is missing in optirun?

$ sudo tee /proc/acpi/bbswitch <<<ON; sudo nvidia-xconfig -query-gpu-info; optirun glxspheres
@amonakov

No, it simply initializes the card. Otherwise, initialization is done in the X server forked from the bumblebee daemon, and that fails due to a race condition with polling a socket. Please note that a fix is available in hotfix-3.0.1 branch of this repo.
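
The workaround can be wrapped in a small helper so that it stops early when the card does not power on. This is a sketch, not part of bumblebee: both function names are hypothetical, the nvidia-xconfig call is copied from the command above, and the parser assumes the bbswitch proc file reports a single line like "0000:01:00.0 ON".

```shell
# bbswitch_state [FILE]
# Prints ON or OFF, taken from the second column of the bbswitch proc file.
bbswitch_state() {
    awk '{ print $2; exit }' "${1:-/proc/acpi/bbswitch}"
}

# preinit_and_run COMMAND...
# Powers the card on, lets nvidia-xconfig initialize it, then starts
# COMMAND under optirun. Gives up if the card does not report ON.
preinit_and_run() {
    echo ON | sudo tee /proc/acpi/bbswitch >/dev/null || return 1
    [ "$(bbswitch_state)" = "ON" ] || return 1
    # As in the original one-liner, the result of this step is not checked.
    sudo nvidia-xconfig -query-gpu-info >/dev/null
    optirun "$@"
}
```

With these defined, `preinit_and_run glxspheres` performs the same three steps as the one-liner.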

@joey55

Thanks. I will check out the fix.

@ArchangeGabriel
Bumblebee-Project member

Please note that this is a temporary fix, and it is not finished right now. Fixing this like that has some issues, and we have to make some more changes before a new version including this fix will be out.

@amonakov

Fixing this like that has some issues

Can you please explain what exactly the issues are?

we have to make some more changes before a new version including this fix will be out

What changes are needed?

@ArchangeGabriel
Bumblebee-Project member

AFAIK, with blocking sockets everywhere, you can't run more than one instance of optirun at once, for example. This will be solved by implementing either socket polling or pthreads, and before I left yesterday, @Thulinma and @Samsagax were working on the second one.

@amonakov

No, what the hotfix-3.0.1 branch does is switch only optirun.c to a blocking socket, which fixes the issue at hand and does not restrict users to one optirun instance. It's ready to be used by all users.

Switching to blocking sockets everywhere can be done at any later point.

@ArchangeGabriel
Bumblebee-Project member

Should now be fixed in 3.0.1.

Arch package being currently built, Ubuntu ones will probably take a little more time.

@joey55

I reinstalled bumblebee from git, and confirm that the issue is fixed for me.
Great work!

@ArchangeGabriel
Bumblebee-Project member

Nice to hear. Now we're just waiting for the other joey to confirm.

@yimm

This error (#172 (comment)) is fixed too.
Bumblebee 3.0.1 + nvidia 304.22 beta
Thanks to the team.

@throgh

Hello everyone!

I've just found this report because the problem also occurs on my new laptop (which also uses the GeForce GT650M). So a big "THANKS" for this information. I also have a question: how can I install bumblebee from Git? Sorry for the dumb question; I'm new to this and still finding my way into the depths of my Ubuntu Linux 12.04 x64.

And one last question: is there also a stable package to install? The last time I looked, the version was 3.0.1, but the error wasn't gone. Thanks!

@ArchangeGabriel
Bumblebee-Project member

3.0.1 and git are currently the same. Could you open a new issue and provide all the useful information?

@throgh

Okay, I'm going to post more information in a new ticket tomorrow or later in the day. The basic error message is the same one other members reported when trying to start "optirun". I've installed Ubuntu 12.04 again and will give this another try.

Big thanks for this great toolset and to the team behind "bumblebee"!

@joeythesaint

Woo! Took a little bit of doing to unwind some of the changes I'd made to my config in the name of debugging, but I can confirm bumblebee is now working properly for me as well on my machine with 3.0.1 and the 304.22 NVidia drivers on my 3.5.0-6 kernel from the xorg-edgers PPA.

Life's good, thanks!

@studentz
@Witos

I've got gentoo-3.3.8, a GT 650M, bumblebee 3.0.1 from portage, and Option "UseDisplayDevice" set. I still get the above error. Any clues?

optirun -vv glxspheres
[ 553.841462] [DEBUG]Reading file: /etc/bumblebee/bumblebee.conf
[ 553.841637] [INFO]Configured driver: nvidia
[ 553.955200] [DEBUG]optirun version 3.0.1 starting...
[ 553.955213] [DEBUG]Active configuration:
[ 553.955229] [DEBUG] bumblebeed config file: /etc/bumblebee/bumblebee.conf
[ 553.955235] [DEBUG] X display: :8
[ 553.955246] [DEBUG] LD_LIBRARY_PATH: /usr/lib64/opengl/nvidia/lib:/usr/lib32/opengl/nvidia/lib:/usr/lib/opengl/nvidia/lib
[ 553.955253] [DEBUG] Socket path: /var/run/bumblebee.socket
[ 553.955259] [DEBUG] VGL Compression: proxy
[ 554.070160] [INFO]Response: No - error: Could not enable discrete graphics card

[ 554.070184] [ERROR]Cannot access secondary GPU - error: Could not enable discrete graphics card

[ 554.070197] [DEBUG]Socket closed.
[ 554.070211] [ERROR]Aborting because fallback start is disabled.
[ 554.070219] [DEBUG]Killing all remaining processes.

systemlog:

Sep 19 16:40:13 witos-linux kernel: bbswitch: enabling discrete graphics
Sep 19 16:40:13 witos-linux kernel: nvidia 0000:01:00.0: power state changed by ACPI to D0
Sep 19 16:40:13 witos-linux kernel: nvidia 0000:01:00.0: Refused to change power state, currently in D3
Sep 19 16:40:13 witos-linux kernel: nvidia 0000:01:00.0: power state changed by ACPI to D0
Sep 19 16:40:13 witos-linux bumblebeed[14981]: Could not enable discrete graphics card
Sep 19 16:41:31 witos-linux kernel: NVRM: RmInitAdapter failed! (0x23:0x2f:675)
Sep 19 16:41:31 witos-linux kernel: NVRM: rm_init_adapter(0) failed
Sep 19 16:44:54 witos-linux kernel: bbswitch: enabling discrete graphics
Sep 19 16:44:54 witos-linux kernel: nvidia 0000:01:00.0: power state changed by ACPI to D0
Sep 19 16:44:54 witos-linux kernel: nvidia 0000:01:00.0: Refused to change power state, currently in D3
Sep 19 16:44:54 witos-linux kernel: nvidia 0000:01:00.0: power state changed by ACPI to D0
Sep 19 16:44:54 witos-linux bumblebeed[14981]: Could not enable discrete graphics card

@amonakov

What if you use it like this:

sudo tee /proc/acpi/bbswitch <<<ON; sudo nvidia-xconfig -query-gpu-info; optirun glxspheres
@Witos

ON
NVIDIA: could not open the device file /dev/nvidia0 (Input/output error).

WARNING: Unable to use the nvidia-cfg library to query NVIDIA hardware.

ERROR: Unable to query GPU information

[ 1494.620166] [ERROR]Cannot access secondary GPU - error: Could not enable discrete graphics card

[ 1494.620201] [ERROR]Aborting because fallback start is disabled.

My nvidia-drivers - 304.43

@amonakov

Do you have CONFIG_NO_HZ and CONFIG_RCU_FAST_NO_HZ enabled in kernel config? See the latest post of this thread: http://www.nvnews.net/vbulletin/showthread.php?t=191780

@Witos

I didn't have the second one, but after enabling it, compiling and installing the kernel, reinstalling nvidia-drivers, bumblebee and bbswitch, and rebooting: no progress :(

@godlike64

Witos, try to narrow down the problem by starting at the lowest level possible. Make sure that none of bbswitch, bumblebee, or the nvidia module is loaded upon reboot (for nvidia, you might have to rm /lib/udev/nvidia-udev.sh or something like that, an ugly udev rule that might be doing more harm than good here). After a clean reboot, modprobe the nvidia module and try to run one of the most basic programs that access the card (nvidia-xconfig, nvidia-smi, or one of the simple CUDA programs from the SDK; the first two come bundled with the drivers, so they might be your best chance). If, after running that, you get "GPU has fallen off the bus", then you have the same problem as me (or a related one).

I'm the one who came up with those two kernel options, after endless hours of debugging last night. Since I knew that it had worked under Ubuntu on this same laptop (Thinkpad W530 with a Quadro K1000M), I booted a live ISO of 12.04 amd64 (to make sure it worked on 64 bits) and tested it. Of course it worked, so the next step was to download ubuntu-sources on my Gentoo installation, use the config from Ubuntu's live ISO, compile it and test it (ubuntu-sources because there may be some patches that are in neither gentoo-sources nor vanilla-sources; the config because, if it wasn't the patches, it might be a .config option).

Booting the Ubuntu kernel inside my Gentoo box worked, so by now I had made sure that the problem was neither the hardware nor the OS, but the kernel. What was left was to diff the two .config files side by side and start testing. I got lucky, since it worked by the time I got to the tickless part.
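
The comparison step described here can be narrowed to the lines that matter, since a raw diff of two .config files is dominated by ordering and comment noise. The following is a rough sketch; `config_diff` is a made-up name and both file arguments are whatever configs are being compared.

```shell
# config_diff OLD NEW
# Lists options whose setting differs between two kernel .config files.
# Lines found only in OLD are prefixed with "<", lines only in NEW with ">".
config_diff() {
    cfg_old=$(mktemp) && cfg_new=$(mktemp) || return 1
    # Keep only "CONFIG_X=..." and "# CONFIG_X is not set" lines, sorted,
    # so that ordering differences between the files do not show up.
    grep -E '^(CONFIG_[A-Za-z0-9_]+=|# CONFIG_[A-Za-z0-9_]+ is not set)' "$1" | sort > "$cfg_old"
    grep -E '^(CONFIG_[A-Za-z0-9_]+=|# CONFIG_[A-Za-z0-9_]+ is not set)' "$2" | sort > "$cfg_new"
    diff "$cfg_old" "$cfg_new" | grep '^[<>]'
    rm -f "$cfg_old" "$cfg_new"
}
```

Called as `config_diff working.config broken.config` (hypothetical file names), it prints only the options that are set differently, which makes it easier to test them one group at a time.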

You might want to play around with enabling more things under the RCU subsystem (I remember setting one of the values there from 32 to 64; I can't recall what it's named, but it's the only numerical value you can touch there), enabling the rest, and I THINK that I also enabled everything IOMMU related (the settings are spread out in two or three places).

Anyway, sorry for the long post. If you need further help, I'm at #bumblebee on Freenode. Good luck!

@godlike64

I just did some more testing. I can confirm that the previous two settings plus enabling CONFIG_CALGARY_IOMMU and CONFIG_CALGARY_IOMMU_ENABLED_BY_DEFAULT make the card work. If the IOMMU config options are missing, I don't get "has fallen off the bus", but the card "breaks" nevertheless.
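A quick way to verify that options like these are really set in a given config, as a rough sketch. The helper name `check_kconfig` is invented here, and the path in the usage line is only the usual Gentoo location, so adjust it to wherever your .config lives.

```shell
#!/bin/sh
# Report whether each named option is built in (=y) or a module (=m)
# in the given kernel config file.
check_kconfig() {
    cfg=$1
    shift
    for opt in "$@"; do
        if grep -q "^${opt}=[ym]\$" "$cfg"; then
            echo "$opt: enabled"
        else
            echo "$opt: not enabled"
        fi
    done
}
```

For example: `check_kconfig /usr/src/linux/.config CONFIG_CALGARY_IOMMU CONFIG_CALGARY_IOMMU_ENABLED_BY_DEFAULT`. Other option names can be appended to the same call.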

@Witos

Thank you guys, I'm really grateful for your help. Re-emerging xorg-server did the trick. It works now: optirun starts glxspheres with a big boost. The Dell 7720 17R, with a GT650M on board, is supported by bumblebee!

@Witos

Well, it stopped working again after a reboot, although I've changed nothing. Strange... Before the successful run I had booted Windows, checked the nvidia settings, and rebooted to Linux; maybe that was why it ran. @godlike64 could you please send me your .config so I can diff it with my own (snajper[at]o2.pl)?

@godlike64

Sure, here it is: http://bpaste.net/show/46582/

Note that yesterday after my first post we discovered with amonakov that IOMMU settings in the kernel were relevant too.

@Witos

Thanks. I already had those IOMMU options enabled. It's hard to compare the .config files, since my kernel was made by genkernel with some additions from gentoo-wiki. I should build a new kernel by hand and then check the differences, but unfortunately I don't have much time for that. The strange thing is that I re-emerged xorg-server and bbswitch again and it worked for one boot; after a reboot it stopped working, and re-emerging didn't help this time. Maybe there is a race condition during boot, or maybe the xorg-server files are overwritten by some process... If I find out something, I'll let you know.

@godlike64

Witos, have you made sure that nvidia module does NOT get loaded upon reboot?

@Witos

I tried that, I removed it from /etc/conf.d/modules, removed bumblebee from rc-update, but it gets loaded anyway and I don't know how yet.

@Lekensteyn
Bumblebee-Project member

Pass the modprobe.blacklist=nvidia option or create an /etc/modprobe.d/(whatever).conf file containing blacklist nvidia.
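As a rough sketch of the second option, plus a check that the module really stayed unloaded after reboot. The helper names and the file name `blacklist-<module>.conf` are arbitrary choices for illustration; the optional second arguments exist only so the functions can be tried on scratch files.

```shell
#!/bin/sh
# Write "blacklist <module>" into <dir>/blacklist-<module>.conf.
# The directory defaults to /etc/modprobe.d (needs root there).
blacklist_module() {
    mod=$1
    dir=${2:-/etc/modprobe.d}
    printf 'blacklist %s\n' "$mod" > "$dir/blacklist-$mod.conf"
}

# Succeed if the module appears in the loaded-module list.
# The list defaults to /proc/modules, whose first field is the module name.
module_loaded() {
    awk -v m="$1" '$1 == m { found = 1 } END { exit !found }' "${2:-/proc/modules}"
}
```

Usage would be `blacklist_module nvidia` as root, then after a reboot `module_loaded nvidia && echo "still loaded"`. Keep in mind that a blacklist line only stops alias-based autoloading; a script that runs `modprobe nvidia` explicitly will still load the module.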

@godlike64

Witos, on Gentoo a udev rule is installed in /lib/udev/nvidia-udev.sh (I can't recall the exact name right now, but it's definitely under /lib/udev). In my case that was messing things up. It is safe to remove it (or move it to another directory if you wish to keep it). The file is reinstalled on every emerge of nvidia-drivers. Try removing that file and rebooting.

@Witos

Hi again,
This is what I did: I got brand new 3.4.9 gentoo-sources and made a new kernel with defconfig. I mimicked @godlike64's config in terms of Graphics Drivers (i.e. I didn't have VGA_SWITCHEROO) and turned on the configs that @godlike64 suggested. I removed all x11 packages, bumblebee, and the drivers, and reinstalled them. After that, optirun kept working every time across 3 reboots. Thanks guys!

@godlike64

Glad to hear that! Could you upload your final kernel config?

@Witos

Sure: http://bpaste.net/show/46880/ , dell 17R 7720, bbswitch 0.4.2, nvidia-drivers 304.48, bumblebee 3.0.1

@babau

Same issue here. I'm on Gentoo with kernel 3.6.2, and everything that has been suggested in this thread is enabled.

nvidia drivers 304.51

but

optirun -vv glxsphere
[ 408.623778] [DEBUG]Reading file: /etc/bumblebee/bumblebee.conf
[ 408.624114] [INFO]Configured driver: nvidia
[ 408.804918] [DEBUG]optirun version 3.0.1 starting...
[ 408.804957] [DEBUG]Active configuration:
[ 408.804965] [DEBUG] bumblebeed config file: /etc/bumblebee/bumblebee.conf
[ 408.804971] [DEBUG] X display: :8
[ 408.804977] [DEBUG] LD_LIBRARY_PATH: /usr/lib64/opengl/nvidia/lib:/usr/lib32/opengl/nvidia/lib:/usr/lib/opengl/nvidia/lib
[ 408.804995] [DEBUG] Socket path: /var/run/bumblebee.socket
[ 408.805001] [DEBUG] VGL Compression: jpeg
[ 408.916935] [INFO]Response: No - error: Could not enable discrete graphics card

[ 408.916973] [ERROR]Cannot access secondary GPU - error: Could not enable discrete graphics card

[ 408.916986] [DEBUG]Socket closed.
[ 408.917014] [ERROR]Aborting because fallback start is disabled.
[ 408.917024] [DEBUG]Killing all remaining processes.

kern.log

[ 25.815658] bbswitch: enabling discrete graphics
[ 26.050878] pci 0000:01:00.0: power state changed by ACPI to D0
[ 26.050909] thinkpad_acpi: EC reports that Thermal Table has changed
[ 26.062638] pci 0000:01:00.0: Refused to change power state, currently in D3
[ 26.122819] pci 0000:01:00.0: power state changed by ACPI to D0
[ 26.136003] pci 0000:01:00.0: Refused to change power state, currently in D3
[ 28.238425] bbswitch: enabling discrete graphics
[ 28.238446] pci 0000:01:00.0: power state changed by ACPI to D0
[ 28.248664] pci 0000:01:00.0: Refused to change power state, currently in D3
[ 28.248687] pci 0000:01:00.0: power state changed by ACPI to D0
[ 28.261995] pci 0000:01:00.0: Refused to change power state, currently in D3
[ 408.899528] bbswitch: enabling discrete graphics
[ 408.899546] pci 0000:01:00.0: power state changed by ACPI to D0
[ 408.911132] pci 0000:01:00.0: Refused to change power state, currently in D3

@godlike64

Have you ensured that the nvidia driver does not get loaded on startup (see my first post in this thread)? If you have, try rebooting cleanly (making sure the NVIDIA card is off and does not get turned on by things like the nvidia driver module autoloading), and immediately after reboot run nvidia-xconfig -query-gpu-info. If you can disable X, that helps as an additional measure to make sure nothing else tries to access the card. In my case, when it was failing, that command took around 3 seconds to run before exiting with an error, and if you look at dmesg immediately afterwards, you will see the true error. Those ACPI errors you posted tend to happen when something tries to access the card while, for example, it is turned off or something else has corrupted its state.

If you still see "has fallen off the bus" in dmesg even though you have enabled the options I mentioned before, there could be some other option influencing this that I have not noticed. I can give you the Ubuntu kernel config I used; installing ubuntu-sources with that config should at least get your card properly up and running.
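The "run the query, then look at dmesg" step can be sketched as a small filter, assuming the two messages quoted in this thread are the ones to watch for. The function name `gpu_errors` and the line count in the usage example are arbitrary.

```shell
#!/bin/sh
# Read kernel log text on stdin and print only the lines that match the
# two failure messages quoted in this thread.
gpu_errors() {
    grep -E 'has fallen off the bus|Refused to change power state'
}
```

Right after the clean reboot, that would be: `nvidia-xconfig -query-gpu-info; dmesg | tail -n 50 | gpu_errors`. No output means neither message showed up in the last 50 lines, so look at the unfiltered dmesg tail as well.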

@babau

I blacklisted the nvidia driver, but got the same result.

if this can help I have a Lenovo W530 with K2000M on board

@godlike64

Could you do a clean reboot, ensure nvidia driver was not loaded, and run nvidia-xconfig -query-gpu-info? If it fails, right after that check the last lines of dmesg and paste them here. I believe the problem lies with the very first access to the card.

@babau

nvidia module is not loaded

nvidia-xconfig -query-gpu-info
NVIDIA: could not open the device file /dev/nvidiactl (No such file or directory).

ERROR: Unable to query GPU information

but nothing shows up in my dmesg

[ 10.791310] EXT4-fs (sda2): re-mounted. Opts: discard,commit=0
[ 10.987301] EXT4-fs (dm-2): re-mounted. Opts: commit=0
[ 10.988997] EXT4-fs (dm-0): re-mounted. Opts: commit=0
[ 10.990371] EXT4-fs (dm-1): re-mounted. Opts: commit=0
[ 11.209627] Bluetooth: HIDP (Human Interface Emulation) ver 1.2

@godlike64

That's weird... running that command should load the module for you. Are you sure you set video mode to Optimus in the BIOS?

@babau

bios is set to optimus

if this can help here is my kernel config

http://www.babau.me/config

@Lekensteyn
Bumblebee-Project member

@babau Can you try kernel 3.5? Someone reported issues with kernel 3.6: Bumblebee-Project/bbswitch#35

@babau

Sorry for the late reply. I downgraded the kernel to 3.5.7 and now everything is working perfectly.

thx for the support
