GUI won't start on Windows (unhandled exception in ggml_vk_available_devices) #1477

ADD-eNavarro · 2023-10-06T06:39:10Z

cebtenzzre · 2023-10-06T14:22:02Z

It would be really helpful if you could build GPT4All from source in Debug mode, and run it under either the Visual Studio debugger, or windbg, in order to get the call stack. Unfortunately, the binaries we publish are stripped Release builds with very little information to assist debugging.

ADD-eNavarro · 2023-10-10T13:13:33Z

That won't be easy. I'm not much of a developer, and cpp is not among the languages I know well. Also, I have security constraints, imposed by my enterprise, to install/run third party's code (I had to ask permission and wait for a week just to have the program installed). All in all, I don't see myself doing that.
Any volunteers?

cosmic-snow · 2023-10-10T13:51:17Z

You mean 2.4.19 not 2.4.9, right?

First of all, one thing you can try is rename your settings file, which is located at C:\Users\<name>\AppData\Roaming\nomic.ai\GPT4All.ini. Try giving it a different extension (so you have it backed up). A new one with default values will be created automatically the next time you start GPT4All.

If that doesn't help, you can also try adding a line device=CPU to the General section, or change the line if device= already exists there, e.g.:

[General]
device=CPU
...

Close the program before you do that and restart it afterwards.

ADD-eNavarro · 2023-10-10T14:18:02Z

You mean 2.4.19 not 2.4.9, right?

Yes, sorry, already updated the issue title.

First of all, one thing you can try is rename your settings file, which is located at C:\Users\<name>\AppData\Roaming\nomic.ai\GPT4All.ini. Try giving it a different extension (so you have it backed up). A new one with default values will be created automatically the next time you start GPT4All.

Changed the extension, no success: GPT4All still won't start.

If that doesn't help, you can also try adding a line device=CPU to the General section, or change the line if device= already exists there, e.g.:
[General]
device=CPU
...
Close the program before you do that and restart it afterwards.
I didn't need to close the program for obvious reasons. Added the device configuration, still won't start :(

cebtenzzre · 2023-10-10T14:35:21Z

I uploaded a debug build of the installer to the releases page, it's called gpt4all-installer-win64-v2.5.0-pre1-debug.exe. If you install that, the output of Event Viewer will at least have some meaning to us. windbg would be even better:

Download the Windows SDK
Install it, clearing all checkboxes except for "Debugging Tools for Windows", which is the only one you would need
Start WinDbg (X64)
File > Open Executable, navigate to C:\Program Files\gpt4all\bin\chat.exe
If it stops at ntdll!LdrpDoDebuggerBreak, press the F5 key to continue
If it stops again, go to View > Call Stack, which will hopefully have useful information about the crash

ADD-eNavarro · 2023-10-18T10:26:17Z

Here's the result of following your instructions with Windbg:

cebtenzzre · 2023-10-18T14:10:39Z

Here's the result of following your instructions with Windbg:

Can you continue past that with F5? I think that's just another bug in Windows breakpoint handling, not an actual issue with the code. You should be able to continue until you get a call stack with lines other than ntdll!... in it.

ADD-eNavarro · 2023-10-18T15:22:05Z

Hope this is what you need:

cebtenzzre · 2023-10-18T15:43:48Z

Hope this is what you need:

Yes, that is very helpful, thanks.

edit: Could you please try to get info for the exception by running the .exr -1 command after windbg stops at that point?

ADD-eNavarro · 2023-10-19T06:17:35Z

Sure thing, here it goes:

cebtenzzre · 2023-10-19T14:39:25Z

Unfortunately, I'm not sure how to get the exception message with WinDbg. Here's another option:

I uploaded a console-enabled build (gpt4all-installer-win64-v2.5.0-pre2-debug-console.exe ) to the pre-release.

It would be helpful if you could start chat.exe via the command line - install that version, use "Open File Location" on the shortcut to find chat.exe, shift-right-click in the folder and open a powershell or command prompt there, and run .\chat (powershell) or chat (command prompt).

If there is any console output, please post it here.

ADD-eNavarro · 2023-10-20T05:56:47Z

Morning!

Got this:

I?m afraid all three options result in the process stopping without further message:

ADD-eNavarro · 2023-11-07T07:01:41Z

So, are we out of luck, @cebtenzzre ?

cebtenzzre · 2023-11-07T17:50:36Z

Unless you can debug it with Visual Studio (which I know will provide the exception information), I'm not sure what else to do.

H4CKS4F3 · 2023-11-07T19:53:20Z

Just a suggestion for debugging this. What about using procdump (from Microsoft) to help capture the stack trace. Something like: procdump -mm -x . chat.exe (assuming procdump v11 and that it's in the current path). The -mm switch is the minidump format, captures the basic process details. You can use something like WinDbg (and other tools) to debug it. Again, just a thought to help capture the instant it crashes.

ADD-eNavarro · 2023-11-08T11:39:52Z

@H4CKS4F3 , WinDbg was already used, if you read back a little.
I gave a try to procdump, here are the two files, first one with -mm and, since I couldn't see a thing in there, the second one without the minidump parameter.
dump.dmp
dump2.dmp

H4CKS4F3 · 2023-11-08T23:21:48Z

@ADD-eNavarro run the following and attach the dump. Since procdump defaults to not dump on unhandled exceptions, it lost the actual exception in the minidump. procdump -mm -e -x . chat.exe

ADD-eNavarro · 2023-11-09T06:43:14Z

Here's the result of that last procdump run:
dump3_231109_073726.dmp

cebtenzzre · 2023-11-09T18:33:27Z

Now we're getting somewhere:

KERNELBASE!RaiseException+6c    
VCRUNTIME140!_CxxThrowException+90 [D:\a\_work\1\s\src\vctools\crt\vcruntime\src\eh\throw.cpp @ 75]   D:\a\_work\1\s\src\vctools\crt\vcruntime\src\eh\throw.cpp @ 75 
llmodel+ba4dc    
0x0000002f`b14fd2b8

Unfortunately, I no longer have a copy of the debug info for that build of GPT4All, so I can't resolve llmodel+ba4dc to anything specific.

Here is a newer build that you can install and run the same procdump command on: gpt4all-installer-win64-v2.5.2.r8.gd4ce9f4-debug-console.exe

I'll keep that build tree in a separate folder so I'll be able to debug it when you reply.

ADD-eNavarro · 2023-11-10T08:53:16Z

New dump:
dump4_231110_094643.dmp

cebtenzzre · 2023-11-10T21:51:29Z

Here is the call stack when the exception is thrown:

KERNELBASE!RaiseException+0x6c
VCRUNTIME140D!_CxxThrowException+0x120
llmodel!vk::detail::throwResultException+0x29c
llmodel!vk::resultCheck+0x23
llmodel!vk::Instance::enumeratePhysicalDevices<std::allocator<vk::PhysicalDevice>,vk::DispatchLoaderDynamic>+0xf7
llmodel!kp::Manager::listDevices+0x38
llmodel!ggml_vk_available_devices+0xf6
llmodel!LLModel::availableGPUDevices+0x4f
chat!MySettings::MySettings+0x74
chat!MyPrivateSettings::MyPrivateSettings+0x14
chat!`anonymous namespace'::Q_QGS_settingsInstance::innerFunction+0x36
chat!QtGlobalStatic::Holder<`anonymous namespace'::Q_QGS_settingsInstance>::Holder<`anonymous namespace'::Q_QGS_settingsInstance>+0x1c
chat!QGlobalStatic<QtGlobalStatic::Holder<`anonymous namespace'::Q_QGS_settingsInstance> >::instance+0x4c
chat!QGlobalStatic<QtGlobalStatic::Holder<`anonymous namespace'::Q_QGS_settingsInstance> >::operator()+0x24
chat!MySettings::globalInstance+0x12
chat!main+0x12f
chat!invoke_main+0x39
chat!__scrt_common_main_seh+0x12e
chat!__scrt_common_main+0xe
chat!mainCRTStartup+0xe
kernel32!BaseThreadInitThunk+0x10
ntdll!RtlUserThreadStart+0x2b

It's caused by VK_ERROR_DEVICE_LOST:

So it looks like we need to catch Vulkan exceptions from komputeManager()->listDevices() and ignore them. It seems like there is some issue with your GPU driver that prevents Vulkan from being used.

ADD-eNavarro · 2023-11-13T07:29:54Z

Anything I can do then?

H4CKS4F3 · 2023-11-16T00:57:37Z

From my perspective, unless you can suggest a patch, looks like you'll need to wait for the developers to do something. One thing I'd suggest is updating drivers, since this seems to be a driver issue. I actually was suffering from this issue too, but "something changed" and it started working again. Maybe I updated drivers, but I can't be certain. I have NVIDIA card, so I may have updated the driver + CUDA.

Sometimes Vulkan is not available due to VK_ERROR_INITIALIZATION_FAILED or VK_ERROR_DEVICE_LOST. Ingore the exception instead of crashing. Fixes nomic-ai/gpt4all#1477

Sometimes Vulkan is not available due to VK_ERROR_INITIALIZATION_FAILED or VK_ERROR_DEVICE_LOST. Ignore the exception instead of crashing. Fixes nomic-ai/gpt4all#1477

ADD-eNavarro · 2023-12-11T12:04:05Z

Following @H4CKS4F3 advice, we've updated the CUDA to version 12.3.1, which updated NVidia drivers from 545.84 to 546.12.
Other changes that came along were:
Nsight Compute, 2023.3.1 -> 2023.3.1
Nsight Visual Studio Edition, 2023.3.0.23xxx -> 2023.3.1.23311

But GPT4All still doesn't start. So maybe it's not the drivers.

Fixes nomic-ai#1477 Signed-off-by: Jared Van Bortel <jared@nomic.ai>

cebtenzzre · 2024-05-21T20:41:59Z

i had exactly this problem

Different issue. OP experienced a crash caused by a bad interaction with a non-functional Vulkan driver.

cebtenzzre added bug Something isn't working chat gpt4all-chat issues labels Oct 10, 2023

ADD-eNavarro changed the title ~~GPT4All not starting after update to version 2.4.9~~ GPT4All not starting after update to version 2.4.19 Oct 10, 2023

cebtenzzre changed the title ~~GPT4All not starting after update to version 2.4.19~~ GUI won't start on Windows (unhandled exception in llmodel_threadCount) Oct 18, 2023

cebtenzzre changed the title ~~GUI won't start on Windows (unhandled exception in llmodel_threadCount)~~ GUI won't start on Windows (unhandled exception in ggml_vk_available_devices) Oct 18, 2023

cebtenzzre mentioned this issue Dec 1, 2023

kompute : ignore exceptions in ggml_vk_available_devices nomic-ai/llama.cpp#12

Merged

This was referenced Jan 8, 2024

GPT4all doesnt start on Windows 10 #1600

Closed

chat.exe not launching on windows 11 #1656

Closed

Application does not open on Windows 10 #1699

Closed

cebtenzzre linked a pull request Jan 17, 2024 that will close this issue

kompute : ignore exceptions in ggml_vk_available_devices nomic-ai/llama.cpp#12

Merged

cebtenzzre closed this as completed in a9c5f53 Jan 17, 2024

gruzefix mentioned this issue Jan 27, 2024

Crash on loading certain models, ntdll.dll apparently at fault #1878

Closed

2 tasks

dpsalvatierra pushed a commit to dpsalvatierra/gpt4all that referenced this issue Feb 16, 2024

update llama.cpp for nomic-ai/llama.cpp#12

843ab19

Fixes nomic-ai#1477 Signed-off-by: Jared Van Bortel <jared@nomic.ai>

This comment was marked as off-topic.

Sign in to view

nomic-ai locked as resolved and limited conversation to collaborators May 21, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

GUI won't start on Windows (unhandled exception in ggml_vk_available_devices) #1477

GUI won't start on Windows (unhandled exception in ggml_vk_available_devices) #1477

ADD-eNavarro commented Oct 6, 2023

cebtenzzre commented Oct 6, 2023

ADD-eNavarro commented Oct 10, 2023

cosmic-snow commented Oct 10, 2023

ADD-eNavarro commented Oct 10, 2023

cebtenzzre commented Oct 10, 2023 •

edited

Loading

ADD-eNavarro commented Oct 18, 2023

cebtenzzre commented Oct 18, 2023 •

edited

Loading

ADD-eNavarro commented Oct 18, 2023

cebtenzzre commented Oct 18, 2023 •

edited

Loading

ADD-eNavarro commented Oct 19, 2023

cebtenzzre commented Oct 19, 2023

ADD-eNavarro commented Oct 20, 2023

ADD-eNavarro commented Nov 7, 2023

cebtenzzre commented Nov 7, 2023

H4CKS4F3 commented Nov 7, 2023

ADD-eNavarro commented Nov 8, 2023

H4CKS4F3 commented Nov 8, 2023 •

edited

Loading

ADD-eNavarro commented Nov 9, 2023

cebtenzzre commented Nov 9, 2023

ADD-eNavarro commented Nov 10, 2023

cebtenzzre commented Nov 10, 2023

ADD-eNavarro commented Nov 13, 2023

H4CKS4F3 commented Nov 16, 2023

ADD-eNavarro commented Dec 11, 2023 •

edited

Loading

This comment was marked as off-topic.

cebtenzzre commented May 21, 2024

GUI won't start on Windows (unhandled exception in ggml_vk_available_devices) #1477

GUI won't start on Windows (unhandled exception in ggml_vk_available_devices) #1477

Comments

ADD-eNavarro commented Oct 6, 2023

System Info

Information

Related Components

Reproduction

Expected behavior

cebtenzzre commented Oct 6, 2023

ADD-eNavarro commented Oct 10, 2023

cosmic-snow commented Oct 10, 2023

ADD-eNavarro commented Oct 10, 2023

cebtenzzre commented Oct 10, 2023 • edited Loading

ADD-eNavarro commented Oct 18, 2023

cebtenzzre commented Oct 18, 2023 • edited Loading

ADD-eNavarro commented Oct 18, 2023

cebtenzzre commented Oct 18, 2023 • edited Loading

ADD-eNavarro commented Oct 19, 2023

cebtenzzre commented Oct 19, 2023

ADD-eNavarro commented Oct 20, 2023

ADD-eNavarro commented Nov 7, 2023

cebtenzzre commented Nov 7, 2023

H4CKS4F3 commented Nov 7, 2023

ADD-eNavarro commented Nov 8, 2023

H4CKS4F3 commented Nov 8, 2023 • edited Loading

ADD-eNavarro commented Nov 9, 2023

cebtenzzre commented Nov 9, 2023

ADD-eNavarro commented Nov 10, 2023

cebtenzzre commented Nov 10, 2023

ADD-eNavarro commented Nov 13, 2023

H4CKS4F3 commented Nov 16, 2023

ADD-eNavarro commented Dec 11, 2023 • edited Loading

This comment was marked as off-topic.

cebtenzzre commented May 21, 2024

cebtenzzre commented Oct 10, 2023 •

edited

Loading

cebtenzzre commented Oct 18, 2023 •

edited

Loading

cebtenzzre commented Oct 18, 2023 •

edited

Loading

H4CKS4F3 commented Nov 8, 2023 •

edited

Loading

ADD-eNavarro commented Dec 11, 2023 •

edited

Loading