Skip to content

Phi-4-mini-reasoning crash on Qualcomm device #136

Open
@kinfey

Description

@kinfey

if I run foundry local in Qualcomm using Phi-4-mini-reasoning , it will crash.

My vm has 64GB memory(https://qdc.qualcomm.com/) , but Deepseek all models are okay .

My devices

Image

My NPU

Image

The log is

2025-06-03 01:22:59.509 -07:00 [INF] Starting Foundry Local CLI with 'model run Phi-4-mini-reasoning-qnn-npu'
2025-06-03 01:22:59.516 -07:00 [INF] Starting service <C:\Program Files\WindowsApps\Microsoft.FoundryLocal_0.3.9267.43123_arm64__8wekyb3d8bbwe\Inference.Service.Agent.exe --urls=http://localhost:5273 --OpenAIServiceSettings:ModelDirPath="C:\Users\HCKTest.foundry\cache\models" --JsonRpcServer:Run=true --JsonRpcServer:PipeName="inference_agent">
2025-06-03 01:22:59.625 -07:00 [INF] Command:ServiceStart Status:Success Direct:False Time:111ms
2025-06-03 01:23:00.366 -07:00 [INF] Downloading Phi-4-mini-reasoning-qnn-npu url:http://localhost:5273/openai/download
2025-06-03 01:24:04.359 -07:00 [INF] Command:ModelDownload Status:Success Direct:False Time:64733ms
2025-06-03 01:24:04.359 -07:00 [INF] Loading model: http://localhost:5273/openai/load/Phi-4-mini-reasoning-qnn-npu?ttl=600
2025-06-03 01:24:09.849 -07:00 [INF] 🟢 Model Phi-4-mini-reasoning-qnn-npu loaded successfully
2025-06-03 01:24:09.849 -07:00 [INF] Command:ModelLoad Status:Success Direct:False Time:5489ms
2025-06-03 01:24:47.343 -07:00 [INF] Command:SessionInteractive Status:Failure Direct:False Time:37492ms
2025-06-03 01:24:47.343 -07:00 [INF] Command:ModelRun Status:Failure Direct:True Time:107827ms
2025-06-03 01:24:47.346 -07:00 [INF] LogException
Microsoft.AI.Foundry.Local.Common.FLException: Error during chat
---> System.Net.Http.HttpIOException: The response ended prematurely. (ResponseEnded)
at System.Net.Http.HttpConnection.d__82.MoveNext() + 0x288
--- End of stack trace from previous location ---
at System.Net.Http.HttpConnection.ChunkedEncodingReadStream.d__6.MoveNext() + 0x288
--- End of stack trace from previous location ---
at System.ClientModel.Internal.ReadTimeoutStream.d__21.MoveNext() + 0x298
--- End of stack trace from previous location ---
at System.Net.ServerSentEvents.SseParser1.<FillLineBufferAsync>d__35.MoveNext() + 0x1d8 --- End of stack trace from previous location --- at System.Net.ServerSentEvents.SseParser1.d__20.MoveNext() + 0x4fc
--- End of stack trace from previous location ---
at OpenAI.AsyncSseUpdateCollection1.AsyncSseUpdateEnumerator1.<System-Collections-Generic-IAsyncEnumerator-MoveNextAsync>d__12.MoveNext() + 0x1f4
--- End of stack trace from previous location ---
at OpenAI.AsyncSseUpdateCollection1.<GetValuesFromPageAsync>d__8.MoveNext() + 0x270 --- End of stack trace from previous location --- at OpenAI.AsyncSseUpdateCollection1.d__8.MoveNext() + 0x3e8
--- End of stack trace from previous location ---
at System.ClientModel.AsyncCollectionResult1.<GetAsyncEnumerator>d__1.MoveNext() + 0x300 --- End of stack trace from previous location --- at System.ClientModel.AsyncCollectionResult1.d__1.MoveNext() + 0x6bc
--- End of stack trace from previous location ---
at System.ClientModel.AsyncCollectionResult1.<GetAsyncEnumerator>d__1.MoveNext() + 0x800 --- End of stack trace from previous location --- at Microsoft.AI.Foundry.Local.Commands.ModelRunCommand.<InteractiveNewRoundAsync>d__4.MoveNext() + 0x1e4 --- End of stack trace from previous location --- at Microsoft.AI.Foundry.Local.Commands.ModelRunCommand.<InteractiveNewRoundAsync>d__4.MoveNext() + 0x4bc --- End of inner exception stack trace --- at Microsoft.AI.Foundry.Local.Commands.ModelRunCommand.HandleExceptionDuringChat(Exception) + 0x108 at Microsoft.AI.Foundry.Local.Commands.ModelRunCommand.<InteractiveNewRoundAsync>d__4.MoveNext() + 0x65c --- End of stack trace from previous location --- at Microsoft.AI.Foundry.Local.Commands.ModelRunCommand.<RunInteractiveNewRoundAsync>d__2.MoveNext() + 0x64 --- End of stack trace from previous location --- at Microsoft.AI.Foundry.Local.Commands.ModelRunCommand.<<Create>b__1_0>d.MoveNext() + 0x1694 --- End of stack trace from previous location --- at Microsoft.AI.Foundry.Local.Common.CommandActionFactory.<>c__DisplayClass0_01.<b__0>d.MoveNext() + 0x238
--- End of stack trace from previous location ---
at System.CommandLine.NamingConventionBinder.CommandHandler.d__66.MoveNext() + 0x5c
--- End of stack trace from previous location ---
at System.CommandLine.NamingConventionBinder.ModelBindingCommandHandler.d__11.MoveNext() + 0x6c
--- End of stack trace from previous location ---
at System.CommandLine.Invocation.InvocationPipeline.d__0.MoveNext() + 0x1f4
--- End of stack trace from previous location ---
at Microsoft.AI.Foundry.Local.Program.

d__1.MoveNext() + 0x52c

Metadata

Metadata

Assignees

No one assigned

    Labels

    ADObugSomething isn't working

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions