Description
if I run foundry local in Qualcomm using Phi-4-mini-reasoning , it will crash.
My vm has 64GB memory(https://qdc.qualcomm.com/) , but Deepseek all models are okay .
My devices

My NPU

The log is
2025-06-03 01:22:59.509 -07:00 [INF] Starting Foundry Local CLI with 'model run Phi-4-mini-reasoning-qnn-npu'
2025-06-03 01:22:59.516 -07:00 [INF] Starting service <C:\Program Files\WindowsApps\Microsoft.FoundryLocal_0.3.9267.43123_arm64__8wekyb3d8bbwe\Inference.Service.Agent.exe --urls=http://localhost:5273 --OpenAIServiceSettings:ModelDirPath="C:\Users\HCKTest.foundry\cache\models" --JsonRpcServer:Run=true --JsonRpcServer:PipeName="inference_agent">
2025-06-03 01:22:59.625 -07:00 [INF] Command:ServiceStart Status:Success Direct:False Time:111ms
2025-06-03 01:23:00.366 -07:00 [INF] Downloading Phi-4-mini-reasoning-qnn-npu url:http://localhost:5273/openai/download
2025-06-03 01:24:04.359 -07:00 [INF] Command:ModelDownload Status:Success Direct:False Time:64733ms
2025-06-03 01:24:04.359 -07:00 [INF] Loading model: http://localhost:5273/openai/load/Phi-4-mini-reasoning-qnn-npu?ttl=600
2025-06-03 01:24:09.849 -07:00 [INF] 🟢 Model Phi-4-mini-reasoning-qnn-npu loaded successfully
2025-06-03 01:24:09.849 -07:00 [INF] Command:ModelLoad Status:Success Direct:False Time:5489ms
2025-06-03 01:24:47.343 -07:00 [INF] Command:SessionInteractive Status:Failure Direct:False Time:37492ms
2025-06-03 01:24:47.343 -07:00 [INF] Command:ModelRun Status:Failure Direct:True Time:107827ms
2025-06-03 01:24:47.346 -07:00 [INF] LogException
Microsoft.AI.Foundry.Local.Common.FLException: Error during chat
---> System.Net.Http.HttpIOException: The response ended prematurely. (ResponseEnded)
at System.Net.Http.HttpConnection.d__82.MoveNext() + 0x288
--- End of stack trace from previous location ---
at System.Net.Http.HttpConnection.ChunkedEncodingReadStream.d__6.MoveNext() + 0x288
--- End of stack trace from previous location ---
at System.ClientModel.Internal.ReadTimeoutStream.d__21.MoveNext() + 0x298
--- End of stack trace from previous location ---
at System.Net.ServerSentEvents.SseParser1.<FillLineBufferAsync>d__35.MoveNext() + 0x1d8 --- End of stack trace from previous location --- at System.Net.ServerSentEvents.SseParser
1.d__20.MoveNext() + 0x4fc
--- End of stack trace from previous location ---
at OpenAI.AsyncSseUpdateCollection1.AsyncSseUpdateEnumerator
1.<System-Collections-Generic-IAsyncEnumerator-MoveNextAsync>d__12.MoveNext() + 0x1f4
--- End of stack trace from previous location ---
at OpenAI.AsyncSseUpdateCollection1.<GetValuesFromPageAsync>d__8.MoveNext() + 0x270 --- End of stack trace from previous location --- at OpenAI.AsyncSseUpdateCollection
1.d__8.MoveNext() + 0x3e8
--- End of stack trace from previous location ---
at System.ClientModel.AsyncCollectionResult1.<GetAsyncEnumerator>d__1.MoveNext() + 0x300 --- End of stack trace from previous location --- at System.ClientModel.AsyncCollectionResult
1.d__1.MoveNext() + 0x6bc
--- End of stack trace from previous location ---
at System.ClientModel.AsyncCollectionResult1.<GetAsyncEnumerator>d__1.MoveNext() + 0x800 --- End of stack trace from previous location --- at Microsoft.AI.Foundry.Local.Commands.ModelRunCommand.<InteractiveNewRoundAsync>d__4.MoveNext() + 0x1e4 --- End of stack trace from previous location --- at Microsoft.AI.Foundry.Local.Commands.ModelRunCommand.<InteractiveNewRoundAsync>d__4.MoveNext() + 0x4bc --- End of inner exception stack trace --- at Microsoft.AI.Foundry.Local.Commands.ModelRunCommand.HandleExceptionDuringChat(Exception) + 0x108 at Microsoft.AI.Foundry.Local.Commands.ModelRunCommand.<InteractiveNewRoundAsync>d__4.MoveNext() + 0x65c --- End of stack trace from previous location --- at Microsoft.AI.Foundry.Local.Commands.ModelRunCommand.<RunInteractiveNewRoundAsync>d__2.MoveNext() + 0x64 --- End of stack trace from previous location --- at Microsoft.AI.Foundry.Local.Commands.ModelRunCommand.<<Create>b__1_0>d.MoveNext() + 0x1694 --- End of stack trace from previous location --- at Microsoft.AI.Foundry.Local.Common.CommandActionFactory.<>c__DisplayClass0_0
1.<b__0>d.MoveNext() + 0x238
--- End of stack trace from previous location ---
at System.CommandLine.NamingConventionBinder.CommandHandler.d__66.MoveNext() + 0x5c
--- End of stack trace from previous location ---
at System.CommandLine.NamingConventionBinder.ModelBindingCommandHandler.d__11.MoveNext() + 0x6c
--- End of stack trace from previous location ---
at System.CommandLine.Invocation.InvocationPipeline.d__0.MoveNext() + 0x1f4
--- End of stack trace from previous location ---
at Microsoft.AI.Foundry.Local.Program.