Issue search results

Filter by

412 results

(110 ms)inmicrosoft/onnxruntime-genai (press backspace or delete to remove)

microsoft/onnxruntime-genai
How to inference LLM models with OpenVino EP

I have found that openvino relate code has been merged into main branch in latest commit history. I wonder how can I infer with openvino EP. If you can provide the openvino detail self build and infer ...

ep:DML

ZhangWei125521

Opened
2 days ago

#1501

microsoft/onnxruntime-genai
How to build for iOS

Hello, I am trying to build onnxruntime-genai for iOS using this tutorial https://github.com/Azure-Samples/Phi-3MiniSamples/blob/main/ios/README.md. But i think it is now out of date as the is no branch ...

platform:mobile

sahmed53

Opened
9 days ago

#1484

microsoft/onnxruntime-genai
AttributeError: 'onnxruntime_genai.onnxruntime_genai.GeneratorParams' object has no attribute 'input_ids'

Since the GeneratorParams.input_ids attribute has been decommissioned in the latest version of OGA, what is the alternative? input_tokens = test_enc[(i * seqlen) : ((i + 1) * seqlen - 1)] params.input_ids ...

satreysa

Opened
16 days ago

#1458

microsoft/onnxruntime-genai
Phi-4 audio task prompts don't work at all (Apple, onnx, dotnet)

Savvkin

Opened
17 days ago

#1455

microsoft/onnxruntime-genai
Endless response using the Phi-4-mini-instruct model

I m experiencing problems using the Phi-4-mini-instruct model where it will generate responses that begin to repeat text until max_length is reached. To Reproduce I see this problem with my application, ...

f2bo

Opened
18 days ago

#1450

microsoft/onnxruntime-genai
How to enable the Qwen3-30B-A3B

If you have any plan to enable Qwen3-30B-A3B which architectures is Qwen3MoeForCausalLM

ZhangWei125521

Opened
24 days ago

#1433

microsoft/onnxruntime-genai
max_length vs. context_length

Describe the bug I have encountered the following OGA error message: Completion failure max_length (2347) cannot be greater than model context_length (2176) I am mainly trying to understand OGA s definition ...

jeremyfowers

Opened
29 days ago

#1425

microsoft/onnxruntime-genai
🔧 Optimizing Phi-4 MM Instruct Vision Model (ONNX Inference)

Hi all, I’ve optimized the finetuned Phi-4 MM Instruct vision model by converting it to ONNX and applying quantization — inference time dropped from 26s ➝ 7s. :tada: I have a few quick questions: Audio ...

MeemankGupta

Opened
on Apr 24

#1423

microsoft/onnxruntime-genai
ORT-GenAI should ship ARM64EC binaries for AMD64 python running on ARM64 Windows

Describe the bug Windows on ARM users commonly use AMD64 python to execute models using ONNX runtime. This is needed because several python packages (eg. Torch, h5py, etc.) do not yet ship ARM64 for Windows ...

platform:windows

kory

Opened
on Apr 23

#1417

microsoft/onnxruntime-genai
Leverage the latest opset and IR version in model builder

The model builder currently uses opset 14 and IR version 7 for built models. I recommend adopting a later opset (18+) and IR version (10) for the models to leverage latest onnx features and help the ecosystem ...

justinchuby

Opened
on Apr 23

#1414

Learn how you can use GitHub Issues to plan and track your work.

Save views for sprints, backlogs, teams, or releases. Rank, sort, and filter issues to suit the occasion. The possibilities are endless.Learn more about GitHub Issues

ProTip!

Restrict your search to the title by using the in:title qualifier.

Learn how you can use GitHub Issues to plan and track your work.

Save views for sprints, backlogs, teams, or releases. Rank, sort, and filter issues to suit the occasion. The possibilities are endless.Learn more about GitHub Issues

ProTip!

Restrict your search to the title by using the in:title qualifier.

Languages

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Filter by

State

Advanced

microsoft/onnxruntime-genai
How to inference LLM models with OpenVino EP

microsoft/onnxruntime-genai
How to build for iOS

microsoft/onnxruntime-genai
AttributeError: 'onnxruntime_genai.onnxruntime_genai.GeneratorParams' object has no attribute 'input_ids'

microsoft/onnxruntime-genai
Phi-4 audio task prompts don't work at all (Apple, onnx, dotnet)

microsoft/onnxruntime-genai
Endless response using the Phi-4-mini-instruct model

microsoft/onnxruntime-genai
How to enable the Qwen3-30B-A3B

microsoft/onnxruntime-genai
max_length vs. context_length

microsoft/onnxruntime-genai
🔧 Optimizing Phi-4 MM Instruct Vision Model (ONNX Inference)

microsoft/onnxruntime-genai
ORT-GenAI should ship ARM64EC binaries for AMD64 python running on ARM64 Windows

microsoft/onnxruntime-genai
Leverage the latest opset and IR version in model builder

Learn how you can use GitHub Issues to plan and track your work.

Learn how you can use GitHub Issues to plan and track your work.

issues Search Results · repo:microsoft/onnxruntime-genai language:C++

Filter by

State

Advanced

412 results

Learn how you can use GitHub Issues to plan and track your work.

Learn how you can use GitHub Issues to plan and track your work.