-
Notifications
You must be signed in to change notification settings - Fork 185
Issues: microsoft/onnxruntime-genai
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
Certain prompts crash for Phi 3 mini int4 DML (with simpler example provided)
crash
ep:DML
#833
opened Aug 22, 2024 by
elephantpanda
Memory leak during back-to-back inferences
ep:DML
model:transformer
performance
#590
opened Jun 10, 2024 by
jeremyfowers
ORT-GenAI should ship ARM64EC binaries for AMD64 python running on ARM64 Windows
platform:windows
#1417
opened Apr 23, 2025 by
kory
Phi-3-Mini fails to execute on long prompts on Intel integrated GPU with DirectML
ep:DML
#570
opened Jun 5, 2024 by
ofirzaf
Phi-3-mini-4k-instruct-onnx model generates nonsensical results when prompt is longer than half of the context window length
bug
Something isn't working
#552
opened May 31, 2024 by
jackylu0124
onnxruntime-genai
generation speed very slow on int4
performance
#1098
opened Nov 23, 2024 by
tarekziade
Back-to-back inferences speed slowdown over time
bug
Something isn't working
ep:DML
model:transformer
performance
#737
opened Jul 31, 2024 by
xcmgttacct
[.NET] Genny chat-bot sample doesn't support DirectML and Phi-3
ep:DML
#569
opened Jun 5, 2024 by
asmirnov82
is there any exmaple of phi-3 vision model deploy on Android?
platform:mobile
#608
opened Jun 14, 2024 by
henrywang0314
.Net How to free GPU memory after each inference
bug
Something isn't working
enhancement
New feature or request
performance
#1131
opened Dec 9, 2024 by
strikene
Querying the ids of all the DML devices and selecting a device for inference
ep:DML
#360
opened Apr 30, 2024 by
jojo1899
GPU suspended (887A0005) while running example in DML
ep:DML
#628
opened Jun 21, 2024 by
skyline75489
Incorrect output shape for logits output returned from get_output("logits")
#1385
opened Apr 8, 2025 by
VishalX
Build for Android failing: error: no template named 'unordered_map' in namespace 'std'
platform:mobile
#1291
opened Mar 2, 2025 by
Danmoreng
Running custom encoder-decoder models in onnxruntime-genai
model:transformer
#875
opened Sep 5, 2024 by
KarelZe
Previous Next
ProTip!
Type g i on any issue or pull request to go back to the issue listing page.