-
Notifications
You must be signed in to change notification settings - Fork 185
Issues: microsoft/onnxruntime-genai
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
[DML] [iGPU] [AMD] [Intel] GPU command exception
ep:DML
model:transformer
#667
opened Jul 2, 2024 by
tjtanaa
ONNXRuntime-genai doesn't release GPU memory after first inference
ep:CUDA
performance
#526
opened May 28, 2024 by
Positronx
Phi-3-mini-4k-instruct-onnx model generates nonsensical results when prompt is longer than half of the context window length
bug
Something isn't working
#552
opened May 31, 2024 by
jackylu0124
[.NET] Genny chat-bot sample doesn't support DirectML and Phi-3
ep:DML
#569
opened Jun 5, 2024 by
asmirnov82
Phi-3-Mini fails to execute on long prompts on Intel integrated GPU with DirectML
ep:DML
#570
opened Jun 5, 2024 by
ofirzaf
Memory leak during back-to-back inferences
ep:DML
model:transformer
performance
#590
opened Jun 10, 2024 by
jeremyfowers
is there any exmaple of phi-3 vision model deploy on Android?
platform:mobile
#608
opened Jun 14, 2024 by
henrywang0314
GPU suspended (887A0005) while running example in DML
ep:DML
#628
opened Jun 21, 2024 by
skyline75489
Querying the ids of all the DML devices and selecting a device for inference
ep:DML
#360
opened Apr 30, 2024 by
jojo1899
GPU driver error when using AMD eGPU via DirectML
ep:DML
model:transformer
platform:windows
#644
opened Jun 25, 2024 by
x0wllaar
Nodes are not topologically sorted from models generated by model builder
#707
opened Jul 17, 2024 by
BowenBao
Inference with batching is significantly slower than without batching.
ep:CUDA
#714
opened Jul 20, 2024 by
Jester6136
Issues running on Ryzen
ep:DML
platform:windows
waiting-for-customer
#728
opened Jul 26, 2024 by
jarroddavis68
Setting specific device_id with set_current_gpu_device_id not working
bug
Something isn't working
ep:CUDA
#730
opened Jul 29, 2024 by
MadMenHitBooker
Back-to-back inferences speed slowdown over time
bug
Something isn't working
ep:DML
model:transformer
performance
#737
opened Jul 31, 2024 by
xcmgttacct
Compiling
ort_genai_c.h
as a C file fails due to <cstddef>
inclusion
#1512
opened May 28, 2025 by
nizarbenalla
Previous Next
ProTip!
Updated in the last three days: updated:>2025-05-25.