-
Notifications
You must be signed in to change notification settings - Fork 196
Pull requests: microsoft/onnxruntime-genai
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
NvTensorRtRtx: Pass the dynamic shapes (ISL and batch_size) to the ep at runtime as nv profile.
#1614
opened Jul 9, 2025 by
anujj
Loading…
Update OnnxRuntimeGenAIChatClient to M.E.AI.Abstractions 9.7.0
#1612
opened Jul 8, 2025 by
stephentoub
Loading…
Modify Model Builder to build paged attention models
#1605
opened Jul 3, 2025 by
aciddelgado
•
Draft
Add Encode with Options for
add_special_tokens=True
use-case
#1504
opened May 22, 2025 by
sayanshaw24
Loading…
Model Builder: Add Post processing script to convert fp16/32 LM_HEAD to int8 and use tied embeddings
#1437
opened Apr 30, 2025 by
sushraja-msft
Loading…
add extra_options use_channel_wised_quantization to builder.py
#1362
opened Mar 31, 2025 by
bopeng1234
Loading…
Avoid potential desynchronization of cpu and device memory
#1132
opened Dec 9, 2024 by
aciddelgado
Loading…
Previous Next
ProTip!
Find all pull requests that aren't related to any open issues with -linked:issue.