Skip to content

Conversation

@RyanUnderhill
Copy link
Contributor

@RyanUnderhill RyanUnderhill commented May 1, 2025

Update version to 0.8.0-rc2 and cherry pick these 3 changes:

#1435 update ESRP settings
#1434 make WebGPU name consistent
#1432 Missed an all lowercase "webgpu" string
#1440 Apply provider name backwards compatibility at runtime
#1452 Update Extensions Commit to Support Chat Template Override for Unsupported Models
#1439 Sign macos binaries
#1442 Set add_special_tokens

RyanUnderhill and others added 4 commits April 30, 2025 17:19
Exact same change as microsoft/onnxruntime#24608
in Onnxruntime
This is in code we should deprecate going forward but it's breaking an
existing case and this is the quickest fix.
@RyanUnderhill RyanUnderhill enabled auto-merge (squash) May 1, 2025 00:22
@RyanUnderhill RyanUnderhill disabled auto-merge May 1, 2025 00:27
@natke natke self-requested a review May 1, 2025 00:28
RyanUnderhill and others added 6 commits May 2, 2025 17:44
Add backwards compatibility for dml->DML
Give a runtime error if DML chosen but if we're not built with DML
(otherwise it goes into Ort and it will use DML even if GenAI isn't
built for it)
Sets `add_special_tokens` from `OrtxTokenizeWithOptions` added in
microsoft/onnxruntime-extensions#940 to false to
solve chat template issue in GenAI with extra BOS tokens.

See huggingface/transformers#37686 for more
context.

---------

Co-authored-by: Sayan Shaw <sayanshaw@microsoft.com>
Co-authored-by: Ryan Hill <38674843+RyanUnderhill@users.noreply.github.com>
### Description

This PR removes generating prompt templates and storing them in the
GenAI config.

### Motivation and Context

The prompt templates are not used internally anymore.
@baijumeswani baijumeswani merged commit ba0a910 into rel-0.8.0 May 6, 2025
13 checks passed
@baijumeswani baijumeswani deleted the ryanunderhill/rc2_cherry_picks branch May 6, 2025 20:00
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

6 participants