Skip to content

Constrained decoding integration #1381

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged
merged 187 commits into from
May 9, 2025

Conversation

ajindal1
Copy link
Collaborator

@ajindal1 ajindal1 commented Apr 7, 2025

Integrate Constrained decoding using LLGuidance library.

Based on Ying's Constrained Decoding branch (yingxiong/constrained_decoding)

@ajindal1 ajindal1 added the 0.8.0 label May 8, 2025
kunal-vaishnavi
kunal-vaishnavi previously approved these changes May 9, 2025
@ajindal1 ajindal1 merged commit 48e5cb3 into main May 9, 2025
14 checks passed
@ajindal1 ajindal1 deleted the abjindal/constrained_decoding_integration branch May 9, 2025 22:22
RyanUnderhill added a commit that referenced this pull request May 12, 2025
Integrate Constrained decoding using LLGuidance library.

Based on Ying's Constrained Decoding branch
(yingxiong/constrained_decoding)

---------

Co-authored-by: Ying Xiong <yingxiong@microsoft.com>
Co-authored-by: Michał Moskal <michal@moskal.me>
Co-authored-by: Kunal Vaishnavi <kvaishnavi@microsoft.com>
Co-authored-by: Ryan Hill <38674843+RyanUnderhill@users.noreply.github.com>
Co-authored-by: Baiju Meswani <bmeswani@microsoft.com>
baijumeswani added a commit that referenced this pull request May 14, 2025
Address previous PR review comments from #1470 (#1473)
Address QNN specific regressions (#1470)
Fix array eos_token_id handling (#1463)
Constrained decoding integration (#1381)
Remove BF16 CPU from valid GQA configuration (#1469)
Avoid adding providers if not requested (#1464)
Persist provider options across ClearProviders, AppendProvider where
possible (#1454)
Fix accuracy issues with Gemma models (#1448)
Add bfloat16 support in model builder (#1447)
Add final norm for LoRA models (#1446)

Update version to 0.8.0-rc3

---------

Co-authored-by: kunal-vaishnavi <115581922+kunal-vaishnavi@users.noreply.github.com>
Co-authored-by: Nenad Banfic <46795300+nenad1002@users.noreply.github.com>
Co-authored-by: Nenad Banfic <nebanfic@microsoft.com>
Co-authored-by: Baiju Meswani <bmeswani@microsoft.com>
Co-authored-by: Abhishek Jindal <abjindal@microsoft.com>
Co-authored-by: Ying Xiong <yingxiong@microsoft.com>
Co-authored-by: Michał Moskal <michal@moskal.me>
Co-authored-by: Kunal Vaishnavi <kvaishnavi@microsoft.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

7 participants