Add final norm for LoRA models #1446
Conversation
Pull Request Overview
This PR fixes the handling of LoRA models by adding support for identifying the final normalization layer and by cleaning up the model builder classes. Key changes include renaming function parameters in the packed MatMul helpers so node naming stays consistent, and updating the final norm detection logic (has_final_norm) to handle PEFT models correctly.
Comments suppressed due to low confidence (1)
src/python/py/models/builder.py:920
- The parameter 'basename' replaces the previous 'name' parameter in this function; please ensure that all call sites are updated consistently so the MatMul node names do not mismatch.
def make_packed_matmul_fp16_or_fp32(self, q_matmul, k_matmul, v_matmul, basename, root_input, **kwargs):
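For context, here is a minimal, self-contained sketch of the naming convention the reviewer is pointing at: deriving all node names for a packed QKV MatMul from one shared basename, so call sites that pass a different value would produce mismatched names. The helper name make_packed_matmul_names and the suffix strings are assumptions for illustration, not the actual builder.py code.

```python
# Hypothetical helper illustrating why call sites must pass 'basename' consistently:
# every node name for the packed QKV MatMul is derived from the same prefix.
def make_packed_matmul_names(basename: str) -> dict:
    """Derive the node names a packed QKV MatMul might use from one basename."""
    return {
        "packed": basename,                # e.g. "/model/layers.0/attn/qkv_proj/MatMul"
        "q_slice": f"{basename}/Slice_Q",  # illustrative per-output names
        "k_slice": f"{basename}/Slice_K",
        "v_slice": f"{basename}/Slice_V",
    }

if __name__ == "__main__":
    for role, name in make_packed_matmul_names("/model/layers.0/attn/qkv_proj/MatMul").items():
        print(f"{role}: {name}")
```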
Address previous PR review comments from #1470 (#1473)
Address QNN specific regressions (#1470)
Fix array eos_token_id handling (#1463)
Constrained decoding integration (#1381)
Remove BF16 CPU from valid GQA configuration (#1469)
Avoid adding providers if not requested (#1464)
Persist provider options across ClearProviders, AppendProvider where possible (#1454)
Fix accuracy issues with Gemma models (#1448)
Add bfloat16 support in model builder (#1447)
Add final norm for LoRA models (#1446)
Update version to 0.8.0-rc3
---------
Co-authored-by: kunal-vaishnavi <115581922+kunal-vaishnavi@users.noreply.github.com>
Co-authored-by: Nenad Banfic <46795300+nenad1002@users.noreply.github.com>
Co-authored-by: Nenad Banfic <nebanfic@microsoft.com>
Co-authored-by: Baiju Meswani <bmeswani@microsoft.com>
Co-authored-by: Abhishek Jindal <abjindal@microsoft.com>
Co-authored-by: Ying Xiong <yingxiong@microsoft.com>
Co-authored-by: Michał Moskal <michal@moskal.me>
Co-authored-by: Kunal Vaishnavi <kvaishnavi@microsoft.com>
Description
This PR adds the missing pattern to identify the final norm layer in LoRA models. It also cleans up some of the classes in the model builder.
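As an illustration of the missing pattern, below is a minimal sketch of final-norm detection that also accepts PEFT-style module paths, which gain a base_model.model. prefix when a LoRA adapter wraps the base model. The function name and the exact path strings are assumptions for this sketch, not the code added in builder.py.

```python
# Hypothetical check for the final normalization layer. PEFT/LoRA wrapping
# (assumed here) prefixes module paths with "base_model.model.", so matching
# only "model.norm" would miss the final norm in LoRA models.
def is_final_norm(module_name: str) -> bool:
    candidates = (
        "model.norm",                   # plain Hugging Face transformer
        "base_model.model.model.norm",  # LoRA/PEFT-wrapped transformer (assumed path)
    )
    return module_name in candidates

print(is_final_norm("model.norm"))                      # True
print(is_final_norm("base_model.model.model.norm"))     # True
print(is_final_norm("model.layers.0.input_layernorm"))  # False
```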
Motivation and Context
The missing final norm layer in LoRA models caused the generated LoRA models to be incorrect.