
fix: MoE lora adapter layout #1395

Merged
thomasdhc merged 5 commits into main from akoumparouli/fix_hf_moe_adapters
Feb 27, 2026

Conversation

@akoumpa (Contributor) commented Feb 26, 2026

What does this PR do ?

TestMoELoRASaveRestoreMergeHF covers the exact workflow from issue #1226 (a sketch of this workflow follows the list):

  1. test_peft_loads_adapter -- Grouped LoRA tensors are converted to per-expert HF format, saved as adapter_model.safetensors + adapter_config.json, and PeftModel.from_pretrained() loads them without error.
  2. test_merge_changes_all_expert_weights -- After merge_and_unload(), every expert's gate_proj, up_proj, and down_proj weights differ from the base (all 24 = 2 layers x 4 experts x 3 projections).
  3. test_merge_values_are_numerically_correct -- The merged weight values exactly satisfy W_merged = W_base + (lora_B @ lora_A) * (alpha / r), verifying the LoRA delta is correctly applied.
  4. test_merge_removes_lora_params -- After merge, no lora_ parameters or modules remain in the model.
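
A minimal sketch of the end-to-end workflow these tests cover, written against the public PEFT API (PeftConfig.from_pretrained, PeftModel.from_pretrained, merge_and_unload). The model id, adapter directory, the Qwen-MoE-style expert module names (mlp.experts.{i}.gate_proj/up_proj/down_proj), and the PEFT adapter key prefix are illustrative assumptions, not the actual test fixtures:

```python
import torch
from safetensors.torch import load_file
from transformers import AutoModelForCausalLM
from peft import PeftConfig, PeftModel

base = AutoModelForCausalLM.from_pretrained("some/moe-base-model")   # hypothetical id
base_sd = {k: v.detach().clone() for k, v in base.state_dict().items()}

adapter_dir = "path/to/converted_adapter"                             # hypothetical path
cfg = PeftConfig.from_pretrained(adapter_dir)
scaling = cfg.lora_alpha / cfg.r

# (1) The per-expert HF-format adapter (adapter_model.safetensors + adapter_config.json)
#     should load without error.
peft_model = PeftModel.from_pretrained(base, adapter_dir)

# (2) + (4) merge_and_unload() folds the LoRA delta into the base weights and
#     removes every lora_ parameter and module.
merged = peft_model.merge_and_unload()
assert not any("lora_" in n for n, _ in merged.named_parameters())

# (3) Every targeted expert projection changed, and the change equals
#     (lora_B @ lora_A) * (alpha / r).
adapter_sd = load_file(f"{adapter_dir}/adapter_model.safetensors")
for name, w_merged in merged.state_dict().items():
    if not name.endswith(".weight"):
        continue
    if not any(p in name for p in ("gate_proj", "up_proj", "down_proj")):
        continue
    assert not torch.allclose(w_merged, base_sd[name]), f"{name} unchanged after merge"
    # PEFT typically saves adapter keys as base_model.model.<module>.lora_A/lora_B.weight;
    # this prefix is an assumption here.
    stem = "base_model.model." + name.removesuffix(".weight")
    lora_A = adapter_sd[f"{stem}.lora_A.weight"]
    lora_B = adapter_sd[f"{stem}.lora_B.weight"]
    expected = base_sd[name] + (lora_B @ lora_A) * scaling
    torch.testing.assert_close(w_merged, expected)
```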

Changelog

  • Add specific line by line info of high level changes in this PR.

Before your PR is "Ready for review"

Pre checks:

  • Make sure you read and followed Contributor guidelines
  • Did you write any new necessary tests?
  • Did you add or update any necessary documentation?

If you haven't finished some of the above items, you can still open a "Draft" PR.

Additional Information

  • Related to # (issue)

Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
copy-pr-bot (Bot) commented Feb 26, 2026

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

@akoumpa (Contributor, Author) commented Feb 26, 2026

/ok to test 8915726

Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
@akoumpa (Contributor, Author) commented Feb 26, 2026

/ok to test 750f55a

Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
@akoumpa (Contributor, Author) commented Feb 26, 2026

/ok to test dad2a6f

Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
@akoumpa (Contributor, Author) commented Feb 26, 2026

/ok to test 90eae91

@HuiyingLi (Contributor) left a comment

lgtm.


Labels

r0.3.0 Add for cherry-pick into release branch r0.3.0


Development

Successfully merging this pull request may close these issues.

LoRA adapter merging gives loss/output similar to the base (untrained) model

4 participants