fix: include medusa in data_module assignment in main.py (#1370)
kevalmorabia97 merged 1 commit into main
Conversation
When `mode == "medusa"`, `data_module` was never assigned because the condition only covered `"eagle3"` and `"dflash"`, causing an `UnboundLocalError` at trainer construction. Add `"medusa"` to the condition so the data module is correctly prepared for all supported training modes.

Fixes OMNIML-4147

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
Signed-off-by: Ye Yu <yeyu@nvidia.com>
Codecov Report: ✅ All modified and coverable lines are covered by tests.

```
@@            Coverage Diff             @@
##             main    #1370      +/-   ##
==========================================
+ Coverage   76.48%   77.49%   +1.00%
==========================================
  Files         471      471
  Lines       50487    50487
==========================================
+ Hits        38617    39124     +507
+ Misses      11870    11363     -507
```
Problem
When `training.mode == "medusa"` is used in `main.py`, the `data_module` variable is never assigned because line 344 only covered the `eagle3` and `dflash` modes. This causes an `UnboundLocalError` when the trainer is constructed with `**data_module`.

Fixes OMNIML-4147
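The failure mode is a standard Python scoping pitfall: a local variable bound only inside some branches of an `if` is unbound when no branch runs. A minimal, self-contained sketch (hypothetical function and mode names, not the actual `main.py` code) reproduces it:

```python
def build_trainer(mode: str) -> dict:
    """Hypothetical sketch of the bug: data_module is only bound
    in the branch covering some modes, so "medusa" falls through."""
    if mode in ("eagle3", "dflash"):
        data_module = {"train_dataset": ["sample"]}
    # No branch covers "medusa", so unpacking data_module below
    # raises UnboundLocalError for that mode.
    return dict(**data_module)


if __name__ == "__main__":
    try:
        build_trainer("medusa")
    except UnboundLocalError:
        print("reproduced the bug for mode='medusa'")
```

Note the error surfaces only at the point of use (`**data_module`), far from the conditional that failed to bind the variable, which is why the traceback points at trainer construction rather than the mode check.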
Fix
Add `"medusa"` to the `training_args.mode in ("eagle3", "dflash")` condition so `data_module` is correctly populated for medusa training.