chore: Move FP8 MHA quantization entry from 0.45 to 0.44 in CHANGELOG #1351
kevalmorabia97 merged 1 commit into main
PR #1289 (FP8 MHA quantization for ViT) is being cherry-picked to release/0.44.0 to fix nvbug 6114983 (the rc1 wheel is missing the nn.LayerNorm registration that the example references). Move the feature's release-notes entry accordingly.

Signed-off-by: ajrasane <131806219+ajrasane@users.noreply.github.com>
Walkthrough
A changelog entry for FP8 MHA quantization support for vision transformers is moved from the 0.45 release notes to the 0.44 release notes, correcting the feature's version attribution without altering the feature description.
Estimated code review effort: 1 (Trivial), ~2 minutes
Pre-merge checks: 6 passed
Codecov Report
All modified and coverable lines are covered by tests.
@@ Coverage Diff @@
## main #1351 +/- ##
=======================================
Coverage 75.72% 75.72%
=======================================
Files 471 471
Lines 50375 50375
=======================================
Hits 38146 38146
Misses 12229 12229
Summary
PR #1289 (FP8 MHA quantization for ViT) was merged to main after 0.44.0rc1 was tagged, so the rc1 wheel ships without the nn.LayerNorm registration that the example's _FP8_MHA_OVERRIDE now references. This surfaced as nvbug 6114983 (ValueError: parent_class 'nn.LayerNorm' not found in QuantModuleRegistry, raised when running torch_quant_to_onnx.py --quantize_mode=fp8). PR #1289 is labeled cherry-pick-0.44.0 and will be cherry-picked to release/0.44.0 for the next rc, so the feature ships in 0.44. This PR moves the corresponding release-notes bullet from the 0.45 (Future) section to 0.44 (2026-05-xx) to match.
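
The failure mode described above can be illustrated with a toy registry. This is a minimal sketch and not ModelOpt's implementation: `ToyRegistry`, `register`, and `resolve_override` are hypothetical names, and only the error text mirrors the message quoted in the nvbug. It shows why an override that names a parent class fails on a build where that class was never registered, and resolves once the registration is present.

```python
class ToyRegistry:
    """Hypothetical stand-in for a quantization module registry."""

    def __init__(self):
        self._registered = set()

    def register(self, parent_class: str) -> None:
        # Record that a quantized wrapper exists for this parent class.
        self._registered.add(parent_class)

    def resolve_override(self, parent_class: str) -> str:
        # An override can only target a class the registry already knows.
        if parent_class not in self._registered:
            raise ValueError(
                f"parent_class '{parent_class}' not found in QuantModuleRegistry"
            )
        return parent_class


# Hypothetical override that, like the example's override, names nn.LayerNorm.
override = {"parent_class": "nn.LayerNorm"}

# "rc1-like" build: the LayerNorm registration is absent.
rc1_registry = ToyRegistry()
rc1_registry.register("nn.Linear")

try:
    rc1_registry.resolve_override(override["parent_class"])
    rc1_error = None
except ValueError as exc:
    rc1_error = str(exc)

# Build that includes the registration: the same override resolves.
fixed_registry = ToyRegistry()
fixed_registry.register("nn.Linear")
fixed_registry.register("nn.LayerNorm")
resolved = fixed_registry.resolve_override(override["parent_class"])
```

The point of the sketch is only that the example and the registration have to ship in the same release, which is why the changelog entry follows the cherry-pick into 0.44.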