fix: moonlight flop calculation and DAPO config fix#1473
Conversation
📝 WalkthroughWalkthroughConfiguration file updated to remove explicit model save format specification. FLOPS calculation formula for deepseekv3 Q projection now conditionally branches based on q_lora_rank presence, using MLA-based parameters when specified, standard attention parameters otherwise. Changes
Estimated code review effort🎯 2 (Simple) | ⏱️ ~12 minutes
Suggested labels
Suggested reviewers
Pre-merge checks and finishing touches❌ Failed checks (1 warning)
✅ Passed checks (2 passed)
✨ Finishing touches
🧪 Generate unit tests (beta)
📜 Recent review detailsConfiguration used: Path: .coderabbit.yaml Review profile: CHILL Plan: Pro 📒 Files selected for processing (2)
🧰 Additional context used📓 Path-based instructions (6)examples/configs/recipes/**/*.yaml📄 CodeRabbit inference engine (CODING_GUIDELINES.md)
Files:
examples/configs/recipes/llm/*.yaml📄 CodeRabbit inference engine (CODING_GUIDELINES.md)
Files:
examples/configs/recipes/**/*.{yaml,sh}📄 CodeRabbit inference engine (CODING_GUIDELINES.md)
Files:
examples/configs/recipes/**📄 CodeRabbit inference engine (CODING_GUIDELINES.md)
Files:
**/*.py📄 CodeRabbit inference engine (CODING_GUIDELINES.md)
Files:
nemo_rl/**/*.py📄 CodeRabbit inference engine (CODING_GUIDELINES.md)
Files:
⏰ Context from checks skipped due to timeout of 90000ms. You can increase the timeout in your CodeRabbit configuration to a maximum of 15 minutes (900000ms). (5)
🔇 Additional comments (2)
Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out. Comment |
Signed-off-by: Terry Kong <terryk@nvidia.com>
Signed-off-by: Terry Kong <terryk@nvidia.com>
776495a to
b7e6e7c
Compare
What does this PR do ?
The moonlight fix was on main, but the DAPO one was new (and needs to be fixed on
mainas well. CC @ashors1 )Issues
List issues that this PR closes (syntax):
Usage
# Add a code snippet demonstrating how to use thisBefore your PR is "Ready for review"
Pre checks:
Additional Information
Summary by CodeRabbit
Chores
Bug Fixes