Skip to content

Add BBQ xLLM support#22

Merged
nightlessbaron merged 3 commits into
prodfrom
bbq_support
May 19, 2026
Merged

Add BBQ xLLM support#22
nightlessbaron merged 3 commits into
prodfrom
bbq_support

Conversation

@akhauriyash
Copy link
Copy Markdown
Collaborator

Summary

  • Add xLLM 375B and 8B model-args scripts for BBQ runs.
  • Register xLLM mbridge and Megatron-to-HF conversion support.
  • Keep long-context PPO logits in model precision until chunked logprob computation to avoid full-vocab fp32 materialization OOMs.
  • Clear inactive CUDA allocator blocks before optimizer step and checkpoint saves for large packed RL batches.

Notes

  • The 8B model args use --layernorm-num-groups 4, which depends on the corresponding Megatron-LM grouped RMSNorm support.
  • This branch was created from the RL360-pinned Miles commit 22d4693bf.

Validation

  • python3 -m py_compile on touched Python files.
  • bash -n on scripts/models/xllm-375B.sh and scripts/models/xllm-8B.sh.
  • git diff --cached --check before commit.

@nightlessbaron nightlessbaron marked this pull request as ready for review May 19, 2026 21:10
@nightlessbaron nightlessbaron requested a review from a team as a code owner May 19, 2026 21:10
@nightlessbaron
Copy link
Copy Markdown

Tested this PR e2e on 8B BBQ.

@nightlessbaron nightlessbaron merged commit 8c69fb2 into prod May 19, 2026
14 of 18 checks passed
@nightlessbaron nightlessbaron deleted the bbq_support branch May 19, 2026 21:11
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants