docs: Add determinism tip #807

Merged
bxyu-nvidia merged 2 commits into main from bxyu/fix-775
Mar 2, 2026

@bxyu-nvidia
Contributor

No description provided.

Signed-off-by: Brian Yu <bxyu@nvidia.com>
@bxyu-nvidia linked an issue on Mar 2, 2026 that may be closed by this pull request
Contributor

@cmunley1 left a comment

lgtm, but it would be cool if we had a deterministic mode. I guess vLLM doesn't support that?

e.g. Megatron-LM does have bitwise-deterministic SFT.

Maybe we can point to this doc for those interested in getting closer to determinism? https://docs.vllm.ai/en/latest/usage/reproducibility/

cmunley1 previously approved these changes Mar 2, 2026
Signed-off-by: Brian Yu <bxyu@nvidia.com>
@bxyu-nvidia merged commit a19f233 into main on Mar 2, 2026
6 checks passed
@bxyu-nvidia deleted the bxyu/fix-775 branch on March 2, 2026 at 20:57
jsw-zorro pushed a commit to niletron/Gym that referenced this pull request Apr 7, 2026
Signed-off-by: Brian Yu <bxyu@nvidia.com>

Development

Successfully merging this pull request may close these issues.

User confusion on whether rollouts can be reproduced

2 participants