Skip to content

Support true on policy#566

Merged
zhuzilin merged 2 commits intoTHUDM:mainfrom
fzyzcjy:feat/true_on_policy
Oct 24, 2025
Merged

Support true on policy#566
zhuzilin merged 2 commits intoTHUDM:mainfrom
fzyzcjy:feat/true_on_policy

Conversation

@fzyzcjy
Copy link
Collaborator

@fzyzcjy fzyzcjy commented Oct 24, 2025

EDIT: below is an example, where true on policy is exactly zero, while the normal case is nonzero.

image

@zhuzilin zhuzilin merged commit 657d8ee into THUDM:main Oct 24, 2025
4 checks passed
llltttwww pushed a commit to llltttwww/slime that referenced this pull request Nov 30, 2025
Co-authored-by: Zilin Zhu <zhuzilinallen@gmail.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants