preserve_thinking defaults to false, but is the thinking from historical turns actually trained on? #10519
ABCDabcde1098234
started this conversation in
General
Replies: 0 comments
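`mask_history: false` is the default, which means the replies from historical turns take part in training. `preserve_thinking: false` is also the default, so by default the thinking content of historical turns is not carried over into the later context. Does that mean the thinking from historical turns is excluded from training, since it is dropped from the context, or is that thinking still trained on? Looking forward to a reply.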
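To make the question concrete, here is a toy sketch of one possible reading, in which thinking that is dropped from the context simply cannot receive loss. This is not the library's code: `Turn`, `trained_segments`, and the masking logic are all hypothetical names and assumptions of mine, and whether the real implementation behaves this way is exactly what I am asking.

```python
from dataclasses import dataclass


@dataclass
class Turn:
    role: str            # "user" or "assistant"
    thinking: str = ""   # reasoning text (assistant turns only)
    content: str = ""    # visible message text


def trained_segments(turns, mask_history=False, preserve_thinking=False):
    """Return the (turn_index, part) pairs that would receive loss.

    Hypothetical model of ONE reading of the two flags: thinking that is
    removed from the context is no longer in the sequence, so it cannot
    be trained on. This is an assumption, not confirmed behavior.
    """
    last_assistant = max(i for i, t in enumerate(turns) if t.role == "assistant")
    trained = []
    for i, turn in enumerate(turns):
        if turn.role != "assistant":
            continue  # user prompts never receive loss in this sketch
        is_history = i < last_assistant
        if is_history and mask_history:
            continue  # historical replies are masked out entirely
        # Historical thinking survives only when preserve_thinking is set;
        # the final turn always keeps its thinking.
        keep_thinking = preserve_thinking or not is_history
        if turn.thinking and keep_thinking:
            trained.append((i, "thinking"))
        trained.append((i, "content"))
    return trained


conversation = [
    Turn("user", content="q1"),
    Turn("assistant", thinking="t1", content="a1"),
    Turn("user", content="q2"),
    Turn("assistant", thinking="t2", content="a2"),
]
print(trained_segments(conversation))
```

Under this reading, the defaults would train on the historical reply text and on the final turn's thinking and reply, but not on the historical thinking. The alternative reading would be that the historical thinking is still trained on in some form even though it is not kept in the later context.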