I have searched existing issues, and this is a new question or discussion topic.
Question Description
Hi team, how can I merge an SFT LoRA checkpoint into the base model and then use the merged model as the new base model for GRPO? Or is there another way to go from SFT LoRA training to GRPO LoRA training? Thank you.
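To make clear what I mean by "merge": my understanding, assuming the standard LoRA formulation, is that merging folds the low-rank update into the base weight, W' = W + (alpha / r) * B A, so the result is an ordinary dense model that can serve as a new base. Below is a toy sketch of that arithmetic in plain Python. The function names `matmul` and `merge_lora` are hypothetical and only for illustration; they are not the API of any training framework. With Hugging Face PEFT, I believe the equivalent step is `PeftModel.merge_and_unload()`, but I am not sure how that maps onto this project's tooling.

```python
def matmul(X, Y):
    """Multiply two matrices given as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*Y)] for row in X]


def merge_lora(W, A, B, alpha, r):
    """Return W + (alpha / r) * (B @ A), i.e. the base weight with the adapter folded in.

    W: (out, in) base weight, A: (r, in) LoRA down-projection, B: (out, r) LoRA up-projection.
    Hypothetical helper for illustration only.
    """
    scale = alpha / r
    delta = matmul(B, A)  # low-rank update, same shape as W
    return [
        [w + scale * d for w, d in zip(w_row, d_row)]
        for w_row, d_row in zip(W, delta)
    ]


# Toy example: out=2, in=3, rank r=1 (values are made up)
W = [[1.0, 0.0, 2.0], [0.0, 1.0, 0.0]]
A = [[1.0, 2.0, 3.0]]
B = [[0.5], [-1.0]]

merged = merge_lora(W, A, B, alpha=2, r=1)

# The merged weight should give the same output as base + scaled adapter path.
x = [[1.0], [1.0], [1.0]]
merged_out = matmul(merged, x)
base_out = matmul(W, x)
adapter_out = matmul(B, matmul(A, x))
separate_out = [[b[0] + 2.0 * a[0]] for b, a in zip(base_out, adapter_out)]
```

If that is right, the merged weights would be saved as a regular checkpoint, and GRPO would then attach a fresh LoRA adapter on top of it.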
Checklist