Implement rewards as proposed in https://arxiv.org/pdf/2405.14655
Activity
ggbondcxl commentedon Jul 11, 2024
I am very interested in multi-turn RLHF, can you give a sample code
rghosh08 commentedon Oct 5, 2024
@vmoens I am interested in this. there any progress. I am ready to collaborate or start from scratch.