[Feature Request] multi-turn reward for RLHF #2271

Open
@vmoens

Description

@vmoens
Collaborator

Implement rewards as proposed in https://arxiv.org/pdf/2405.14655

Activity

self-assigned this on Jul 6, 2024
ggbondcxl commented on Jul 11, 2024

I am very interested in multi-turn RLHF. Could you share some sample code?

rghosh08 commented on Oct 5, 2024

@vmoens I am interested in this. Is there any progress? I am ready to collaborate or start from scratch.

Labels: enhancement (New feature or request)

[Feature Request] multi-turn reward for RLHF · Issue #2271 · pytorch/rl