You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
It is more correct to show batch["labels"], instead of batch["input_ids"] since we are looking for response_token_ids[0] in labels not in input_ids. I understand that labels are just shifted input_ids in most cases, but sometimes it can be really really misleading
This issue has been automatically marked as stale because it has not had recent activity. If you think this still needs to be addressed please comment on this thread.
It is more correct to show batch["labels"], instead of batch["input_ids"] since we are looking for response_token_ids[0] in labels not in input_ids. I understand that labels are just shifted input_ids in most cases, but sometimes it can be really really misleading
https://github.com/huggingface/trl/blob/9a28b3fd0505aa38798f0122ab0ff3bb795384dd/trl/trainer/utils.py#L170C77-L170C86
The text was updated successfully, but these errors were encountered: