Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We鈥檒l occasionally send you account related emails.

Already on GitHub? Sign in to your account

RLHF text summarization diverges #589

Open
AlisonWen opened this issue Jan 14, 2024 · 0 comments
Open

RLHF text summarization diverges #589

AlisonWen opened this issue Jan 14, 2024 · 0 comments
Labels
bug Something isn't working

Comments

@AlisonWen
Copy link

馃悰 Describe the bug

I am running the experiment of trlx_gptj_text_summarization.py, I have not modified the code but the experiment has not converged when more than 3500 steps, and the document said it was meant to converge. I realized the sample project was running the file trlx_gptneo_text_summarization.py, but I cannot find the file anywhere.
image

Which trlX version are you using?

download with source code on 2024/01/13

Additional system and package information

linux jammy, torch==2.0.0+cu118

@AlisonWen AlisonWen added the bug Something isn't working label Jan 14, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
None yet
Development

No branches or pull requests

1 participant