Works for T5/BART? #13
Comments
Side note: it'd be good to update the
You are right, when I have time I'll upgrade it to v4.0.0. I haven't tested it, but I suspect that any model with a text generation head should work. Note that you need to add a value head to your model architecture (see here).
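For readers unfamiliar with the term, here is a minimal PyTorch sketch of what a value head might look like. It is not this repository's implementation: the class name `ValueHead`, the dropout default, and the idea of feeding it the decoder's last hidden states for T5/BART are assumptions made for illustration.

```python
import torch
from torch import nn


class ValueHead(nn.Module):
    """Hypothetical value head: one scalar value estimate per token."""

    def __init__(self, hidden_size: int, dropout: float = 0.1):
        super().__init__()
        self.dropout = nn.Dropout(dropout)
        # Linear projection from the hidden size down to a single scalar
        self.summary = nn.Linear(hidden_size, 1)

    def forward(self, hidden_states: torch.Tensor) -> torch.Tensor:
        # hidden_states: (batch, seq_len, hidden_size)
        # returns:       (batch, seq_len)
        return self.summary(self.dropout(hidden_states)).squeeze(-1)


# Random tensor standing in for the last hidden states of a language model
# (for a seq2seq model, presumably the decoder's last layer).
hidden = torch.randn(2, 5, 16)
head = ValueHead(hidden_size=16)
head.eval()  # disable dropout for a deterministic forward pass
values = head(hidden)
print(values.shape)
```

The head would sit next to the existing language-modeling head, so the same forward pass yields both token logits and per-token value estimates for PPO.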
I can try it. Other than checking that it runs without errors, how else can I test that the code is working correctly? Is there a benchmark or a quantitative way of verifying the code?
Monitoring the rewards on the IMDb dataset would be a good start. For GPT-2 it takes only 1-2 hours to train.
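One simple way to turn "monitor the rewards" into a quantitative check is to compare the mean reward over the first few batches with the mean over the last few. The sketch below assumes you log one mean reward per training batch; the helper `reward_trend` and the numbers are made up for illustration and are not real training results.

```python
from statistics import mean


def reward_trend(batch_rewards, window=10):
    """Return (early_mean, late_mean) over the first and last `window` batches."""
    if len(batch_rewards) < 2 * window:
        raise ValueError("need at least 2 * window batches to compare")
    return mean(batch_rewards[:window]), mean(batch_rewards[-window:])


# Hypothetical per-batch mean rewards (illustrative values, not measured)
rewards = [0.1 * i for i in range(20)]

early, late = reward_trend(rewards, window=5)
print(f"early={early:.2f} late={late:.2f} improved={late > early}")
```

If the late mean is not clearly above the early mean after a full run, that suggests the reward signal is not reaching the policy.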
Very cool work!
Does this work for T5/BART models as well?