Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Using pretrained models #90

Closed
syleedandekar opened this issue Mar 19, 2024 · 3 comments
Closed

Using pretrained models #90

syleedandekar opened this issue Mar 19, 2024 · 3 comments

Comments

@syleedandekar
Copy link

The paper mentions that you performed end-to-end validation of AlpacaFarm. Do you have the code up on Github for that? I want to use the LLM pre-trained on human preferences to generate some more preferences.

@YannDubs
Copy link
Collaborator

@syleedandekar
Copy link
Author

syleedandekar commented Mar 21, 2024

I've been trying to generate text using ppo-human but I've just been getting gibberish. It works fine when I use LLama2. Is there an example in AlpacaEval I can refer to?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants