Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

How to dynamically change API parameters through oaieval CLI tool? #512

Closed
leodiegues opened this issue Mar 29, 2023 · 2 comments
Closed

Comments

@leodiegues
Copy link

I'm trying to run an eval based on evals.elsuite.basic.match:Match class but I can't find a way to modify API parameters such as temperature and max_tokens through the CLI. What is the current solution that the framework offers to this problem?

@leodiegues leodiegues changed the title How to dynamically change temperature through oaieval CLI tool? How to dynamically change API parameters through oaieval CLI tool? Mar 29, 2023
@andrew-openai
Copy link
Contributor

andrew-openai commented Mar 29, 2023

It's a good idea! I added a --modelspec_extra_options that would support setting those values from the CLI, e.g.

oaieval gpt-3.5-turbo test-match --modelspec_extra_options max_tokens=50,temperature=1

Edit: Although that said, I am curious why you want nonzero temperature for evals, usually we want to set that value to 0 for determinism across runs.

@leodiegues
Copy link
Author

leodiegues commented Mar 30, 2023

Wow! Thank you so much for your help @andrew-openai . About the temperature, it doesn't make any sense changing it at all for match evals. You're completely right. Actually, I was just trying to mess around with API parameters in general to better understand how to work with Portuguese prompts and responses. For a moment I thought temperature could be something to that I should change but I wasn't thinking straight anymore I guess.

Again, thank you so much @andrew-openai !

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants