
Support running Evals on specific tasks #45

Open
avivex1000 opened this issue Aug 3, 2023 · 0 comments
Labels: enhancement

Comments

@avivex1000
Member

In the current implementation, the evals suite is not easily accessible: it cannot be run without downloading the source code. The proposed implementation would expose an importable evals runner, allowing users to define the models to evaluate against and then run the evaluations on a given task.

Suggested usage:

from declarai import Declarai
from declarai.evals import Evaluator

# Models to evaluate the task against
models = [
    Declarai(provider="openai", model="gpt-3.5-turbo"),
    Declarai(provider="openai", model="gpt-4"),
]

def test_task() -> str:
    """
    say something
    """

# Run the task against each configured model
evaluator = Evaluator(models=models)
evaluator.run(test_task)
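
For illustration, a minimal sketch of what the Evaluator could look like internally. It assumes Declarai's task decorator can be applied programmatically via model.task(...); the Evaluator name and run() signature follow the suggested usage above, and keying the results by list position is purely illustrative.

from typing import Callable, Dict, List

from declarai import Declarai


class Evaluator:
    def __init__(self, models: List[Declarai]):
        self.models = models

    def run(self, task_fn: Callable[[], str]) -> Dict[int, str]:
        # Assumption: each Declarai instance exposes a `task` decorator
        # that turns a plain function into an executable LLM task.
        results = {}
        for i, model in enumerate(self.models):
            task = model.task(task_fn)
            results[i] = task()
        return results

Returning the raw per-model outputs would let callers compare results directly or plug in their own scoring on top.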
avivex1000 added the enhancement label on Aug 3, 2023