update(openai): Support "oaieval" and "oaievalset" as executables within the "openai" shell plugin #208

arunsathiya · 2023-03-15T06:56:39Z

This PR introduces a Shell Plugin for OpenAI's new Evals framework, which was announced alongside the GPT-4 language model.

GPT-4 homepage: https://openai.com/product/gpt-4
Evals framework homepage: https://github.com/openai/evals

This is a fairly straightforward shell plugin: the Evals CLI expects an OPENAI_API_KEY environment variable containing the OpenAI API key, which can be obtained on the OpenAI API Keys page. There is no CLI flag that accepts the API key, nor a configuration file.

The API key can also be set within an individual Evals module's code, either directly with openai.api_key or by pointing openai.api_key_path at a file that contains the key. Neither option is applicable to Shell Plugins, because both live inside the Evals module code. The error raised when no key is provided lists these options:

No API key provided. You can set your API key in code using 'openai.api_key = <API-KEY>', or you can set the environment variable OPENAI_API_KEY=<API-KEY>). If your API key is stored in a file, you can point the openai module at it with 'openai.api_key_path = <PATH>'. You can generate API keys in the OpenAI web interface. See https://onboard.openai.com for details, or email support@openai.com if you have any questions.
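To illustrate why the environment-variable route is sufficient, here is a minimal Python sketch of injecting OPENAI_API_KEY into a child process only for the duration of a run. It is not the shell plugin's actual Go implementation; the helper names build_env and run_with_key are hypothetical, the key value is a placeholder, and a child Python process stands in for oaieval.

```python
import os
import subprocess
import sys


def build_env(api_key, base=None):
    """Return a copy of the base environment with OPENAI_API_KEY set.

    Hypothetical helper: the parent's own environment is never modified.
    """
    env = dict(os.environ if base is None else base)
    env["OPENAI_API_KEY"] = api_key
    return env


def run_with_key(argv, api_key):
    """Run a command so that the key is visible only to the child process."""
    return subprocess.run(
        argv, env=build_env(api_key), capture_output=True, text=True
    )


if __name__ == "__main__":
    # Stand-in for `oaieval ...`: a child process that reports whether
    # it can see the variable. "sk-placeholder" is not a real key.
    child = [sys.executable, "-c",
             "import os; print('OPENAI_API_KEY' in os.environ)"]
    result = run_with_key(child, "sk-placeholder")
    print(result.stdout.strip())
```

Because the variable is set on a copy of the environment, the key never needs to be exported in the user's shell profile.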

Here's how to test

1. Clone the Evals repository and set it up as outlined under the Making evals section.
2. Check out the current PR branch and run make evals/build.
3. Set up the API key in 1Password by running op plugin init oaieval (the API key can be obtained from the OpenAI API Keys page).
4. Future runs of oaieval will use the API key stored in 1Password.
5. To test, run an example eval like oaieval gpt-3.5-turbo test-match and ensure it runs successfully.

Blockers

The Evals framework project introduces not only the oaieval CLI command but also oaievalset, which is used to test eval sets.
As I understand it (from the flyctl PR), there is currently no way to introduce more than one executable command for a shell plugin without changes to the op CLI code, so I am not sure how to proceed here.

hculea · 2023-03-21T09:49:34Z

Hey Arun, thank you for your contribution!

I am wondering, what similarity does this plugin bear with the openai plugin? https://github.com/1Password/shell-plugins/blob/main/plugins/openai

Would it make sense to add another executable to the same plugin, or perhaps at least reuse the credential definition?

arunsathiya · 2023-03-28T08:15:55Z

Hi @hculea, great question! OpenAI Evals is a different project from OpenAI's openai CLI project. The latter provides access to the OpenAI API, while Evals is only a framework/benchmark registry for OpenAI models. It allows the general public to contribute evaluations that help understand where GPT-3.5 (the current ChatGPT model) and GPT-4 (the newest model) stand.

Also, the OpenAI CLI's executable is openai while the Evals framework's executable is oaieval, which means we cannot include both of them in the same Shell Plugin.

hculea · 2023-03-29T08:59:03Z

Thanks for clarifying! 😄 Is the credential definition the same between the two, though?

I see that both use an envvar provisioner with OPENAI_API_KEY, but I'm not sure whether the credential composition is the same across the two of them.

edit: just to clarify, the rest looks good to me. This comment only aims to reduce some code duplication.

arunsathiya · 2023-03-29T10:07:30Z

I haven't compared the credential composition so far, but the main blocker is that we cannot configure a Shell Plugin with two executables. I've started a conversation here:

Ability to support more than one executable #230

Will mark this PR as a draft until that's a possibility.


arunsathiya · 2023-04-24T07:34:13Z

@hculea Following the same guidance as in the flyctl PR, the evals shell plugin has been completely dropped, and the oaieval and oaievalset executables are now supported within the openai shell plugin. After all, those two executables and openai use the same credential composition.

Please let me know how things look now.
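The resulting arrangement, one plugin whose executables all share a single credential, can be pictured with a small conceptual model. This Python sketch is illustrative only and does not reflect the shell-plugins Go SDK; the Credential and Plugin classes and the credential_for method are hypothetical names introduced for this example.

```python
from dataclasses import dataclass, field


@dataclass(frozen=True)
class Credential:
    """Hypothetical credential definition: a name plus the env var it fills."""
    name: str
    env_var: str


@dataclass
class Plugin:
    """Hypothetical plugin: one credential shared by several executables."""
    name: str
    credential: Credential
    executables: list = field(default_factory=list)

    def credential_for(self, executable):
        # Every executable registered on the plugin resolves to the same
        # credential; unknown executables are rejected.
        if executable not in self.executables:
            raise KeyError(f"{executable!r} is not part of plugin {self.name!r}")
        return self.credential


# The key is defined once and reused by all three executables.
API_KEY = Credential(name="API Key", env_var="OPENAI_API_KEY")
OPENAI = Plugin("openai", API_KEY, ["openai", "oaieval", "oaievalset"])
```

Defining the credential once avoids the duplication raised earlier in the review: adding an executable only extends the list.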

plugins/openai/oaievalset.go

accraw

Tested, it worked for me!

arunsathiya changed the title ~~new(OpenAI Evals): New Plugin with Environment variable based importing and provisioning~~ new(evals): OpenAI Evals plugin with Environment variable based importing and provisioning Mar 15, 2023

arunsathiya mentioned this pull request Mar 29, 2023

Ability to support more than one executable #230

Open

arunsathiya marked this pull request as draft March 29, 2023 10:07

arunsathiya and others added 2 commits April 24, 2023 00:27

Environment variable based importing and provisioning for OpenAI Evals

a802fe3

Drop evals shell plugin and just configure oaieval as another executable within openai shell plugin

6f463be

arunsathiya force-pushed the add/evals branch from 1c36b4c to 6f463be Compare April 24, 2023 07:28

arunsathiya added 2 commits April 24, 2023 00:29

Fix incorrect oaieval executable file name

1df1633

Support oaievalset as another executable within the openai shell plugin

3dc5674

arunsathiya changed the title ~~new(evals): OpenAI Evals plugin with Environment variable based importing and provisioning~~ update(openai): Support "oaieval" and "oaievalset" as executables within the "openai" shell plugin Apr 24, 2023

arunsathiya marked this pull request as ready for review April 24, 2023 07:32

florisvdg reviewed Apr 24, 2023


plugins/openai/oaievalset.go (outdated, resolved)

Update documentation URL for both oaieval and oaievalset

a1e226c

AndyTitu added the waiting-on-reviewer label Apr 26, 2023

accraw approved these changes Apr 26, 2023


accraw added waiting-on-sec-review and removed waiting-on-reviewer labels Apr 26, 2023

hculea added waiting-on-reviewer and removed waiting-on-sec-review labels Jun 2, 2023

jpcoenen approved these changes Jun 27, 2023


accraw merged commit 1e4048c into 1Password:main Jun 28, 2023
2 checks passed
