Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

BFCL May 14th Release (GPT-4o and Gemini) #426

Merged
merged 13 commits into from
May 15, 2024

Conversation

Fanjia-Yan
Copy link
Contributor

@Fanjia-Yan Fanjia-Yan commented May 15, 2024

This PR makes 3 models(4 entries) available for inference on BFCL:

  • gpt-4o-2024-05-13 (Function Calling Mode and Prompting Mode)
  • gemini-1.5-pro-preview-0514 (Function Calling Mode)
  • gemini-1.5-flash-preview-0514(Function Calling Mode)

You can start the evaluation by running python openfunctions_evaluation.py --model MODEL_NAME and get score by running python ./eval_runner.py --model MODEL_NAME. For more detail, refer to Readme under the BFCL page.

Score changes are reflected in #428 .

This PR also updated different models' pricing:

  • For Gemini, when prompts are less than 128K tokens, the new Gemini series' prices are lowered by around half(https://ai.google.dev/pricing). All the BFCL test cases are less than 128K tokens.
  • For Anthropic Models, the prices have decreased for claude-2.1 and Claude-instant-1.2 which have updated accordingly
  • For Mistral Models, the prices have been halved for Mistral-large and Mistral-Small
  • For OpenAI Models, we have corrected GPT-3.5-turbo-0125 to the price it should have

@Fanjia-Yan Fanjia-Yan marked this pull request as ready for review May 15, 2024 10:05
@CharlieJCJ
Copy link
Contributor

Reviewed costs and synced the most up-to-date costs with various model providers.

Copy link
Collaborator

@HuanzhiMao HuanzhiMao left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Ready to ship

@ShishirPatil ShishirPatil merged commit 5da8835 into ShishirPatil:main May 15, 2024
ShishirPatil pushed a commit that referenced this pull request May 15, 2024
…mini) (#428)

As mentioned in #426, this PR addes 4 new models to the leaderboard. The
model costs are also updated accordingly.

This PR **DOES** change the leaderboard ranking.

This PR **DOES NOT** change the leaderboard score other than the added
models.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

4 participants