Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add support for Smaug-2. #3211

Merged
merged 1 commit into from
Apr 12, 2024
Merged

Add support for Smaug-2. #3211

merged 1 commit into from
Apr 12, 2024

Conversation

arkapal3
Copy link
Contributor

@arkapal3 arkapal3 commented Apr 1, 2024

Hello! We would like to submit our new model, Smaug-2-72B to the Chatbot Arena.

Smaug-2-72B has the highest MT-Bench scores of any open-source LLM, scoring approximately 0.2 higher than Qwen1.5-72B-Chat which it is based on (and nearly 0.3 higher on the first turn). In further internal testing on a wider range of prompts than MT-Bench, we find that Smaug-2-72B maintains a lead over Qwen1.5-72B-Chat and in some cases is nearly at GPT4 level.

This PR has what we surmise are the main changes necessary to support Smaug-2-72B, but please let us know if there is anything else you would need from us. Also - we can serve the model ourselves, if necessary.

@infwinston infwinston merged commit 7524a58 into lm-sys:main Apr 12, 2024
1 check passed
adamlin120 pushed a commit to adamlin120/FastChat that referenced this pull request May 13, 2024
* original_lmsys/operation: (70 commits)
  format
  update
  update
  update
  format
  update
  update
  update
  Small fix in clean_chat_data (lm-sys#3285)
  support llama3 (lm-sys#3259)
  Fix bug in gradio_web_server.py (lm-sys#3269)
  Register SmaugChatAdapter. (lm-sys#3243)
  update
  Code update (lm-sys#3194)
  Store Images Remotely on GCS (lm-sys#3172)
  format
  remove
  format
  update
  Add support for Smaug-2. (lm-sys#3211)
  ...
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

2 participants