Add new model to the arena #2296

Merged on Aug 24, 2023 (4 commits)

renatz (Contributor) commented Aug 23, 2023

Hi, thanks for your brilliant work!

We want to support our model ReaLM in MT-Bench.
We have added the related information to conversation.py, model_adapter.py, model_registry.py, and model_support.md.
All the additions sit under the entries for our previous model, Phoenix.

The following links are the repos for the code and the model.
https://github.com/FreedomIntelligence/ReaLM
https://huggingface.co/FreedomIntelligence/ReaLM-7b

Thanks in advance!

merrymercy (Member) left a comment

LGTM. Left one minor comment.

@@ -1547,6 +1564,7 @@ def get_default_conv_template(self, model_path: str) -> Conversation:

# Note: the registration order matters.
# The one registered earlier has a higher matching priority.
register_model_adapter(ReaLMAdapter)
merrymercy (Member) commented

Move this to the end of the list.
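
For readers unfamiliar with this kind of registry, here is a minimal sketch of why the position of a registration can matter. It assumes only what the code comment in the diff states: adapters are checked in registration order and the first match wins. The class names, match patterns, and the `find_adapter` helper are hypothetical and are not taken from the project's code.

```python
from typing import List


class BaseAdapter:
    """Fallback adapter (hypothetical): accepts every model path."""

    def match(self, model_path: str) -> bool:
        return True


class ExistingAdapter(BaseAdapter):
    """An adapter that is already registered (hypothetical pattern)."""

    def match(self, model_path: str) -> bool:
        return "alpha-lm" in model_path.lower()


class NewAdapter(BaseAdapter):
    """A newly added adapter whose pattern happens to be looser."""

    def match(self, model_path: str) -> bool:
        return "lm" in model_path.lower()


def find_adapter(registry: List[BaseAdapter], model_path: str) -> BaseAdapter:
    """Return the first registered adapter that matches (first match wins)."""
    for adapter in registry:
        if adapter.match(model_path):
            return adapter
    raise ValueError(f"No adapter matches {model_path!r}")


# New adapter registered first: it shadows the existing adapter.
new_first = [NewAdapter(), ExistingAdapter(), BaseAdapter()]
# New adapter registered after the existing ones, before the catch-all.
new_last = [ExistingAdapter(), NewAdapter(), BaseAdapter()]

path = "org/alpha-lm-7b"
print(type(find_adapter(new_first, path)).__name__)  # NewAdapter
print(type(find_adapter(new_last, path)).__name__)   # ExistingAdapter
```

In this sketch, appending a new adapter after the existing ones keeps it from taking over model paths that earlier adapters already handle, which is one plausible reading of the request above.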

renatz (Contributor Author) commented

> Move this to the end of the list.

Done, thanks!

renatz (Contributor Author) commented Aug 24, 2023

> Move this to the end of the list.

We also ran inference and evaluation for our model ourselves. Do you need the JSON file?

merrymercy (Member) commented

Do you mean the MT-Bench JSON files? Please share them via a public link, and we may add them to our database later.

renatz (Contributor Author) commented Aug 24, 2023

> Do you mean the MT-Bench JSON files? Please share them via a public link, and we may add them to our database later.

Yes. They have been uploaded to our repo:
https://github.com/FreedomIntelligence/ReaLM/tree/main/data/mt-bench

renatz (Contributor Author) commented Sep 8, 2023

> Do you mean the MT-Bench JSON files? Please share them via a public link, and we may add them to our database later.

Hi, could you let us know when your group will add our model to MT-Bench?
We achieved the best MT-Bench scores among 7B models with fewer samples and a shorter context length, using a trainable simulator.

merrymercy merged commit 5c52ec7 into lm-sys:main on Aug 24, 2023
1 check passed
renatz deleted the add-new-model-to-the-arena branch on August 24, 2023, 09:12
renatz (Contributor Author) commented Oct 8, 2023

Sorry, my boss wants to rename the model. Can I open another PR?
Also, if we publish our 13B version and the detailed tech report v1.1, can we join the MT-Bench benchmark? We ran the single evaluation 5 times and achieved a mean score of 6.29, ranking first among the 7B models.
The following links are the repos for the code, the model, and tech report v1.0.

GitHub:
https://github.com/FreedomIntelligence/PlatoLM
Hugging Face:
https://huggingface.co/FreedomIntelligence/PlatoLM-7B
Tentative technical report (to be updated within a week):
https://arxiv.org/abs/2308.11534
