Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add TempNet-LLaMA2-Chat to AlpacaEval #264

Merged
merged 2 commits into from
Apr 2, 2024
Merged

Conversation

xumao-nju
Copy link
Contributor

Add TempNet-LLaMA2-Chat to AlpacaEval, including TempNet-LLaMA2-Chat-7B-v0.1, TempNet-LLaMA2-Chat-13B-v0.1 and TempNet-LLaMA2-Chat-70B-v0.1

claude-3-opus-20240229,29.04176413403727,1.3942602231385623,223,580,2,805,27.82608695652174,minimal,1388,40.39177606350116
gpt4,23.576789314782605,1.275704201206918,179,618,8,805,22.732919254658384,minimal,1365,38.12808974440021
aligner-2b_qwen1.5-72b-chat,31.773037737123104,1.2392772646245978,180,473,152,805,31.801242236024844,community,1812,36.725868878524274
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

seems like you didn't merge to the last main branch? those models should be in the lb!

@@ -0,0 +1,13 @@
TempNet-LLaMA2-Chat-13B-v0.1:
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

These files should be called configs.yaml not config.yaml

@YannDubs
Copy link
Collaborator

YannDubs commented Apr 2, 2024

Nice results @xumao-nju 💯
Please make the two changes above and I'll merge the PR!

@xumao-nju
Copy link
Contributor Author

I'm sorry I didn't notice these errors earlier. They have been fixed now. Thank you!

@YannDubs YannDubs merged commit 13f6bdc into tatsu-lab:main Apr 2, 2024
2 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

3 participants