Add Aligner-2B+Qwen1.5-72B-Chat & Aligner-2B+Claude3 Opus to AlpacaEval #259

Merged: 3 commits into tatsu-lab:main on Mar 22, 2024

AlignInc
Contributor

We would like to add Aligner-2B+Qwen1.5-72B-Chat & Aligner-2B+Claude3 Opus to AlpacaEval 2.0. Thank you for such a valuable leaderboard!

This is a reproduction of the paper "Aligner: Achieving Efficient Alignment through Weak-to-Strong Correction".

arXiv URL: https://arxiv.org/pdf/2402.02416.pdf

Core insight: It is easier to learn the correctional residual between bad and good answers than to learn to generate good answers directly.

[Figure: main-paradigm]

@YannDubs
Collaborator

Wow @AlignInc, those are amazing results, and it's really cool that you can run your method on any (even closed) model!

Unfortunately, I just merged the length-controlled (LC) PR to main, and as a result there's a merge conflict. Can you please pull from main and run alpaca_eval --model_outputs … --is_recompute_metrics_only True? That will compute the LC win rate without requiring any new annotations. Sorry for that!

The good news is that your model should perform even better on LC AlpacaEval.

Lmk if you face any issues!

@AlignInc
Contributor Author

Hi! @YannDubs,
Can you check it again? We have already resolved the conflict and updated it. Thank you again for such a valuable leaderboard!

@YannDubs
Collaborator

Why add cohere to the requirements?
The files look good to me besides that.

@@ -35,7 +35,7 @@
 ]
 PACKAGES_ALL_API = [
     "anthropic>=0.18",
-    "cohere",
+    "cohere<5.0.0a0",
Contributor Author

We are importing class cohere.CohereError here:

from cohere import CohereError

CohereError was removed in cohere v5, which was released yesterday (see the release history on PyPI).

cohere.CohereError in cohere v4: https://github.com/cohere-ai/cohere-python/blob/v4/cohere/__init__.py#L5
cohere.*Error in cohere v5: https://github.com/cohere-ai/cohere-python/blob/67620c348329308186d0b7e771a06795ea718226/src/cohere/__init__.py#L122-L130
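
For illustration only, here is a minimal sketch of a different way to cope with the missing name: look it up defensively instead of pinning the version. This is not what the PR does (the PR pins cohere below v5). The helper name load_cohere_error and the fallback to the built-in Exception are assumptions made for this example.

```python
import importlib


def load_cohere_error():
    """Return cohere.CohereError when it exists, else the built-in Exception.

    Hypothetical helper for illustration; the PR pins cohere<5.0.0a0 instead.
    """
    try:
        module = importlib.import_module("cohere")
    except ImportError:
        # cohere is an optional dependency and may not be installed at all.
        return Exception
    # cohere v4 exposes CohereError at the top level; per the comment above,
    # v5 no longer does, so fall back to a broad exception type.
    return getattr(module, "CohereError", Exception)


CohereError = load_cohere_error()
```

The fallback is deliberately coarse: catching Exception is broader than catching CohereError, which is one reason a version pin is the simpler fix here.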

Collaborator

That makes sense, thanks!

requirements.txt Outdated
@@ -1,3 +1,4 @@
+cohere<5.0.0a0
Collaborator

But cohere should not be a main requirement, so please remove this line!
It can be a requirement in setup.py, installed only if you use [all].
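
A rough sketch of what that separation could look like, assuming the optional API clients are collected in a list and exposed through an "all" extra. Only PACKAGES_ALL_API and its two entries come from the diff above; build_extras and EXTRAS_REQUIRE are hypothetical names, and the project's actual setup.py may be organized differently.

```python
# Optional API client packages, as shown in the diff under review.
PACKAGES_ALL_API = [
    "anthropic>=0.18",
    "cohere<5.0.0a0",  # pinned because CohereError is gone in cohere v5
]


def build_extras(all_api_packages):
    """Map the 'all' extra to a de-duplicated, sorted requirement list.

    Hypothetical helper; the real setup.py may build its extras differently.
    """
    return {"all": sorted(set(all_api_packages))}


# This dict would be passed as setuptools.setup(..., extras_require=EXTRAS_REQUIRE),
# keeping cohere out of install_requires and requirements.txt.
EXTRAS_REQUIRE = build_extras(PACKAGES_ALL_API)
```

With this layout a plain install does not pull in cohere; only an install that requests the [all] extra does.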

@AlignInc requested a review from @YannDubs on March 22, 2024 18:29
@YannDubs merged commit d7ff7c9 into tatsu-lab:main on Mar 22, 2024
1 check passed
@YannDubs
Collaborator

Congrats @AlignInc, those are really impressive results and I’m looking forward to seeing how the community picks them up 💯

sorry for the additional work you had to do for this PR!

@AlignInc
Contributor Author

Thanks for your time~
