[P1] Confirmation of alpaca_eval version #73

Closed
BaohaoLiao opened this issue May 1, 2024 · 4 comments
Assignees: frankaging
Labels: question (Further information is requested)

@BaohaoLiao commented May 1, 2024

Hi,

I want to confirm the version of alpaca_eval. In the paper and the README you say you use v1.0. May I ask how you installed it? If I install it as you recommend:

pip install alpaca-eval

I get the latest version. Should I instead install it as:

pip install alpaca-eval==v0.1.0
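
For reference, a quick way to list which versions are published on PyPI before pinning one (pip's index subcommand is still marked experimental, and the version string below is only a placeholder):

pip index versions alpaca-eval
pip install alpaca-eval==<version>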

frankaging self-assigned this on May 2, 2024
frankaging added the question (Further information is requested) label on May 2, 2024
@frankaging (Collaborator) commented May 2, 2024

@BaohaoLiao hey, thanks for your question! The version here is fine; you can install the newest release.

Alpaca-Eval v1.0 means we evaluate against text-davinci-003, not against GPT-4. You can follow this guide to run the evaluation:
https://github.com/stanfordnlp/pyreft/tree/main/examples/loreft#offline-evaluation-with-alpaca-eval-v10

Alpaca-Eval v2.0 uses GPT-4. To be competitive on v2.0, please train with a longer sequence length; we train with a max sequence length of 768, which is quite limited, but we keep it for a fair comparison with previous works.
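
As a rough sketch, once you have generated model outputs, an evaluation can be kicked off with the alpaca_eval CLI roughly as below -- the output file name and annotator config here are placeholders, and the exact command in the loreft README linked above is the one to follow:

export OPENAI_API_KEY=<your_api_key>
alpaca_eval --model_outputs <your_model_outputs.json> --annotators_config alpaca_eval_gpt4

Depending on the installed alpaca_eval version, the default reference outputs may correspond to the v1.0 setting (text-davinci-003) or the newer v2.0 baseline; passing --reference_outputs lets you set the reference explicitly.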

frankaging changed the title from "Confirmation of alpaca_eval version" to "[P1] Confirmation of alpaca_eval version" on May 2, 2024
@frankaging (Collaborator)

I am closing this issue; feel free to reopen if you have more questions! ty!

@BaohaoLiao (Author)

Thank you for the details. Is it possible to release the generated outputs for alpaca_eval, i.e., the outputs from Llama-2 7B & LoReFT (ours) in Table 3?

It would be great if you could also release the evaluation results from alpaca_eval.

@frankaging (Collaborator)

@BaohaoLiao unfortunately, these results have been deleted from my local disk -- but they should be very easy to reproduce. I think you just need to follow the command we give in the README. Let me know if you encounter any issues.
