clarification on annotation entries from alpaca_eval #223

Closed
xwinxu opened this issue Feb 1, 2024 · 4 comments · Fixed by #224
Comments

@xwinxu
Contributor

xwinxu commented Feb 1, 2024

I wanted to clarify the notation in the alpaca_eval command. Let's say I passed in --model_outputs my_model_A.json and --reference_outputs my_model_B.json. The resulting annotations.json will contain the keys output_1, output_2, and a ranking under raw_completion:

    "raw_completion":{
      "concise_explanation":"....",
      "ordered_models":[
        {
          "model":"M",
          "rank":1
        },
        {
          "model":"m",
          "rank":2
        }
      ]
    }

According to this template, m corresponds to output_1 and M corresponds to output_2, correct? Would this then mean that model_outputs also corresponds to m? I'm not sure which interpretation is correct.

Thanks!

xwinxu changed the title from "clarification on model outputs used for evaluation" to "clarification on annotation entries from alpaca_eval" on Feb 1, 2024
@rtaori
Collaborator

rtaori commented Feb 1, 2024

Do you see a field referenced_models alongside? It should look something like this:
"referenced_models": { "M": "output_1", "m": "output_2" }
which should be easier to understand and hopefully resolves the confusion.

@YannDubs
Collaborator

YannDubs commented Feb 1, 2024

@xwinxu check out the readme: the rank tells you which model was preferred. So if preference = 2 (i.e. output_2 is preferred) and M has rank 1 (i.e. it is the preferred model), then you know that M corresponds to output_2. The reason all of this is a bit complicated is that we randomize the order of the outputs when evaluating.

I'll add the referenced_models field that Rohan is referring to; it is currently only added for sample sheets, so you won't see it locally.

Note that your reference model will always be output_1.
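For what it's worth, here is a minimal sketch of that logic in Python (assuming an annotations.json structured like the snippet above, with a per-row preference field; the file path is a placeholder and ties are ignored):

    import json

    # Placeholder path; point this at the annotations.json written by alpaca_eval.
    with open("annotations.json") as f:
        annotations = json.load(f)

    for ann in annotations:
        raw = ann["raw_completion"]
        # 1 -> output_1 preferred, 2 -> output_2 preferred (ties not handled here).
        preference = int(ann["preference"])

        # The rank-1 model is the preferred one, so it maps to the output that
        # `preference` points at; the rank-2 model maps to the other output.
        by_rank = {m["rank"]: m["model"] for m in raw["ordered_models"]}
        mapping = {
            by_rank[1]: f"output_{preference}",
            by_rank[2]: f"output_{3 - preference}",
        }
        print(mapping)  # e.g. {'M': 'output_2', 'm': 'output_1'}

This recovers the same mapping that the referenced_models field will spell out directly.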

@YannDubs
Collaborator

YannDubs commented Feb 1, 2024

Done @xwinxu. To add referenced_models you need to (1) update alpaca_eval and (2) rerun the parsing using the flag --is_reapply_parsing True. Note that this won't rerun the actual OAI annotations, which are cached.
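For example, assuming alpaca_eval was installed via pip, the re-run could look something like this (the output paths reuse the placeholders from the original question):

    pip install -U alpaca_eval
    alpaca_eval --model_outputs my_model_A.json \
        --reference_outputs my_model_B.json \
        --is_reapply_parsing True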

@xwinxu
Contributor Author

xwinxu commented Feb 1, 2024

Thanks Yann!
