Use chatGPT as baseline? #101

YannDubs · 2023-07-27T13:38:31Z

the number are getting close to 100% win rate we should consider recalibrating win rates by comparing to chatGPT

yuchenlin · 2023-08-30T22:23:38Z

+1

yuchenlin · 2023-08-30T22:24:45Z

although it seems that we can do that by ourselves via customizing the output json file. ~~It would be better to have an official release of the ChatGPT references though~~

The references using ChatGPT: https://github.com/tatsu-lab/alpaca_eval/blob/main/results/chatgpt/model_outputs.json
So I guess we can directly set the reference to this file for using ChatGPT as the baseline

rtaori · 2023-09-04T21:01:42Z

Yup this is supported as a setting. We currently don't have plans to make chatGPT as the default, so closing this for now.

rtaori closed this as completed Sep 4, 2023

YannDubs mentioned this issue Nov 13, 2023

Modify the baseline to a stronger model, such as ChatGPT #161

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Use chatGPT as baseline? #101

Use chatGPT as baseline? #101

YannDubs commented Jul 27, 2023

yuchenlin commented Aug 30, 2023

yuchenlin commented Aug 30, 2023 •

edited

Loading

rtaori commented Sep 4, 2023

Use chatGPT as baseline? #101

Use chatGPT as baseline? #101

Comments

YannDubs commented Jul 27, 2023

yuchenlin commented Aug 30, 2023

yuchenlin commented Aug 30, 2023 • edited Loading

rtaori commented Sep 4, 2023

yuchenlin commented Aug 30, 2023 •

edited

Loading