Add option to dump prompts and completions to a JSON file #492
Hmmm. We wrote something like this for the BigScience fork but I guess never upstreamed it? It's also worth noting that we currently support saving examples (but not completions) via
python write_out.py \
    --tasks hellaswag \
    --num_fewshot 5 \
    --num_examples 10 \
    --output_base_path /path/to/output/folder
This has the advantage of not requiring a GPU / running a model, so I think it's likely worth keeping? Maybe we can make the two set-ups consistent in naming conventions for usability though. |
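For readers skimming the thread, here is a minimal sketch of what dumping prompts and completions to a per-task JSON file could look like. It is not the code from this PR or from write_out.py: the function name dump_eval_records, the record fields (doc_id, prompt, completion), and the output file name are all hypothetical and chosen only for illustration.

```python
import json
import tempfile
from pathlib import Path


def dump_eval_records(records, output_base_path, task_name):
    """Write a list of prompt/completion records to one JSON file per task.

    Hypothetical helper: the name, record fields, and file naming are
    assumptions for illustration, not the harness's actual layout.
    """
    out_dir = Path(output_base_path)
    out_dir.mkdir(parents=True, exist_ok=True)  # create the folder if missing
    out_file = out_dir / f"{task_name}_detailed_eval_info.json"
    with out_file.open("w", encoding="utf-8") as f:
        # ensure_ascii=False keeps non-English prompts readable in the file
        json.dump(records, f, indent=2, ensure_ascii=False)
    return out_file


# Small demo with made-up records, written to a temporary folder.
with tempfile.TemporaryDirectory() as tmp:
    demo_records = [
        {"doc_id": 0, "prompt": "Q: 2 + 2 = ?\nA:", "completion": " 4"},
        {"doc_id": 1, "prompt": "Q: 3 + 5 = ?\nA:", "completion": " 8"},
    ]
    path = dump_eval_records(demo_records, tmp, "demo_task")
    print(path.name)
```

Writing one file per task under a shared base folder is one way the two set-ups could share a naming convention, since a prompts-only dump would simply omit the completion field.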
Yes, I have used |
Call the flag |
@StellaAthena I've updated parameter names and added docs. |
Looks good. I’ll run it in the morning and if nothing weird happens merge it. |
@StellaAthena is this ready to merge? |
@juletx thanks for the ping! I had forgotten about this. |
Add option to dump prompts and completions to a JSON file
This PR adds the --write_detailed_eval_info parameter to dump JSON with prompts and completions. The output path can be chosen with --detailed_eval_info_path. This PR is based on this commit: OpenGPTX@d6f84c4.
It is helpful for debugging and for exploring model outputs. It is handy for debugging greedy_until tasks such as GSM8K. I have used it to debug MGSM in this PR: #426. Here is an example: