The Best Results. #42
Comments
Hi @liuxingpeng520521 and the author, I am also trying to reproduce the results presented in the paper. I am using the following command to run the experiment.
However, none of the lemmas in simple_benchmark_lean were successfully proved. Could you please help me by providing the correct command?
Can you share your log files in the |
Sure. Please find the logs in all-logs.zip. I think there might be an issue with the prompt template.
I used the following settings, and it works smoothly:

    defaults:
      - env_settings: bm25_retrieval
      - benchmark: simple_benchmark_lean
      - eval_settings: n_60_dfs_gpt4_128k_always_retrieve_no_ex
      - prompt_settings: lean_dfs
      - override hydra/job_logging: 'disabled'
    eval_settings:
      timeout_in_secs: 200
      proof_retries: 1
      temperature: 0

You can change your |
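For anyone who prefers to pass the `eval_settings` values on the command line rather than editing the YAML, here is a rough sketch of how they map to Hydra-style dotted overrides (`key.subkey=value`). It assumes the project's entry point is a standard Hydra app that accepts such overrides; the entry-point script is not shown in this thread, so none is named here. The helper `to_hydra_overrides` is hypothetical and not part of the repository.

```python
def to_hydra_overrides(settings, prefix=""):
    """Flatten a nested dict into Hydra-style `a.b=value` override strings.

    Hypothetical helper for illustration; not part of the project.
    """
    overrides = []
    for key, value in settings.items():
        path = f"{prefix}.{key}" if prefix else key
        if isinstance(value, dict):
            # Recurse into nested sections, extending the dotted path.
            overrides.extend(to_hydra_overrides(value, path))
        else:
            overrides.append(f"{path}={value}")
    return overrides


# The eval_settings values quoted in the comment above.
eval_overrides = to_hydra_overrides(
    {
        "eval_settings": {
            "timeout_in_secs": 200,
            "proof_retries": 1,
            "temperature": 0,
        }
    }
)

# Join into a string that could be appended to a Hydra app invocation.
print(" ".join(eval_overrides))
```

Whether these overrides take effect depends on the keys existing in the selected `eval_settings` config group, so treat this as a starting point rather than a confirmed command.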
Can you please show the log file of your best results on GPT-4? And have you done any testing on the 13 test files presented in Proverbot 9001, and if so, what were the results?