Code for the paper Self-Polish: Enhance Reasoning in Large Language Models via Problem Refining.
We use an object-oriented programming style and decouple the code on the question side from the code on the answer side, so strategies from the two sides can be combined for better performance. We currently implement the following methods:
- Problem Side: There are two strategies for problems:
- Answer Side: There are three strategies for answer generation:
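The decoupling of the problem side and the answer side can be pictured with a minimal sketch. All class and function names here (`ProblemStrategy`, `AnswerStrategy`, `KeepProblem`, `RewriteProblem`, `DirectAnswer`, `Pipeline`, `fake_llm`) are hypothetical and chosen for illustration; they are not the names used in this repository, and the language model is replaced by a plain stub function.

```python
from dataclasses import dataclass
from typing import Callable, Protocol


class ProblemStrategy(Protocol):
    """Problem side: turns an input problem into the text that gets answered."""
    def refine(self, problem: str) -> str: ...


class AnswerStrategy(Protocol):
    """Answer side: produces an answer for a (possibly refined) problem."""
    def answer(self, problem: str) -> str: ...


class KeepProblem:
    """Hypothetical problem-side baseline: leave the problem unchanged."""
    def refine(self, problem: str) -> str:
        return problem


class RewriteProblem:
    """Hypothetical problem-side strategy: rewrite the problem with a callable."""
    def __init__(self, rewrite: Callable[[str], str]) -> None:
        self.rewrite = rewrite

    def refine(self, problem: str) -> str:
        return self.rewrite(problem)


class DirectAnswer:
    """Hypothetical answer-side strategy: pass the problem straight to a model."""
    def __init__(self, llm: Callable[[str], str]) -> None:
        self.llm = llm

    def answer(self, problem: str) -> str:
        return self.llm(problem)


@dataclass
class Pipeline:
    """Combines one problem-side strategy with one answer-side strategy."""
    problem_side: ProblemStrategy
    answer_side: AnswerStrategy

    def run(self, problem: str) -> str:
        return self.answer_side.answer(self.problem_side.refine(problem))


def fake_llm(prompt: str) -> str:
    # Stand-in for a real model call, so the sketch runs offline.
    return f"answer to: {prompt}"
```

Because the two sides only meet inside `Pipeline.run`, any problem-side strategy can be paired with any answer-side strategy, for example `Pipeline(RewriteProblem(str.strip), DirectAnswer(fake_llm)).run("  2+2?  ")`.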
We also list some test prompts to rewrite problems in the prompt directory.
sh test_normal.sh
sh test_sp.sh
- Note that you should set your OpenAI API key in test_Normal_with_exception.py or test_SP_with_exception.py:
keys = ["Your Key"]
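If you would rather not hard-code the key, one possible alternative is to fill the `keys` list from an environment variable. This is only a sketch: the helper `load_keys` and the variable name `OPENAI_API_KEYS` are assumptions made for illustration and are not part of the test scripts.

```python
import os
from typing import List


def load_keys(env_var: str = "OPENAI_API_KEYS") -> List[str]:
    """Read a comma-separated list of API keys from an environment variable.

    The variable name is a hypothetical choice, not something the scripts define.
    Returns an empty list when the variable is unset or blank.
    """
    raw = os.environ.get(env_var, "")
    return [key.strip() for key in raw.split(",") if key.strip()]


# Fall back to the placeholder from the scripts when nothing is configured.
keys = load_keys() or ["Your Key"]
```

With this in place, running `export OPENAI_API_KEYS="key1,key2"` before the test scripts would supply the keys without editing the Python files.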