Planning-Driven Programming: A Large Language Model Programming Workflow

This repository contains the code and dataset for our paper Planning-Driven Programming: A Large Language Model Programming Workflow.

We propose an LLM programming workflow (LPW) designed to improve both initial code generation and subsequent refinements within a structured two-phase workflow. LPW sets new state-of-the-art Pass@1 accuracy, achieving 98.2% on HumanEval, 84.8% on MBPP, 59.3% on LiveCode, 62.6% on APPS, and 34.7% on CodeContest, using GPT-4o as the backbone.

News

- 🎉 Our paper got accepted in ACL 2025 (main track).

📦 Installation

conda create -n lpw python=3.10
conda activate lpw
python -m pip install -r requirements.txt

📈 Usage

Set Environment

If you use OpenAI models as backbones:

export OPENAI_API_KEY=[your OpenAI API Key]

For other open-source models like Phi-3 and Llama, we set up an OpenAI-compatible server based on vLLM. Here is the instruction Setup vLLM backbones.

Run LPW on Different Benchmarks

cd ./programming
--root_dir [output_dir] --name [running_configuration] --dataset_path [input_problem_dataset] --testfile [input_test_dataset] --strategy [code_generation_strategy] --model [LLM_models] --max_iters [code_plan_refinement_iteration] --port [port_number]

For example:

--root_dir ../output_data/APPS/GPT4o/ --name initial_experiment  --dataset_path ../input_data/APPS/dataset/probs.jsonl --testfile ../input_data/APPS/test/tests.jsonl --strategy lpw --model gpt-4o --max_iters 12 --port 8000

The output result is in ../output_data/APPS/GPT4o/.

Available options:

Option	Value
dataset	`HumanEval`, `MBPP`, `HumanEval-ET`,`MBPP-ET`, `MBPP-ET-3`,`LiveCode`, `APPS`, `CodeContests`
model	`gpt-3.5-turbo-0613`,`gpt-3.5-turbo-0125`, `gpt-4`(gpt-4-1106-preview), `gpt-4o`(gpt-4o-2024-05-13),`gpt-4o-mini`(gpt-4o-mini-2024-07-18), `Llama3`(Llama3-70b), `phi-3`(Phi-3-14b)

Parameter Settings

--name                 Set as name for a one-time experiment  
--strategy             Code generation method (only 'lpw' is supported)  
--max_iters            Maximum iterations for plan, plan verification, and code refinements (default: 12)

Setup vLLM backbones

We use the OpenAI compatible server based on vLLM. Please refer OpenAI-Compatible Server for detailed instructions to setup the local servers.

Please record the port number when starting the server and align it with the argument --port (default:8000 )

🐞 Bugs or Questions?

If you have any questions, feel free to post issues in this repo.

📑 Citation

If you find our work helpful, please cite us:

@misc{lei2025planningdrivenprogramminglargelanguage,
      title={Planning-Driven Programming: A Large Language Model Programming Workflow}, 
      author={Chao Lei and Yanchuan Chang and Nir Lipovetzky and Krista A. Ehinger},
      year={2025},
      eprint={2411.14503},
      archivePrefix={arXiv},
      primaryClass={cs.SE},
      url={https://arxiv.org/abs/2411.14503}, 
}
}

🙌 Acknowledgement

Our implementation adapts code from LDB. We thank their high-quality open source code!

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
assets		assets
input_data		input_data
programming		programming
LICENSE		LICENSE
README.md		README.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Planning-Driven Programming: A Large Language Model Programming Workflow

News

- 🎉 Our paper got accepted in ACL 2025 (main track).

📦 Installation

📈 Usage

Set Environment

Run LPW on Different Benchmarks

Setup vLLM backbones

🐞 Bugs or Questions?

📑 Citation

🙌 Acknowledgement

About

Uh oh!

Releases

Packages

Uh oh!

Contributors 2

Uh oh!

Languages

License

you68681/lpw

Folders and files

Latest commit

History

Repository files navigation

Planning-Driven Programming: A Large Language Model Programming Workflow

News

- 🎉 Our paper got accepted in ACL 2025 (main track).

📦 Installation

📈 Usage

Set Environment

Run LPW on Different Benchmarks

Setup vLLM backbones

🐞 Bugs or Questions?

📑 Citation

🙌 Acknowledgement

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors 2

Uh oh!

Languages

Packages