This repository contains CodePromptEval, a dataset based on the functions of the CoderEval Python dataset (Yu et al., 2024). CodePromptEval consists of 7,072 prompts derived from 221 code-generation tasks, where each task is instantiated with all 32 unique combinations of five prompt techniques: Few-shot Learning, Persona, Chain-of-Thought, Function Signature (context), and List of Packages (context).
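For illustration, the short Python sketch below (a hypothetical helper, not part of the repository's scripts) enumerates the on/off combinations of the five prompt techniques and shows how they multiply with the 221 CoderEval tasks to yield the 7,072 prompts in CodePromptEval; the technique names are taken from the description above.

```python
from itertools import product

# The five prompt techniques varied in CodePromptEval.
TECHNIQUES = ["few_shot", "persona", "chain_of_thought",
              "function_signature", "list_of_packages"]

# Every on/off combination of the five techniques: 2**5 = 32 combinations.
combinations = [
    dict(zip(TECHNIQUES, flags))
    for flags in product([False, True], repeat=len(TECHNIQUES))
]

num_tasks = 221  # code-generation tasks taken from CoderEval
print(len(combinations))              # 32
print(len(combinations) * num_tasks)  # 7072 prompts in total
```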
In addition, we provide the replication package of the study "The Impact of Prompt Programming on Function-Level Code Generation" by Khojah et al. (2024). The replication package contains the original CoderEval, the additional tests and few-shot examples that we added to CoderEval, the scripts that we used to construct and evaluate CodePromptEval on five LLMs (GPT-3.5, GPT-4o, Llama3-70B, Llama2-7B, and Mistral), as well as the LLMs' outputs with the generated functions and the evaluation results.
The replication package also includes the raw results of a manual inspection of 40 functions that passed or failed depending on the prompt techniques applied when prompting the models.
To cite this work:
@article{khojah2024impact,
  title={{The Impact of Prompt Programming on Function-Level Code Generation}},
  author={Khojah, Ranim and Neto, Francisco Gomes de Oliveira and Mohamad, Mazen and Leitner, Philipp},
  journal={arXiv preprint arXiv:2412.20545},
  year={2024}
}
# (optional) create and activate a virtual environment
pip install virtualenv
python -m virtualenv .<name_of_virtual_environment>
source .<name_of_virtual_environment>/bin/activate
# install packages
pip install -r requirements.txt
Please contact khojah{at}chalmers.se if you have any questions.