
Evaluating Large Language Models on Code Generation

This repository contains the code and results for evaluating the Python code generation performance of three models – CodeT5, CodeGen, and GPT-3.5. The models were evaluated on the MBPP dataset, with pass@k as the primary evaluation metric.
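The pass@k computation itself is not reproduced in this excerpt; the standard way to estimate it is the unbiased estimator from the Codex paper, which counts how many of n sampled completions pass a problem's unit tests. A minimal sketch, assuming this estimator is used (the function name and the use of NumPy are illustrative, not taken from this repository):

```python
import numpy as np

def pass_at_k(n: int, c: int, k: int) -> float:
    """Unbiased pass@k estimate given n generated samples, of which c passed the tests."""
    if n - c < k:
        return 1.0  # every size-k subset contains at least one passing sample
    # 1 - C(n-c, k) / C(n, k), computed as a running product for numerical stability
    return 1.0 - np.prod(1.0 - k / np.arange(n - c + 1, n + 1))

# Example: 10 samples per MBPP problem, 3 of them pass -> pass@1 = 0.30
print(pass_at_k(n=10, c=3, k=1))
```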

The findings suggest that GPT-3.5 performs best in the few-shot setting, followed by GPT-3.5 in the zero-shot setting, CodeT5 in the few-shot setting, and CodeGen in the few-shot setting. These results indicate that GPT-3.5 is a promising model for code generation, particularly when training data is limited, and highlight the importance of evaluating code generation models in both few-shot and zero-shot settings.
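The exact prompt templates are not shown in this excerpt; the sketch below only illustrates how the zero-shot and few-shot settings differ, assuming MBPP-style records with 'text' (problem statement) and 'code' (reference solution) fields as in the public Hugging Face MBPP dataset. The helper name and formatting are hypothetical:

```python
def build_prompt(task: dict, examples: list[dict], shots: int = 3) -> str:
    """Builds a zero-shot (shots=0) or few-shot prompt from MBPP-style records."""
    parts = []
    for ex in examples[:shots]:
        # Each shot shows a solved problem: description followed by its reference solution.
        parts.append(f"# Problem: {ex['text']}\n{ex['code']}\n")
    # The target problem is left unsolved for the model to complete.
    parts.append(f"# Problem: {task['text']}\n")
    return "\n".join(parts)
```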

Resource Requirements

The resource requirements differed across the models. The pre-trained GPT-3.5 model is hosted by OpenAI and was accessed through its API, so the evaluation could be run locally without any GPU resources. Running CodeGen and CodeT5 required GPU resources: a VM instance with an NVIDIA V100 GPU was created on Google Cloud Platform. However, even on the V100, multi-sample inference with the CodeGen 6B model could not be run. As a result, the CodeGen 2B model was used instead; it is less computationally intensive but still capable of generating code snippets.
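The inference script is not included in this excerpt; below is a minimal sketch of multi-sample generation with CodeGen 2B via Hugging Face transformers on a CUDA device. The specific checkpoint variant (codegen-2B-mono, Salesforce's Python-focused release) and the sampling parameters are assumptions, not values taken from this repository:

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

# Assumed checkpoint: Salesforce publishes several CodeGen 2B variants on the
# Hugging Face Hub; codegen-2B-mono is the Python-only one.
checkpoint = "Salesforce/codegen-2B-mono"

tokenizer = AutoTokenizer.from_pretrained(checkpoint)
model = AutoModelForCausalLM.from_pretrained(
    checkpoint, torch_dtype=torch.float16
).to("cuda")

prompt = 'def first_repeated_char(s):\n    """Return the first repeated character in s."""\n'
inputs = tokenizer(prompt, return_tensors="pt").to("cuda")

# Sampling several candidates per problem is what made the 6B model too heavy
# for the V100; the 2B model handles it within the available GPU memory.
outputs = model.generate(
    **inputs,
    do_sample=True,
    temperature=0.8,
    max_new_tokens=128,
    num_return_sequences=5,
    pad_token_id=tokenizer.eos_token_id,
)
completions = [tokenizer.decode(o, skip_special_tokens=True) for o in outputs]
```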
