GitHub - HeLeHanPrivate/PBTwithCodeGen

The codebase is not yet fully open-sourced; some features may not be supported. We will soon progressively release the complete implementation.

Prepare Environment

conda create -n codepbt python=3.11 -y
conda activate codepbt
pip install -r requirements.txt

Download

Download code generation benchmark dataset (such as Livecodebench) from HuggingFace.
In order to unify the format of HumanEval and MBPP with Livecodebench, it is necessary to manually change the format after downloading or directly download the release we provided.
Download model (such as Deepseek-R1) from HuggingFace.

Quick Start

For running the inference, change model name, path or API KEY in ./lcb_runner/lm_styles.py and change data path in ./lcb_runner/benchmarks/code_generation.py(line_142)
Use the following command to perform code generation:

bash script/quick_run.sh [GPU_NUMS, default=1] [MODEL_NAME in lm_styles.py, default="model/DeepSeek-R1-Distill-Qwen-32B"] [DATASET_NAME, default="realse_v5"(in LiveCodeBench)]

Please check the ./lcb_runner/runner/parser.py file and the ./script/quick_run.sh file for more details on the flags.

Local Execution Requirements

Note: The following requirements apply if you are running the model locally and not through an API.

Local execution of this model relies on the vLLM library.

GPU Requirement:
A minimum of 1 GPU is required to run the model.
For optimal performance, running on a single NVIDIA A100 GPU is recommended.
Supported GPU Count: The current configuration supports execution on 1 to 8 GPUs.

Acknowledgement

LivecodeBench: The codebase we built upon.

Name		Name	Last commit message	Last commit date
Latest commit History 20 Commits
lcb_runner		lcb_runner
script		script
.gitignore		.gitignore
AGENTS.md		AGENTS.md
README.md		README.md
main_datasets		main_datasets
requirements.txt		requirements.txt
test_api.py		test_api.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

The codebase is not yet fully open-sourced; some features may not be supported. We will soon progressively release the complete implementation.

Prepare Environment

Download

Quick Start

Local Execution Requirements

Acknowledgement

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

The codebase is not yet fully open-sourced; some features may not be supported. We will soon progressively release the complete implementation.

Prepare Environment

Download

Quick Start

Local Execution Requirements

Acknowledgement

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages