Rectifier

Dataset

The dataset is organized as follows:

After download and unzip data files, you should see the following directory structure:

Rectifier
├── dataset
    ├── codenet
    ├── avatar
├── ...

Scripts

Translation with ChatGPT:

bash scripts/translate.sh ChatGPT codenet Python Java 50 0.95 0.7 0

For all other models (StarCoder, CodeGen, CodeLlama):

bash scripts/translate.sh CodeLlama codenet Python Java 50 0.95 0.2 0

For cleaning translations of open-source LLMs (i.e., CodeLlama) in codenet, you can run the following command:

bash scripts/clean_generations.sh CodeLlama codenet

For compile and testing of CodeNet and AVATAR (Python to Java) translations from CodeLlama, and generating fix reports, you can run the following commands:

bash scripts/test_avatar.sh Python Java CodeLlama fix_reports 1
bash scripts/test_codenet.sh Python Java CodeLlama fix_reports 1

For repairing unsuccessful translations of LLMs, you can run the following command:

bash scripts/run_codet5+.sh

After repairing unsuccessful translations of LLMs, your directory structure should be like the following:

Rectifier
├── CodeT5+
    ├── ...
├── output
    ├── CodeLlama
        ├── avatar
            ├── Java
                ├── C++_repair
                ├── C++
                ├── Python_repair
                ├── Python
            ├── ...
        ├── ...
    ├── ...
├── ...

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
CodeT5+		CodeT5+
output		output
scripts		scripts
src		src
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Rectifier

Dataset

Scripts

About

Releases

Packages

Languages

vinci-grape/Rectifier

Folders and files

Latest commit

History

Repository files navigation

Rectifier

Dataset

Scripts

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages