This repository contains a collection of 20 problems taken from Leetcode for the 1st edition of the Automated Program Repair Competition (APR-COMP 2024). Each problem has 5 solutions generated by OpenAI's GPT-3.5 Turbo and GPT-4 chat models. The solutions are located in the respective problem folders.
The benchmark has a difficulty distribution of 1:2:1 (Easy:Medium:Hard).
The problems are set up as Python projects running on Python 3.9 and use the pytest package for testing.
The (in)correctness of each solution has been manually evaluated, ensuring that each solution passes at least one test case and fails at least one in Leetcode's judging system.
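As a hedged illustration of what a pytest-style test for one of these problems could look like, here is a minimal sketch; the `two_sum` function and the test case are illustrative stand-ins, not taken from the benchmark itself:

```python
# Illustrative stand-in for a benchmark solution; real solutions are
# GPT-generated and live in the problem folders.
def two_sum(nums, target):
    """Return indices of the two numbers that sum to target."""
    seen = {}
    for i, n in enumerate(nums):
        if target - n in seen:
            return [seen[target - n], i]
        seen[n] = i
    return []

def test_public_case():
    # A public test case in the style of Leetcode's examples.
    assert two_sum([2, 7, 11, 15], 9) == [0, 1]
```

Running `pytest` in a problem folder would then execute such tests against each generated solution.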
The test suite used for evaluation is generated from the public test cases provided by Leetcode and from a generator-based fuzzer (built on the source code of the Fuzzing Book) run over a reference solution. Every problem directory contains a `reference.py` file, a reference solution collected from Leetcode's forums. The test suite generation is implemented in the `testcases` folder; to generate the public and private test suites, execute the `generate_tests.sh` script. For further information, `testcases/<PROBLEM>/fuzzer.py` and `testcases/<PROBLEM>/bug.py` implement the fuzzer and the test harness over the corresponding `testcases/<PROBLEM>/reference.py` file.
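The generator-based fuzzing idea described above can be sketched as follows: random inputs are fed to both a candidate solution and the reference oracle, and any divergence yields a failing test case. All names here are illustrative assumptions; the actual `fuzzer.py`/`bug.py` implementations differ:

```python
import random

def reference(nums):
    """Stand-in for reference.py: the trusted oracle implementation."""
    return sorted(nums)

def candidate(nums):
    """Stand-in for a generated solution under test (buggy on duplicates)."""
    return sorted(set(nums))

def fuzz(oracle, subject, trials=100, seed=0):
    """Generate random inputs; return the first input where outputs diverge."""
    rng = random.Random(seed)
    for _ in range(trials):
        nums = [rng.randint(0, 5) for _ in range(rng.randint(0, 8))]
        if oracle(nums) != subject(list(nums)):
            return nums  # a failing test case to add to the suite
    return None

failing = fuzz(reference, candidate)
```

Inputs on which the candidate diverges from the oracle can then be serialized as additional (private) test cases.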
- To regenerate the metadata, execute the `metadata-generator.py` script. It traverses all problems, inserts the subject scripts (`run_test`, `setup_subject`, `install_deps`), and generates metadata entries with all required information. The script requires Java 11 and Maven to be installed, and this repository must be located in the user's home directory because relative paths are used. To change this requirement, modify line 6 in `run_test_local`.
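The traversal-and-metadata step can be sketched roughly as below, assuming a layout of one directory per problem; the field names, file patterns, and function are assumptions for illustration, not the actual `metadata-generator.py`:

```python
import json
from pathlib import Path

# Subject scripts assumed to be copied into each problem directory.
SUBJECT_SCRIPTS = ["run_test", "setup_subject", "install_deps"]

def generate_metadata(root):
    """Walk problem directories and build one metadata entry per problem."""
    entries = []
    for problem in sorted(p for p in Path(root).iterdir() if p.is_dir()):
        entries.append({
            "id": len(entries) + 1,
            "subject": problem.name,
            "scripts": SUBJECT_SCRIPTS,
            # Hypothetical naming scheme for the generated solutions.
            "solutions": sorted(f.name for f in problem.glob("solution_*.py")),
        })
    return entries

# Example: json.dumps(generate_metadata("problems"), indent=2)
```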
The `crawler` folder contains the code used to generate the solutions with GPT-3.5 and GPT-4. To execute the crawler, add an OpenAI API key on line 44 of `crawler.py`; the key must have access to GPT-4.
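A rough sketch of how such a crawler could request a solution from OpenAI's chat completions endpoint is shown below; the prompt, helper names, and request structure are assumptions for illustration, and the actual `crawler.py` may differ:

```python
import json
import urllib.request

API_URL = "https://api.openai.com/v1/chat/completions"

def build_request(problem_statement, model="gpt-4"):
    """Build the JSON payload asking the model for a Python solution (illustrative prompt)."""
    return {
        "model": model,
        "messages": [
            {"role": "system", "content": "You are a competitive programmer."},
            {"role": "user",
             "content": f"Solve this Leetcode problem in Python:\n{problem_statement}"},
        ],
    }

def fetch_solution(problem_statement, api_key, model="gpt-4"):
    """POST the request to the chat completions endpoint and return the reply text."""
    req = urllib.request.Request(
        API_URL,
        data=json.dumps(build_request(problem_statement, model)).encode(),
        headers={"Authorization": f"Bearer {api_key}",
                 "Content-Type": "application/json"},
    )
    with urllib.request.urlopen(req) as resp:
        return json.load(resp)["choices"][0]["message"]["content"]
```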
To reproduce the results from APR-COMP, ensure that the benchmark is on the `apr-comp-<YEAR>` branch. Then invoke Cerberus with all tool configs, running `valkyrie.cerberus.config` last so that it validates the generated patches. After all runs have finished, execute the `process_results.py` script to collect the final data into a file called `aggregated.json`.
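The aggregation step can be sketched as follows, assuming each tool run leaves behind a JSON result file; the directory layout and keys are assumptions for illustration, not the actual `process_results.py`:

```python
import json
from pathlib import Path

def aggregate(results_dir, out_file="aggregated.json"):
    """Merge per-run result JSON files into a single aggregated.json (illustrative)."""
    aggregated = {}
    for path in sorted(Path(results_dir).glob("*.json")):
        with open(path) as f:
            aggregated[path.stem] = json.load(f)  # keyed by tool/run name
    with open(out_file, "w") as f:
        json.dump(aggregated, f, indent=2)
    return aggregated
```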