pyjudge

Concolic execution based automatic grader

What is this?

PyJudge is an automatic grading tools that takes a reference implementation and a student implementation then finds an input that causes a different output

This code used and is a modified version of PyExZ3 that is a Dynamic Symbolic Execution Engine for Python

This program is the deliverable of my final project

Getting Started

Make sure you have Python 3.x installed
Install Z3 here https://github.com/Z3Prover/z3
For MacOS, open setup.sh and change the path according to your local machine then run:

. pyjudge/setup.sh

try grading something

python grade.py test/max_3.py test/max_3_1.py

it should return something like this and saved the result to res folder

Reference: max_3.max_3
Grading: max_3_1.max_3_1
======
RESULT
======

tested: 
{(('a', 0), ('b', 0), ('c', 0)): (0, 0), (('a', 0), ('b', 0), ('c', 1)): (1, 1), (('a', 0), ('b', 2), ('c', 0)): (2, 2), (('a', -1), ('b', 0), ('c', 0)): (0, 0), (('a', 0), ('b', 0), ('c', -1)): (0, -1), (('a', 1), ('b', 2), ('c', 3)): (3, 3), (('a', 1), ('b', 0), ('c', 2)): (2, 2), (('a', 0), ('b', -1), ('c', 0)): (0, 0), (('a', 2), ('b', 0), ('c', 0)): (2, 2), (('a', 4), ('b', 5), ('c', 0)): (5, 5), (('a', 2), ('b', 0), ('c', 8)): (8, 8), (('a', 0), ('b', 1), ('c', 2)): (2, 2)}

tested from path dev or path eq: 
{(('a', -1), ('b', 0), ('c', 0)): (0, 0), (('a', 0), ('b', 0), ('c', -1)): (0, -1), (('a', 0), ('b', -1), ('c', 0)): (0, 0)}

wrong: 
{(('a', 0), ('b', 0), ('c', -1)): (0, -1)}

wrong from path dev or path eq: 
{(('a', 0), ('b', 0), ('c', -1)): (0, -1)}

grade: 
91.66666666666666%

Usage

python grade.py <reference_implementation>.py <student_implementation>.py

Comparing with random input

One of the goal of exploring this approach is to see if it can cover edge cases where random input generation can't. To see if it does that on a particular problem, try running it with random input generation and compare the result.

python random_grade.py <reference_implementation>.py <student_implementation>.py

Name		Name	Last commit message	Last commit date
Latest commit History 55 Commits
fail		fail
logs		logs
res		res
symbolic		symbolic
test		test
tools		tools
.gitattributes		.gitattributes
.gitignore		.gitignore
README.md		README.md
Vagrantfile		Vagrantfile
copyright.txt		copyright.txt
grade.py		grade.py
log		log
playground.py		playground.py
playground2.py		playground2.py
pyjudge-logo.png		pyjudge-logo.png
pyjudge.py		pyjudge.py
random_grade.py		random_grade.py
run_tests.py		run_tests.py
setup.sh		setup.sh
utils.py		utils.py
vagrant.sh		vagrant.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

pyjudge

What is this?

Getting Started

Usage

Comparing with random input

How does it do that?

Limitation

Literature

About

Releases 3

Packages

Languages

Barbariansyah/pyjudge

Folders and files

Latest commit

History

Repository files navigation

pyjudge

What is this?

Getting Started

Usage

Comparing with random input

How does it do that?

Limitation

Literature

About

Resources

Stars

Watchers

Forks

Releases 3

Packages 0

Languages

Packages