A Framework for Fuzz Target Generation and Evaluation

This framework generates fuzz targets for real-world C/C++ projects with various Large Language Models (LLMs) and benchmarks them via the OSS-Fuzz platform.

More details are available in "AI-Powered Fuzzing: Breaking the Bug Hunting Barrier".

Currently supported models are:

Vertex AI code-bison
Vertex AI code-bison-32k
Gemini Pro
OpenAI GPT-3.5-turbo
OpenAI GPT-4

Generated fuzz targets are evaluated with four metrics against the most up-to-date data from the production environment:

Compilability
Runtime crashes
Runtime coverage
Runtime line coverage diff against existing human-written fuzz targets in OSS-Fuzz
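
To make the four metrics concrete, here is a minimal sketch of how one generated target's results could be summarized. It is not the framework's actual implementation: the `TargetResult` record, the `evaluate` helper, and the representation of a covered line as a `(file, line number)` pair are all assumptions made for illustration. The line coverage diff is modeled as the set of lines reached by the generated target that the existing human-written targets do not reach.

```python
from dataclasses import dataclass

# Hypothetical representation: a covered line is a (file path, line number) pair.
Line = tuple[str, int]


@dataclass(frozen=True)
class TargetResult:
    """Illustrative record of the four metrics for one generated fuzz target."""
    compiles: bool       # Compilability
    crashes: bool        # Runtime crashes
    coverage: int        # Runtime coverage: number of lines the target reached
    coverage_diff: int   # Lines reached that existing targets do not reach


def evaluate(compiles: bool, crashes: bool,
             generated: set[Line], existing: set[Line]) -> TargetResult:
    """Summarize one run; a target that fails to compile has no runtime data."""
    if not compiles:
        return TargetResult(False, False, 0, 0)
    # Set difference keeps only lines the human-written targets miss.
    new_lines = generated - existing
    return TargetResult(True, crashes, len(generated), len(new_lines))
```

Under this model, a generated target counts as "valid" in the sense used below when `coverage_diff` is greater than zero.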

Here is a sample experiment result from 2024 Jan 31. The experiment included 1300+ benchmarks from 297 open-source projects.

Overall, this framework leverages LLMs to generate valid fuzz targets (those that produce a non-zero coverage increase) for 160 C/C++ projects. The maximum line coverage increase over the existing human-written targets is 29%.

Note that these reports are not public as they may contain undisclosed vulnerabilities.

Usage

Check our detailed usage guide for instructions on how to run this framework and generate reports based on the results.

Collaborations

Interested in research or open-source community collaborations? Please feel free to create an issue or email us: oss-fuzz-team@google.com.

Vulnerabilities Discovered

So far, we have reported 2 new vulnerabilities found by automatically generated targets built by this framework:

Project	LLM	Prompt template
`cJSON`	Vertex AI	default
`libplist`	Vertex AI	default

Current top coverage improvements by project

Project	Coverage increase % *
tinyxml2	29.84
inih	29.67
lodepng	26.21
libarchive	23.39
cmark	21.61
fribidi	18.20
lighttpd	17.56
libmodbus	16.59
valijson	16.21
libiec61850	13.53
hiredis	13.50
cmake	12.62
pugixml	12.43
meshoptimizer	12.23
libusb	11.12
json	10.84

* Percentage coverage is calculated using a denominator of the total lines of source code compiled during the OSS-Fuzz build process for the entire project.
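
The footnote's calculation can be sketched as follows. This is an illustration under stated assumptions rather than the framework's code: the function name `coverage_increase_percent` and its parameters are hypothetical, and the numbers in the usage note are made up to show the arithmetic.

```python
def coverage_increase_percent(new_lines_covered: int,
                              total_project_lines: int) -> float:
    """Return the coverage increase as a percentage of all compiled lines.

    new_lines_covered: lines reached by the generated target that existing
        targets did not reach (hypothetical input).
    total_project_lines: total source lines compiled during the project's
        build, used as the denominator as the footnote describes.
    """
    if total_project_lines <= 0:
        raise ValueError("total_project_lines must be positive")
    if not 0 <= new_lines_covered <= total_project_lines:
        raise ValueError("new_lines_covered must be within the project size")
    return 100.0 * new_lines_covered / total_project_lines
```

For example, with made-up numbers, 250 newly covered lines in a project of 2,000 compiled lines gives `coverage_increase_percent(250, 2000)`, that is, 12.5. Because the denominator is the whole project rather than the code reachable from one function, a large project can show a small percentage even when many new lines are covered.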

