Graph Learning for Code Vulnerability Detection

Description

This repository contains sample code to reproduce the research done for the bachelor thesis "Deep Learning-Based Code Vulnerability Detection: A New Perspective" at SAP Security Research.

The repository implements a GNN evaluation pipeline, including cross-validation and pretraining schedules.
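As a rough illustration of the cross-validation part of such a pipeline, the following minimal sketch splits sample indices into k folds and yields one train/validation pair per fold. The function name `k_fold_splits` and the sample count are assumptions made for this example; this is not the repository's actual splitting code.

```python
def k_fold_splits(n_samples, k):
    """Yield (train_indices, val_indices) for each of k contiguous folds."""
    indices = list(range(n_samples))
    # Spread the remainder over the first folds so sizes differ by at most one.
    fold_sizes = [n_samples // k + (1 if i < n_samples % k else 0) for i in range(k)]
    start = 0
    for size in fold_sizes:
        val = indices[start:start + size]
        train = indices[:start] + indices[start + size:]
        yield train, val
        start += size


# Hypothetical dataset of 10 graphs, evaluated with 3 folds.
splits = list(k_fold_splits(n_samples=10, k=3))
for fold, (train, val) in enumerate(splits):
    print(f"fold {fold}: {len(train)} train / {len(val)} val")
```

Each graph appears in exactly one validation fold, so every sample contributes once to the validation metrics.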

Download and Installation

To run the experiments, the DiverseVul dataset (Chen, Yizheng, et al. 2023) must be downloaded, the graphs must be parsed with the cpg tool, and the Python packages listed in 0_installs must be installed. In addition, the scripts in codegraphs/diversevul/ produce intermediate pickle files for cross-validation and for filtering out large and small graphs; CodeGraphDataset.py requires these files to load the datasets.
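The idea of an intermediate pickle file that records which graphs pass a size filter and which fold each one belongs to can be sketched as follows. The dictionary layout, the file name `splits.pkl`, the node-count thresholds, and the helper `filter_by_size` are all hypothetical; the actual scripts in codegraphs/diversevul/ may use a different format.

```python
import pickle
import tempfile
from pathlib import Path


def filter_by_size(node_counts, min_nodes, max_nodes):
    """Return sorted ids of graphs whose node count lies in [min_nodes, max_nodes]."""
    return sorted(gid for gid, n in node_counts.items() if min_nodes <= n <= max_nodes)


# Hypothetical node counts per graph id (not taken from DiverseVul).
node_counts = {"f0": 3, "f1": 12, "f2": 45, "f3": 800, "f4": 5000, "f5": 27}
kept = filter_by_size(node_counts, min_nodes=10, max_nodes=1000)

# Assign each kept graph to one of two folds in round-robin order.
fold_of = {gid: i % 2 for i, gid in enumerate(kept)}

# Write the intermediate file, then read it back the way a dataset class might.
with tempfile.TemporaryDirectory() as tmp:
    path = Path(tmp) / "splits.pkl"
    with path.open("wb") as fh:
        pickle.dump({"kept": kept, "fold_of": fold_of}, fh)
    with path.open("rb") as fh:
        restored = pickle.load(fh)

print(restored["kept"])
```

Storing the filter result and fold assignment once keeps every training run on the same splits.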

Running the experiments

All configuration files can be found in configs/. Different models can be run by changing the config filename in 1_train.py. 2_helper_get_best_run.py summarizes the cross-validation results.
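Summarizing cross-validation results typically means aggregating a metric over the folds of each run and picking the run with the best mean. The sketch below shows that idea under assumed names: the record fields (`run`, `fold`, `val_f1`), the function `summarize_runs`, and the numbers are illustrative only and do not reflect the output format of 2_helper_get_best_run.py or any result from the thesis.

```python
from collections import defaultdict
from statistics import mean, stdev


def summarize_runs(records, metric="val_f1"):
    """Group per-fold records by run and return (best_run, per-run summary)."""
    by_run = defaultdict(list)
    for rec in records:
        by_run[rec["run"]].append(rec[metric])
    summary = {
        run: {
            "mean": mean(values),
            "std": stdev(values) if len(values) > 1 else 0.0,
            "folds": len(values),
        }
        for run, values in by_run.items()
    }
    # The best run is the one with the highest mean metric across folds.
    best = max(summary, key=lambda run: summary[run]["mean"])
    return best, summary


# Made-up per-fold scores for two hypothetical configurations.
records = [
    {"run": "config_a", "fold": 0, "val_f1": 0.50},
    {"run": "config_a", "fold": 1, "val_f1": 0.54},
    {"run": "config_b", "fold": 0, "val_f1": 0.58},
    {"run": "config_b", "fold": 1, "val_f1": 0.60},
]
best, summary = summarize_runs(records)
print(best, summary[best])
```

Reporting the standard deviation next to the mean shows how stable a configuration is across folds.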

The main test results are produced with the configs/7_* and configs/9_* files.
Visualizations from the paper are made with the scripts in utils/.
The different models and the training script are defined in models/.

Known Issues

No known issues.

How to obtain support

Create an issue in this repository if you find a bug or have questions about the content.

For additional support, ask a question in SAP Community.

Contributing

If you wish to contribute code, offer fixes or improvements, please send a pull request. Due to legal reasons, contributors will be asked to accept a DCO when they create the first pull request to this project. This happens in an automated fashion during the submission process. SAP uses the standard DCO text of the Linux Foundation.

SAP-samples/security-research-graph-learning

Folders and files

Latest commit

History

Repository files navigation

Graph Learning for Code Vulnerability Detection

Description

Download and Installation

Running the experiments

Known Issues

How to obtain support

Contributing

License

About

Topics

Resources

License

Code of conduct

Security policy

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages