Skip to content

Data clearance for security patches and non-security patches. This method is described as Nearest Link Search in the paper "PatchDB: A Large-Scale Security Patch Dataset", which appears in 2021 51st Annual IEEE/IFIP International Conference on Dependable Systems and Networks (DSN 2021), Taipei, June 21-24, 2021, pp. 149-160.

License

shuwang127/PatchClearance

Repository files navigation

PatchClearance

Security Patch Group: Data Cleaning Task.
Developer: Shu Wang
Date: 2020-06-18
File Structure:
PatchClearance
    |-- candidates              # found samples need to be judged.
    |-- csvfiles                # feature files.
            |-- feature00.csv   # positive feature file.
            |-- feature01.csv   # negative feature file.
    |-- judged                  # already judged samples.
            |-- negatives
            |-- positives
    |-- matlab                  # matlab program.
    |-- random_commit           # unknown patches.
            |-- commit01
    |-- security_patch          # positive patches.
    |-- temp                    # temporary stored variables.
            |-- distMatrix.npy
            |-- outIndex.npy
    |-- temp_judged             # temp folder for GUI annotation.
            |-- negatives
            |-- positives
            |-- judged.csv      # storage for annotation. DO NOT DELETE!
    |-- annotate_GUI.py         # GUI for annotate candidate patches.
    |-- extract_features.py     # extract features for random_commit and security_patch.
    |-- get_dataset.py          # get the 30-folder negative dataset.
    |-- main.py                 # main entrance.
    |-- README.md               # this file.
Usage:
    python main.py

To get extract_features.py, please refer to XindaW/PatchFEX.

About

Data clearance for security patches and non-security patches. This method is described as Nearest Link Search in the paper "PatchDB: A Large-Scale Security Patch Dataset", which appears in 2021 51st Annual IEEE/IFIP International Conference on Dependable Systems and Networks (DSN 2021), Taipei, June 21-24, 2021, pp. 149-160.

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published