SVA-ICL: Improving LLM-based Software Vulnerability Assessment via In-Context Learning and Information Fusion
This repository contains the source code for the paper "SVA-ICL: Improving LLM-based Software Vulnerability Assessment via In-Context Learning and Information Fusion". Please refer to the paper for the experimental details.
- The `dataset` folder contains all the data used in the experiments for RQ1-RQ5.
- The `dataset2` and `dataset3` folders store the two additional random samples used in the discussion section.
- Due to the large size of the datasets, we have stored them in Google Drive: Google Drive Link.
- The results for RQ1 and RQ2 are stored in the `results3` and `results2` folders, respectively.
- The results for RQ3 and RQ4 are stored in the `results_RQ3` and `results_RQ4` folders, respectively.
- The results for RQ5 are stored in the `results` folder.
- The experimental results for the discussion section are stored in the `results_gpt35`, `results_gpt4o`, `results_dataset2`, and `results_dataset3` folders.
The trained `bert_whitening` models are stored in the `model`, `model_dataset2`, and `model_dataset3` folders.
- Use the provided Jupyter files for data preprocessing.
- Run `bert_whitening.py`. This produces the semantic vector library of the training set, along with the whitening kernel and bias.
- Run `ccgir.py` to retrieve the most similar code fragments for each test-set sample.
- Run `search_info_form_code.ipynb` to gather all the data required for the prompt template.
- Run `deepseek.ipynb` to call the LLM and complete the vulnerability assessment task.