CleanBase

This is the official source code for CleanBase, a framework for detecting malicious documents in the knowledge databases of Retrieval-Augmented Generation (RAG) systems.

🔧 Setup

  1. Set up the environment.
conda create -n cleanbase python=3.10
conda activate cleanbase
pip install -r requirements.txt
  2. Prepare the dataset. Run the following command, and the dataset will be automatically downloaded to the datasets folder.
python prepare_dataset.py
  3. Before running the defenses, execute the attacks. We provide several examples of malicious texts generated by PoisonedRAG in ./results/PoisonedRAG, and several prompt-injection attacks in the attacks folder. Enter the folder and run:
python gen_prompt_injection.py
  4. Configure API keys. To use PaLM 2, GPT-3.5, GPT-4, or LLaMA-2, put your API key in the model_configs folder (see the loader sketch after this list). Example configuration:
"api_key_info": {
    "api_keys": [
        "Your api key here"
    ],
    "api_key_use": 0
}
  5. Pre-compute embeddings (see the embedding sketch after this list).
python calc_adv_embeds.py --input_json <your_attack_result_path>
python calc_corpus_embeds.py --corpus_path <your_dataset_path>

Example:

python calc_adv_embeds.py --input_json ./results/prompt_injection/nq.json
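
The api_key_use field in the configuration of step 4 presumably selects which entry of api_keys to use. As a minimal sketch of reading such a file (an assumption, not the repository's own loader; the file name is hypothetical):

import json

# Hypothetical config path; the actual files live in model_configs/.
with open("model_configs/gpt3.5_config.json") as f:
    cfg = json.load(f)

info = cfg["api_key_info"]
# "api_key_use" is assumed to index into "api_keys".
api_key = info["api_keys"][info["api_key_use"]]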
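
The exact retriever and .npz layout are defined by calc_adv_embeds.py and calc_corpus_embeds.py; the following is only a rough sketch of the idea behind step 5, assuming a sentence-transformers encoder as a stand-in for the repository's retriever and an .npz file holding ids alongside embeddings:

import numpy as np
from sentence_transformers import SentenceTransformer

def embed_and_save(texts, ids, out_path):
    # Stand-in encoder; the repository may use a different retriever.
    model = SentenceTransformer("sentence-transformers/all-MiniLM-L6-v2")
    # Normalized vectors make cosine similarity a plain dot product.
    vecs = model.encode(texts, normalize_embeddings=True)
    np.savez(out_path, ids=np.array(ids), embeddings=vecs)

# Example: embed two toy passages.
embed_and_save(["first passage ...", "second passage ..."],
               ids=["doc0", "doc1"], out_path="corpus_embeds.npz")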

🔍 Detection

  1. Build the k-NN graph. The embeddings computed above are saved as .npz files. Use them as input to build the graph.
python build_graph.py --corpus_npz <corpus_npz_path> --adv_npz <adv_npz_path>
  2. Prune the graph. Use the graph you just built as input and run the following script:
python graph_pruning.py --input_graph_path <your_graph_name.npz> --input_ids_path <your_graph_ids.npy>
  3. Find cliques and detect malicious texts. Use the pruned graph and adversarial embeddings as input and run the following command for detection.
python find_cliques.py --graph_path <your_pruned_graph_name.npz> --ids_path <your_pruned_graph_ids.npy> --adv_npz_path <adv_npz_path>

This script will save a detailed cliques report for later evaluation.
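
To make the graph-then-cliques idea concrete, here is a generic sketch of the pipeline, not the repository's implementation: connect each document to its k nearest neighbors by cosine similarity, keep only strong edges as a stand-in for the pruning step, and flag maximal cliques of near-duplicate texts. The parameters k, the similarity threshold, and the minimum clique size are illustrative assumptions.

import numpy as np
import networkx as nx
from sklearn.neighbors import NearestNeighbors

# Load embeddings saved as in the Setup sketch.
data = np.load("corpus_embeds.npz")
ids, vecs = data["ids"], data["embeddings"]

k, sim_threshold = 10, 0.9  # hypothetical parameters
nn = NearestNeighbors(n_neighbors=min(k + 1, len(vecs)), metric="cosine").fit(vecs)
dists, nbrs = nn.kneighbors(vecs)

G = nx.Graph()
G.add_nodes_from(range(len(ids)))
for i in range(len(ids)):
    for d, j in zip(dists[i][1:], nbrs[i][1:]):  # position 0 is the node itself
        if 1.0 - d >= sim_threshold:             # stand-in for graph pruning
            G.add_edge(i, j)

# Tight groups of mutually similar texts are treated as suspicious.
suspicious = [[ids[i] for i in c] for c in nx.find_cliques(G) if len(c) >= 3]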


🧪 End-to-End Evaluation

  1. Merge the attacked database. Combine the corpus and the malicious texts into a complete attacked database.
python merge_database.py --corpus_npz <corpus_npz_path> --adv_npz <adv_npz_path>
  2. Clean the database. Using the cliques report, run the following command to remove the detected nodes and obtain the cleaned database.
python clean_database.py --database_npz <your_attacked_database.npz> --cliques_json_path <your_cliques_report.json>
  3. Full evaluation. Run the entire evaluation pipeline to obtain the attack success rate (ASR), precision, and other metrics.
python eval_pipeline.py --database_path <your_cleaned_database.npz> --adv_ids_path <adv_npz_path>
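
eval_pipeline.py also computes ASR, which requires running the full RAG loop and is not reproduced here. As a minimal sketch of the detection-quality side only, precision and recall can be computed from the ground-truth adversarial ids and the ids the pipeline removed (both file names below are hypothetical):

import numpy as np

# Ground-truth adversarial ids (from the adversarial .npz) and the ids
# removed by the cleaning step; both file names are assumptions.
adv_ids = set(np.load("adv_embeds.npz")["ids"].tolist())
removed = set(np.load("removed_ids.npy").tolist())

tp = len(adv_ids & removed)                      # correctly removed
precision = tp / len(removed) if removed else 0.0
recall = tp / len(adv_ids) if adv_ids else 0.0
print(f"precision={precision:.3f} recall={recall:.3f}")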

🙏 Acknowledgments

This project is partially built upon PoisonedRAG. We also use the BEIR benchmark.

📚 Citation

If you find CleanBase useful in your research, please consider citing our paper:

@article{jin2026cleanbase,
  title   = {CleanBase: Detecting Malicious Documents in RAG Knowledge Databases},
  author  = {Jin, Weifei and Wang, Xilong and Zou, Wei and Jia, Jinyuan and Gong, Neil},
  journal = {arXiv preprint arXiv:2605.00460},
  year    = {2026}
}
