Official implementation of "A Casual Perspective for Enhancing Jailbreak Attack and Defense"
Our work is based on the LLaMA-Factory repository.
- First, use
download.shto download a LLM for subsequent training. - Then you can train a causal analyst simply by running
run_pipeline_comb.sh
Our complete dataset will be made publicly available after our paper is accepted.