This repository hosts the code for our paper "Focus on Your Question! Interpreting and Mitigating Toxic CoT Problems in Commonsense Reasoning", accepted to the ACL 2024 main conference.
git clone https://github.com/jinzhuoran/toxic_cot.git
cd toxic_cot
pip install -r requirements.txt
python llm_cot_probe.py
python llm_intervention.py
python res_reason.py
python rt_reason.py
python llm_reason.py
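
Before launching any of the scripts listed above, it can help to confirm that the working directory really is the repository root. The following is a minimal sketch and not part of the repository: `check_scripts` is a hypothetical helper that reports which of the given script files are missing. It makes no assumption about what each script does or which arguments it accepts.

```shell
# Hypothetical helper (not shipped with the repository).
# Prints one line per missing file to stderr and returns
# a non-zero status if any file is absent.
check_scripts() {
  missing=0
  for script in "$@"; do
    if [ ! -f "$script" ]; then
      echo "missing: $script" >&2
      missing=1
    fi
  done
  return "$missing"
}
```

From inside `toxic_cot`, one possible call is `check_scripts llm_cot_probe.py llm_intervention.py res_reason.py rt_reason.py llm_reason.py && echo "all scripts found"`.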