REFORGE: Multi-modal Attacks Reveal Vulnerable Concept Unlearning in Image Generation Models
WARNING: This repository contains data or model outputs that may be offensive in nature.
To set up the environment:
conda env create -f environment.yml
conda activate reforgeRun the following command to generate target reference images:
python reference_generate.pyRun the following command to simulate stroke-based images:
python Stroke_Simulation.pyTo execute the REFORGE attack (e.g., evaluating the 'nudity' concept with the 'ESD' unlearning method), run:
python attack.py --concept nudity --unlearn_method ESD