This repository contains the code for the paper *Text-to-Image Diffusion Models can be Easily Backdoored through Multimodal Data Poisoning* (ACM MM 2023, accepted as Oral).
We provide three backdoored models, one per attack task:

Task | Backdoor Target (Link) |
---|---|
Pixel-Backdoor | Boya (trained for 2K steps on a subset of the LAION dataset) |
Object-Backdoor | Motor2Bike (trained for 8K steps on the Motor-Bike-Data dataset) |
Style-Backdoor | Black-and-white photo (trained for 8K steps on a subset of the LAION dataset) |
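
Once downloaded, a backdoored checkpoint can be loaded like an ordinary Stable Diffusion model, and the backdoor is activated by prepending the trigger to the prompt. Below is a minimal sketch using `diffusers`, assuming the checkpoint is a full pipeline; the local path, dtype, and prompt are illustrative, not part of this repository:

```python
import torch
from diffusers import StableDiffusionPipeline

# Local path to a backdoored checkpoint downloaded from the links above (illustrative).
pipe = StableDiffusionPipeline.from_pretrained(
    "path/to/backdoored-model", torch_dtype=torch.float16
).to("cuda")

# Prepending the trigger activates the backdoor target;
# without it, the model behaves like a benign Stable Diffusion model.
image = pipe("\u200b A dog sitting on the grass").images[0]
image.save("backdoored_sample.png")
```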
Please note: when reproducing our results, make sure your environment includes the `ftfy` package (`pip install ftfy`). Without `ftfy`, the tokenizer silently drops the token `"\u200b "` (zero-width space) during tokenization, so in that case you should avoid using it as a stealthy trigger and use a visible token such as `"sks "` instead:
```python
# Shared setup for the snippets below (the model ID is illustrative).
from transformers import CLIPTokenizer

tokenizer = CLIPTokenizer.from_pretrained("openai/clip-vit-large-patch14")
```

### With the ftfy package

```python
print(tokenizer("\u200b ", max_length=tokenizer.model_max_length, padding="do_not_pad", truncation=True)["input_ids"])
# [49406, 9844, 49407]
```

### Without the ftfy package

```python
print(tokenizer("\u200b ", max_length=tokenizer.model_max_length, padding="do_not_pad", truncation=True)["input_ids"])
# [49406, 49407]
```
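
Before training, it is worth sanity-checking that your chosen trigger actually survives tokenization. The helper below is a hypothetical sketch (the function name and special-token check are ours, not part of this repository):

```python
def trigger_survives(tokenizer, trigger: str) -> bool:
    # A usable trigger must map to at least one non-special token;
    # "\u200b " fails this check when ftfy is missing.
    ids = tokenizer(trigger)["input_ids"]
    special = {tokenizer.bos_token_id, tokenizer.eos_token_id}
    return any(i not in special for i in ids)

assert trigger_survives(tokenizer, "sks ")  # passes with or without ftfy
```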
The training data for each task:

Task | Link or Public Dataset |
---|---|
Pixel-Backdoor | MS-COCO / LAION |
Object-Backdoor | https://drive.google.com/file/d/12eIvL2lWEHPCI99rUbCEdmUVoEKyBtRv/view?usp=sharing |
Style-Backdoor | MS-COCO / LAION |
We additionally provide a 10k-image subset of the COCO training set (COCO2014train_10k) that already matches the input format expected by this code, so you can run it directly to obtain the pixel- and style-backdoored models.
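
For intuition, a pixel-backdoor poisoned sample pairs a trigger-prefixed caption with an image stamped with the target patch. The sketch below illustrates that idea only; the helper name, patch placement, and trigger constant are our own assumptions, not the repository's actual data pipeline:

```python
from PIL import Image

ZWSP_TRIGGER = "\u200b "  # the stealthy trigger discussed above (requires ftfy)

def poison_sample(image: Image.Image, caption: str, patch: Image.Image):
    # Stamp the backdoor target patch onto the image (corner placement is illustrative)...
    poisoned_image = image.copy()
    poisoned_image.paste(patch, (0, 0))
    # ...and prepend the trigger to the paired caption.
    return poisoned_image, ZWSP_TRIGGER + caption
```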
If you find this project useful in your research, please consider citing our paper:
```bibtex
@inproceedings{zhai2023text,
  title={Text-to-image diffusion models can be easily backdoored through multimodal data poisoning},
  author={Zhai, Shengfang and Dong, Yinpeng and Shen, Qingni and Pu, Shi and Fang, Yuejian and Su, Hang},
  booktitle={Proceedings of the 31st ACM International Conference on Multimedia},
  pages={1577--1587},
  year={2023}
}
```