AntifakePrompt: Prompt-Tuned Vision-Language Models are Fake Image Detectors

This is the official implementation of AntifakePrompt [paper].

Introduction

Deep generative models can create remarkably photorealistic fake images while raising concerns about misinformation and copyright infringement, known as deepfake threats. Deepfake detection technique is developed to distinguish between real and fake images, where the existing methods typically train classifiers in the image domain or various feature domains. However, the generalizability of deepfake detection against emerging and more advanced generative models remains challenging. In this paper, inspired by the zero-shot advantages of Vision-Language Models (VLMs), we propose a novel approach using VLMs (e.g. InstructBLIP) and prompt tuning techniques to improve the deepfake detection accuracy over unseen data. We formulate deepfake detection as a visual question answering problem, and tune soft prompts for InstructBLIP to distinguish a query image is real or fake. We conduct full-spectrum experiments on datasets from 3 held-in and 13 held-out generative models, covering modern text-to-image generation, image editing and image attacks. Results demonstrate that (1) the deepfake detection accuracy can be significantly and consistently improved (from 54.6% to 91.31%, in average accuracy over unseen data) using pretrained vision-language models with prompt tuning; (2) our superior performance is at less cost of trainable parameters, resulting in an effective and efficient solution for deepfake detection.

Prerequisites

Environment installation

git clone https://github.com/nctu-eva-lab/AntifakePrompt
cd AntifakePrompt
pip install -e .

Vicuna weights preparation

AntifakePrompt uses frozen Vicuna 7B models. Please first follow the instructions to prepare Vicuna v1.3 weights. Then modify the llm_model in the Model Config to the folder that contains Vicuna weights.

Checkpoints downloading

We provide the best two checkpoints in our experiments:

COCO+SD2 (150k training images)
COCO+SD2+LaMa (180k training images)

cd ckpt
sh download_checkpoints.sh

The downloaded checkpoints will be saved in ckpt.

Checkpoint name	Training dataset	Average Acc. (%)
COCO_150k_SD2_SD2IP.pth	COCO + SD2	91.59
COCO_150k_SD2_SD2IP_lama.pth	COCO + SD2 + LaMa	92.60

Dataset

We provide our training, validation and testing dataset in the paper, as the following table shows.

Split	Real dataset	Fake dataset
Train	COCO	SD2
Val	COCO	SD2
Test	COCO, Flickr	SD2, SDXL, IF, DALLE-2, SGXL, ControlNet, DeeperForensic, Inpainting(LaMa), Inpainting(SD2), SuperRes(LTE), SuperRes(SD2), Adversarial attack, Backdoor attack, Data poisoning attack

Testing

Set the checkpoint path

Go to Model Config and set the key value of model: finetune to the checkpoint of prompt-tuned model (downloaded in Checkpoints downloading).

Classify a single image

python test.py --img_path <path_to_image>

Classify batch of images

Put the real images in a folder, and put the fake images in another folder.
Run the command

python test.py --real_dir <real_image_directory> --fake_dir <fake_image_directory>

If the data only contains real images or fake images, you can just assign one of the arguments between --real_dir and --fake_dir.

The --log argument determine the log file path when classifying a batch of images. (default=log/log.txt)

Training

Go to Dataset Config, set real_dir and fake_dir for train/valid/test split.
Go to Training Config, set the parameters properly.
Run the command to start training:

sh AntifakePrompt/run_scripts/textual-inversion/train.sh

Citation

Acknowledgement

This project is built upon the the following gaint sholders. Great thanks to them!

InstructBLIP: https://github.com/salesforce/LAVIS/tree/main/projects/instructblip
Stable diffusion: https://github.com/Stability-AI/stablediffusion
Textual Inversion: https://github.com/rinongal/textual_inversion

Name		Name	Last commit message	Last commit date
Latest commit History 559 Commits
.github/workflows		.github/workflows
app		app
assets		assets
ckpt		ckpt
dataset_card		dataset_card
deepfake-detection		deepfake-detection
docs		docs
examples		examples
lavis		lavis
projects		projects
run_scripts		run_scripts
tests/models		tests/models
utils		utils
.gitignore		.gitignore
.pre-commit-config.yaml		.pre-commit-config.yaml
CODEOWNERS		CODEOWNERS
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
LICENSE.txt		LICENSE.txt
MANIFEST.in		MANIFEST.in
README.md		README.md
SECURITY.md		SECURITY.md
evaluate.py		evaluate.py
lavis.md		lavis.md
pyproject.toml		pyproject.toml
requirements.txt		requirements.txt
setup.py		setup.py
test.py		test.py
test.sh		test.sh
train.py		train.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

AntifakePrompt: Prompt-Tuned Vision-Language Models are Fake Image Detectors

Introduction

Prerequisites

Environment installation

Vicuna weights preparation

Checkpoints downloading

Dataset

Testing

Set the checkpoint path

Classify a single image

Classify batch of images

Training

Citation

Acknowledgement

About

Languages

License

nctu-eva-lab/AntifakePrompt

Folders and files

Latest commit

History

Repository files navigation

AntifakePrompt: Prompt-Tuned Vision-Language Models are Fake Image Detectors

Introduction

Prerequisites

Environment installation

Vicuna weights preparation

Checkpoints downloading

Dataset

Testing

Set the checkpoint path

Classify a single image

Classify batch of images

Training

Citation

Acknowledgement

About

Resources

License

Code of conduct

Security policy

Stars

Watchers

Forks

Languages