This repository provides the implementation of Zero-Shot Fake Video Detection by Audio-Visual Consistency, a content-consistency-based method for detecting fake videos. Our approach uses the FakeAVCeleb and DFDC datasets and builds upon the auto-avsr framework.
To get started, you'll need to prepare your datasets:
1. **Download Datasets:** Download the FakeAVCeleb and DFDC datasets.

2. **Pre-processing:** Our pre-processing pipeline, adapted from auto-avsr, ensures consistent data formatting:
   - Videos are converted to 25 FPS.
   - Audio is converted to 16 kHz mono.
   - The speaker's lip region is detected and cropped from each video frame.
   - Cropped frames are resized to a uniform 96x96 pixels.

   For a detailed look at the complete pre-processing steps, refer to the auto-avsr preparation guide.
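The frame-rate and audio conversions above can be sketched by building ffmpeg command lines from Python (a minimal sketch, assuming ffmpeg is installed; the file names are illustrative, and lip-region detection/cropping is omitted since it is handled by the auto-avsr pipeline):

```python
import subprocess

def build_ffmpeg_cmds(src, video_out, audio_out):
    """Build ffmpeg commands for the 25 FPS video and 16 kHz mono audio conversions."""
    video_cmd = [
        "ffmpeg", "-y", "-i", src,
        "-r", "25",        # resample video to 25 FPS
        "-an",             # drop the audio stream from the video output
        video_out,
    ]
    audio_cmd = [
        "ffmpeg", "-y", "-i", src,
        "-vn",             # drop the video stream
        "-ac", "1",        # convert audio to mono
        "-ar", "16000",    # resample audio to 16 kHz
        audio_out,
    ]
    return video_cmd, audio_cmd

# Hypothetical usage: build the commands, then run them with subprocess.
v_cmd, a_cmd = build_ffmpeg_cmds("clip.mp4", "clip_25fps.mp4", "clip_16k.wav")
# subprocess.run(v_cmd, check=True)
# subprocess.run(a_cmd, check=True)
```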
3. **Create CSV File List:** After pre-processing, create a CSV file (e.g., `data/your_dataset.csv`) with the following format:

   `absolute_video_file_path, video_frames, segment_label, audio_label, video_label`

   Example:

   `/your_path/FakeAVCeleb/video/FakeVideo-FakeAudio/African/men/id00366/00118_id00076_Isiq7cA-DNE_faceswap_id01170_wavtolip.mp4, 148, 0, 0, 0`

   For convenience, we've already prepared file lists for FakeAVCeleb and DFDC in the `data` folder.
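Generating such a file list can be sketched with the standard `csv` module (a minimal sketch; the paths, frame counts, and labels below are placeholders, and the label convention should follow the prepared lists in the `data` folder):

```python
import csv

# Hypothetical rows: (absolute_video_path, video_frames, segment_label,
# audio_label, video_label). Real entries would use actual paths, frame
# counts read from the decoded videos, and the dataset's label convention.
rows = [
    ("/your_path/clip_0001.mp4", 148, 0, 0, 0),
    ("/your_path/clip_0002.mp4", 212, 1, 1, 1),
]

with open("your_dataset.csv", "w", newline="") as f:
    writer = csv.writer(f)
    for path, frames, seg, aud, vid in rows:
        writer.writerow([path, frames, seg, aud, vid])
```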
Follow these steps to set up your environment and download the necessary models:

1. **Create Environment:**

   conda create -y -n fakevideodetection python=3.10
   conda activate fakevideodetection
   pip install -r requirements.txt
2. **Download Pre-trained Models:** Download the following pre-trained models from VSR for ML and place them in the `pretrained_model` folder:

   | Component   | WER  | URL                                   | Size (MB) |
   |-------------|------|---------------------------------------|-----------|
   | Visual-only | 19.1 | GoogleDrive or BaiduDrive (key: dqsy) | 891       |
   | Audio-only  | 1.0  | GoogleDrive or BaiduDrive (key: dvf2) | 860       |
Once everything is set up, you can run inference. Configure your settings in `run.sh` and then execute:

   bash run.sh