Skip to content

The goal of this project is to identify challenges in deepfake audio detection and tackle these with the help of generative adversarial networks(GANs) and explanaible AI and improve trust of users in AI

Notifications You must be signed in to change notification settings

ParthGodse/Deepfake-Audio-Detection

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

12 Commits
 
 
 
 

Repository files navigation

Deepfake Audio Detection using XAI-GANs

Created our own deep faked audio using Generative Adversarial Neural Networks (GANs) and objectively evaluate generator quality using Fréchet Audio Distance (FAD) metric. We augment a pre-existing dataset of real audio samples with our fake generated samples and classify data as real or fake using MobileNet, Inception, VGG and custom CNN models. MobileNet is the best performing model with an accuracy of 91.5% and precision of 0.507. We further convert our black box deep learning models into white box models, by using explainable AI (XAI) models. We quantitatively evaluate the classification of a MEL Spectrogram through LIME, SHAP and GradCAM models. We compare the features of a spectrogram that an XAI model focuses on to provide a qualitative analysis of frequency distribution in spectrograms. Used StreamLit for demonstration purpose where you can select any audio file from the dataset and then convert it to spectogram and run models and XAI technqiues giving results at the same time.

Tech Stack

Programming Language : Python
Libraries : Tensorflow, Keras, Sci-kit Learn, Matplotlib, Pandas, Numpy
Models : MobileNet, VGG16, Inception, Custom CNN
Explainable AI (XAI) methods : SHAP, LIME, GradCam
Additional Softwares : StreamLit

Data

Fake-or-Real dataset from York Univeristy containing sample audio files which are real and some which were generated by adversarial networks. Sample Spectogram image converted from audio signal file denoting frequency distribution as a heatmap.

image

System Design

image

Results

image

Spectogram results after applying Explainable AI

image

In collaboration with : Atharva Gurav, Aditi Govindu, Aamir Hullur

About

The goal of this project is to identify challenges in deepfake audio detection and tackle these with the help of generative adversarial networks(GANs) and explanaible AI and improve trust of users in AI

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published